Gene Csal_1968 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_1968 
SymbolflgL 
ID4027208 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp2223146 
End bp2224378 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content62% 
IMG OID637967164 
Productflagellar hook-associated protein FlgL 
Protein accessionYP_574019 
Protein GI92114091 
COG category[N] Cell motility 
COG ID[COG1344] Flagellin and related hook-associated proteins 
TIGRFAM ID[TIGR02550] flagellar hook-associated protein 3 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTATCA GCACCGTAAC GATGTACGAG CAGGGCGTTT CGGCAATGAA TCGCCAGCAG 
CAGAACTTCA TGGACGTCGG CCAGCAGATC GCGTCCGGCA AGCGGGTGGT GAACCCGTCC
GACGACCCCC GTGCCGCCGC ACGGGCGGTG AGCGTGTCGC AGTCGCTGGC AGTCAATGCG
CAGCAGGAAA GCAGCCGGGT GACGGCACGC AATTCGTTGA GCCAGGAAGA GAGCGTCCTC
AACAGTGTCA GCGATGCCAT CGGCTCGGCC AAGTCCCTGG TCGTGCAGGC CGGTAACGGC
ACTCTGAGCG ATGCCGATCG CGAATCGCTG GCGTCCGATC TCGAGGGCGC GTTCGAGACG
CTGGTGGGGC TTGCCAATAC CACCGACGGC AACGGTACTT ACCTGTTCAG CGGCTATCAG
GACAACGCCA AGGCCTTCTC GCGTACCGAT GCGGGCGATG CTGTCGACAC CATCTCGTAT
GAAGGCGATC AGGGCGTCAA GCAGCAGAAG ATCGATGCCG AACGCCTCAT GAAGACCAGC
GATACCGGCA CCGATGTATT CATGCGCTTC TCGGCGGGCA GCGAATATAT CGCCGAAGCC
GATGAGGGCA ATACGGGAAA CGTGACCTTT TCCGGCCCTG ACGTTCGCGA TGCCGATGCC
GCCGGTTACG GCGAGACCTT CGACATCAGT TTCAACGGCG ATGGTACCTA TGACATTTCA
AGCTCCGGGG CGGGGTTTGC CGACCAGACG AACGTTGCCT ACACCGACGG CGAGACCATC
GAGTTCGGTG GCATGGCGTT GACGCTGGAG GGCGAGCCGG CGGCCGGTGA CTCGTTTACC
GTCACGCCGG GAGGCGACAT GAGTCAGGAG CAGGCCAGCC TGTTCAAGAC CATCGGCGAT
ACCATCAATG CCCTGCGTCA GCCCGTCGAG ACCGATGCCG ATCAAGCCGC GCTGGATAAC
ACGCTGTCCA CCGCGAGCCG CAAGCTGGAT GCCTCGCTGG ACAACGTGCT GACCACGCGG
GCCTCGGTGG GTGCGCGGAT GAACGAACTG GACGCGCTGG ACGACGTTGG CGGCAACCGC
GAAATCGCCT ACGAACAGAC GCGTTCCGAT CTCGTCGATC TGGATTACAA CACGGCGATT
TCCGACTACA TGCTGAGCCA GGTCGGGCTG CAGGCATCGC AGAAATCCTT CGCCGACATT
CAGCAGATGT CGCTGTTCCA GTTCCTCAAC TGA
 
Protein sequence
MRISTVTMYE QGVSAMNRQQ QNFMDVGQQI ASGKRVVNPS DDPRAAARAV SVSQSLAVNA 
QQESSRVTAR NSLSQEESVL NSVSDAIGSA KSLVVQAGNG TLSDADRESL ASDLEGAFET
LVGLANTTDG NGTYLFSGYQ DNAKAFSRTD AGDAVDTISY EGDQGVKQQK IDAERLMKTS
DTGTDVFMRF SAGSEYIAEA DEGNTGNVTF SGPDVRDADA AGYGETFDIS FNGDGTYDIS
SSGAGFADQT NVAYTDGETI EFGGMALTLE GEPAAGDSFT VTPGGDMSQE QASLFKTIGD
TINALRQPVE TDADQAALDN TLSTASRKLD ASLDNVLTTR ASVGARMNEL DALDDVGGNR
EIAYEQTRSD LVDLDYNTAI SDYMLSQVGL QASQKSFADI QQMSLFQFLN