Gene Mlg_0903 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0903 
Symbol 
ID4269047 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1021568 
End bp1022794 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content67% 
IMG OID638125655 
Productflagellar hook-associated protein 3 
Protein accessionYP_741747 
Protein GI114320064 
COG category[N] Cell motility 
COG ID[COG1344] Flagellin and related hook-associated proteins 
TIGRFAM ID[TIGR02550] flagellar hook-associated protein 3 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.432658 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGATTGT CCACCAGCCA ACTGGCCCTG CAGGGGGTGA ACAGCATCCT CTCCCAGCAG 
GCCTCGCTGT CGAAGACCCA GGCGCAACTG GCCAGCGGCC GGCAGATCCT GACCCCGTCG
GACGACCCGG CCGGCGCCTC CCGCATCCTG GAGCTGGAGA AGGCCATCAA TACCGTGGAG
CGGTACAACC GCAACGCCGA CCAGGCGGAG ACCCGGTTGG GGCTATCGGA GAACATCCTC
AACGAGTTTG GCAACACCCT GCAGCGGGTG CGCGAGCTCT CCGTGCAGGC GGCCAACGGC
TCGCAGGACC GGGAGACCCG GGCCTACATC GCCTCTGAGC TGCGCCAGGC CCAGGATCAA
CTCGTACAGC TCGCCAACAC CGATGACGGC AACGGCGAGT TCCTGTTCGC CGGCTCCGAG
ACCCGCACCC AGCCCTTCAC CAAGACCGCG GGCGGGAAGG TGGTGTACAA CGGCGATCAG
GGCCAGCGGG AGGTACGTAT CGGCCCCTCG CGTACGCTGG CGGTGGACAG CTCGGGCTTC
GACGCCTTCA TGAAGATCCC CAACGGCAAT GGGGATTACC AGGCCCGCGA GGCGCAGGGG
AACAGCGGCA GCGGCATCAT CACCGTCGGC GACTCGCCCG CCCTGGTCCA GCCCGGGGAG
GCGTACACCA TCGCCTTCGA ACAGGACGGG GCGGGAGGCA TGACCTACCG GGTGCTGGAT
GGCGACGATC AGCCGGTGGC GGTGGACGGC GAGCCCGTCA CCGGGCCCTA CGAGCCCGGT
ATGACCCTCC GGTTTCCCGC CGAGAGCGGC CAGTCGCTGC AGGTGAAGCT GGACGGCCGG
CCGGAGGAGG GCGACCGGTT CGAGGTGAGC GCGGCCAGGC CTCAGTCTGT ATTCGAGACC
GTCAACAACC TCATCCGCAC CCTGGAAGAC GACGGTGACG GTCCGGCCCT GAATAATGCC
GTCAACCGCT TTCTGGCCGA CATCGACCAG GGGATGGAGA ACATCATCCG GGTACGCTCG
GAGCTGGGGG CCCGGCTGAA CACCCTGGAC GCCAGTCGCG ATGCCAACGA GGGCGCCCTG
CTGGACCTGA ATGCCGCGAA ATCCAGGTTG GCGGACCTGG ACTACGCCGA GGCCACCGGT
CGCTTCAACC AGGAGCTGGT GGGGCTGCAG GCCGCGCAGC AGACCTATAC CCGACTGCAG
GGGCTGTCGC TCTTCGAGTT CATCTAG
 
Protein sequence
MRLSTSQLAL QGVNSILSQQ ASLSKTQAQL ASGRQILTPS DDPAGASRIL ELEKAINTVE 
RYNRNADQAE TRLGLSENIL NEFGNTLQRV RELSVQAANG SQDRETRAYI ASELRQAQDQ
LVQLANTDDG NGEFLFAGSE TRTQPFTKTA GGKVVYNGDQ GQREVRIGPS RTLAVDSSGF
DAFMKIPNGN GDYQAREAQG NSGSGIITVG DSPALVQPGE AYTIAFEQDG AGGMTYRVLD
GDDQPVAVDG EPVTGPYEPG MTLRFPAESG QSLQVKLDGR PEEGDRFEVS AARPQSVFET
VNNLIRTLED DGDGPALNNA VNRFLADIDQ GMENIIRVRS ELGARLNTLD ASRDANEGAL
LDLNAAKSRL ADLDYAEATG RFNQELVGLQ AAQQTYTRLQ GLSLFEFI