Gene Mlg_0700 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0700 
Symbol 
ID4268859 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp781683 
End bp783143 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content61% 
IMG OID638125449 
Productflagellin domain-containing protein 
Protein accessionYP_741544 
Protein GI114319861 
COG category[N] Cell motility 
COG ID[COG1344] Flagellin and related hook-associated proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.255578 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACAGG TAATCAACAC CAACATCGCT TCTCTTAATG CGCAGCGGAA TCTGAACGCA 
TCGCAGAACC AACTGAATGT GTCGCTGGAG CGGCTCTCGT CCGGGCTGAG GATCAACAGC
GCCAAGGATG ATGCCGCGGG TCTCGCCATC TCCGAGCGAT TCACCGCTCA GATCAACGGC
AACAACCAAG CGGTCCGGAA CGCCAACGAT GGCATCTCCC TCTCTCAGAC CGCCGAAGGG
GCCTTGGAAG AGATCGGGAA CATCGGGCAG CGCATTCGTG AGCTGGCCGT CCAGGCAGCC
AATGACACCA ACTCCGCGTC GGACCGCCAG GCGTTGAACA ACGAGGTCCA GCAGCTGATT
GCGGAGGCCG GGCGTATCGC CCAGGCGACC CAGTTCAACG ATCAGAACGT GTTGGACGGC
AGCCTGGAGG AGATCCTGTT CCAAGTGGGT GCCAATCGTG GTCAAACCAT TTCGGTGGAT
GGCGTGGATG CCCGCAGTGA CCAACTGGGT GCCCAGCTCT TTGCCGGTGA TGACATTGAC
TTCACGGAAC TGGTGGATAA CGCGAATGCC ACCACCGCGA GCTTTAACAT CACCGATGAT
CTGACCATCA ACGGTGAAGC GGTGGATCTG TCCGGCATCG AGGCCGAGGA CGGTTTCGCG
GATGTGGATG ACATCGTTGC TGCGATCAAC GCCGTCTCTG GTGATACCGG CGTCGAGGCC
GATAGGGCGC TGGAGGTCGA AGCCACCATT GATGCCAGTG GGCACACTGC CAGTAATACC
GGGCTGGCCT TCTCGCTCAA TGGGGTTGAT ATCACCGTGG ACGGCACGGC GAGCCAGACC
GAGGCCGCTA CCAGGCTGGC CGATGCAATC AATGAGGCCG GAGTGAGTGG GGTCAGTGCT
CAGATCGATC CGGATGATGA CACCCAAGTG GTTCTCAGTG GGGACGGGGC TGATCTCCAA
TTCACCCAAG GCGCGGATTC GCAGGGGATC ACCTTTGTCG ACGGGGCCGA TCTCGGTGCA
AACGAAGATG ATATCGCAGG GTTTGCCCGG GCCATTGAGT TGACTGGGCA GCTGGGTGAG
TCGGTCGACG TACAGGGTAC GGATGTCGGA AACCTGGGTC TGGATAACCT GGGTGACGGG
GAGGAGAAAA CACTGGCCAG CGTAGATATC TCCACCCGTG ATAATGCCTC CGACGCCATC
CGGTCCATGG ACTTTGTCCT CGCCCAGGTA AACAGCCTTC GGGCCGACCT CGGTGCGGTC
CAGAACCGCT TCGAGTCCAC GGTCGCCAAC CTGTCGGTGA CCTCGGAGAA CCTGGAGGCC
TCCCGCAGCC GGATCCTGGA CGCGGACTTC GCCGCTGAGA CCGCGGCCCT GACCCGCAGC
CAGATCCTGC AGCAGGCTGG CACCTCGGTC CTGGCGCAGG CCAACCAGCT CCCGAACAAC
GTGCTGGCCC TGCTCCAGTA A
 
Protein sequence
MAQVINTNIA SLNAQRNLNA SQNQLNVSLE RLSSGLRINS AKDDAAGLAI SERFTAQING 
NNQAVRNAND GISLSQTAEG ALEEIGNIGQ RIRELAVQAA NDTNSASDRQ ALNNEVQQLI
AEAGRIAQAT QFNDQNVLDG SLEEILFQVG ANRGQTISVD GVDARSDQLG AQLFAGDDID
FTELVDNANA TTASFNITDD LTINGEAVDL SGIEAEDGFA DVDDIVAAIN AVSGDTGVEA
DRALEVEATI DASGHTASNT GLAFSLNGVD ITVDGTASQT EAATRLADAI NEAGVSGVSA
QIDPDDDTQV VLSGDGADLQ FTQGADSQGI TFVDGADLGA NEDDIAGFAR AIELTGQLGE
SVDVQGTDVG NLGLDNLGDG EEKTLASVDI STRDNASDAI RSMDFVLAQV NSLRADLGAV
QNRFESTVAN LSVTSENLEA SRSRILDADF AAETAALTRS QILQQAGTSV LAQANQLPNN
VLALLQ