Gene Mlg_0781 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0781 
Symbol 
ID4269598 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp868404 
End bp870677 
Gene Length2274 bp 
Protein Length757 aa 
Translation table11 
GC content67% 
IMG OID638125531 
ProductTonB-dependent receptor 
Protein accessionYP_741625 
Protein GI114319942 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.877086 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCAATG CTGCACACAG GCGGATCCGG CCACCCCTTC CTGCCCTTGG CGCAACCGCT 
GCGCTGCTCA CTATTCAGCC CCTTGCCACC ACCGACGCCG GTGAGCCGCC GGAATTGCCG
CCCATACGGG TCGAGTCCAC CACCATCGAC GACCGCTTCG CCGGTGACGA GACCAATCCC
ACCGCCACCT CCTTCGTCCG CGGTGACGAG GTGGACGCCG CCGCCGCCGA GCATATCGAG
CAGGTCCTGC GCCGTATCCC GGGGCTGACC GCCGACACCC GCACCGGCTC CGATGACGCG
GTCAAGATCA AGCTGCGGGG CGTCGAGGGC CAGCGCTACA TGGGCGAGGA CCCGGGGGTG
GCCATCATCA TCGATGGCGT GCCGGTGAAG GAGGACACCG GCCGGGTCAA TATCGATATG
GATAACATCG AGTCCATCCG GGTGATCCGG GGCAGCGCCT CTTACCTCTA CGGTGAAGAC
GCCCTGGCCG GTGCCGTGGT GATCACCACC AAGCGCGGGG CCGACATGGC CGGCTTTCGG
ACCGAGGGGG CCCTGGGCAG CCACGGGACG CAGCGCTGGT TGGGCCGGGC GGGGTATGCG
GACGAGCGGT TCAATGCTTA CCTGCAGGCC TCGCACCGGG AGTCGGACGG CTGGCATGCC
CGCGCCGGCT ACGAGGCCGA CTACCTGAAC GGCAAGCTGC AGTACTACCT CGACGACTAC
AGCGATATCA CCTTCAACTT TGAGCACACG GACCGGTTTC GCGATGAAAC CGGCACGGTC
CGTGGCCGCA GCCAGGCGGA GCGCGATCCG CGTAACCGGG AGGCGCAGCG CGGCTCCTAC
ACCCGTAACT TCCACCTGGA CCTGGCGCGC TACTCGGTCA CCTACATGCG CGACCTGCGC
GCCGCCGGCG AACTGACCCT GAGCGGTTAC CGCTTCACCG ACGAGACCTG GAACTGGTAC
GCGCCCATGC GCTACGACGG CGACGGCAAC GCGGTGAACG ACGCCGATCT CTACGAGAGC
CGCAGCGACA AGGAGCAGGT GCAACGCGGG CTCAAGGCCG AGTGGCGCAA CGACGGCCAG
CGGTTCGCCG GGCTGTTGGG GCTGGACCTG CGCGAGAACA CCTACGAGCA GGAGAACCAT
TACATCAATG ACAACAAGCC GAGTCCGTCG CCCTTTGCGC CGGTGCGGGA GGCGGGGACG
CGGAGCGCGG ACCACGAGAC CGATGAGACC ATCCGCGCCG CCTATGGCGA GTTGAAGTGG
CGGCTGGCCC CGCGCTGGAC CCTGACCGGA AACGCCCGCT TCGATCACAC CCGCCTGGAG
CACAGCGACC ACATGGAGGG CCGGTCGCTG GACCGCAGCT TCAACGTCTG GTCCTGGCGG
GCGGGCAGCG CCTTTCAGGC CAGTGACCGG CTGACCTTCT TTGCCAACGC CTCCACCGGG
TTCCGCAACC CCACCGTGGG CCAGTTGTTC GCTGGTGGCT TTGAGGACGA CACCGCCGGC
AACCCCGACC TGGACCCGGA GGAGGTGATC AACCTCGAGC TGGGGCTGCG TGCCCGGACC
CGCTGGCTGG GGCAGCCGGT GCGCGCCGAG CTGACCCTGT TTCAGATGGA CCGGGACGAC
TTCATCCTCC GCCAGGCCGG GCAATACGCC ACCACCGCCG AGGAACAGGA GGCCCGATAC
GAGAACATCG GCGGTGCCCG CCACCGCGGT CTTGAACTGG CCCTGAGCGG CGAGGCGGGG
CAGCGGGTGA GCTGGGGGGC GGCCTACACC CTGCTCGATG CCCGGTTCAC CGATTACGAC
AATTTCAACC TGGCGCTGGG GAACGCCCGC GGGGAGTTCC TGGGCGAGTG CGACCAGGTC
ACCCTGGACG ACCCGCGCAG CCAGTACTGC GTGGAACGCC ACGACAACAG CGGCAACCGC
ATCCCGCGGG TACCGCGCCA CACCCTCAAC CTGCTGGTGG ATTTCCACCT GACCCCGGCC
TTCACCCTGA CCACGGAGAG CCACACGGTC TCCTCCTGGT TCGCAGATGA ACTCAACGAG
TTCGAACTCT CCGGTCACAC CGTCTTCAAC CTGGTGGGCA ACTACCGCCA GCGCCTGGGC
CGGGTGGATC TGCGCGCGTT CCTGCGCGTG GACAACGTCT TCGACCAGTG GCACTACGAG
CGCGCCTCGG CCTTCTACGA CAACAACCAG GACGGCGAGT ACGACTGGGA GGACGTCACC
TTCCTGGTGA ACCCCGGGCG CACCTGGACC GCCGGTCTGG AGGCGCGCTT CTGA
 
Protein sequence
MINAAHRRIR PPLPALGATA ALLTIQPLAT TDAGEPPELP PIRVESTTID DRFAGDETNP 
TATSFVRGDE VDAAAAEHIE QVLRRIPGLT ADTRTGSDDA VKIKLRGVEG QRYMGEDPGV
AIIIDGVPVK EDTGRVNIDM DNIESIRVIR GSASYLYGED ALAGAVVITT KRGADMAGFR
TEGALGSHGT QRWLGRAGYA DERFNAYLQA SHRESDGWHA RAGYEADYLN GKLQYYLDDY
SDITFNFEHT DRFRDETGTV RGRSQAERDP RNREAQRGSY TRNFHLDLAR YSVTYMRDLR
AAGELTLSGY RFTDETWNWY APMRYDGDGN AVNDADLYES RSDKEQVQRG LKAEWRNDGQ
RFAGLLGLDL RENTYEQENH YINDNKPSPS PFAPVREAGT RSADHETDET IRAAYGELKW
RLAPRWTLTG NARFDHTRLE HSDHMEGRSL DRSFNVWSWR AGSAFQASDR LTFFANASTG
FRNPTVGQLF AGGFEDDTAG NPDLDPEEVI NLELGLRART RWLGQPVRAE LTLFQMDRDD
FILRQAGQYA TTAEEQEARY ENIGGARHRG LELALSGEAG QRVSWGAAYT LLDARFTDYD
NFNLALGNAR GEFLGECDQV TLDDPRSQYC VERHDNSGNR IPRVPRHTLN LLVDFHLTPA
FTLTTESHTV SSWFADELNE FELSGHTVFN LVGNYRQRLG RVDLRAFLRV DNVFDQWHYE
RASAFYDNNQ DGEYDWEDVT FLVNPGRTWT AGLEARF