Gene Information |
Locus tag | Mlg_0781 |
Symbol | |
ID | 4269598 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 868404 |
End bp | 870677 |
Gene Length | 2274 bp |
Protein Length | 757 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 638125531 |
Product | TonB-dependent receptor |
Protein accession | YP_741625 |
Protein GI | 114319942 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | |
|
|
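The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the gene length counts the terminal stop codon; the record does not state either convention explicitly. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(868404, 870677)
print(length, protein_length(length))  # 2274 757
```

Under these assumptions the computed values match the Gene Length (2274 bp) and Protein Length (757 aa) fields of Mlg_0781.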
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.877086 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCAATG CTGCACACAG GCGGATCCGG CCACCCCTTC CTGCCCTTGG CGCAACCGCT GCGCTGCTCA CTATTCAGCC CCTTGCCACC ACCGACGCCG GTGAGCCGCC GGAATTGCCG CCCATACGGG TCGAGTCCAC CACCATCGAC GACCGCTTCG CCGGTGACGA GACCAATCCC ACCGCCACCT CCTTCGTCCG CGGTGACGAG GTGGACGCCG CCGCCGCCGA GCATATCGAG CAGGTCCTGC GCCGTATCCC GGGGCTGACC GCCGACACCC GCACCGGCTC CGATGACGCG GTCAAGATCA AGCTGCGGGG CGTCGAGGGC CAGCGCTACA TGGGCGAGGA CCCGGGGGTG GCCATCATCA TCGATGGCGT GCCGGTGAAG GAGGACACCG GCCGGGTCAA TATCGATATG GATAACATCG AGTCCATCCG GGTGATCCGG GGCAGCGCCT CTTACCTCTA CGGTGAAGAC GCCCTGGCCG GTGCCGTGGT GATCACCACC AAGCGCGGGG CCGACATGGC CGGCTTTCGG ACCGAGGGGG CCCTGGGCAG CCACGGGACG CAGCGCTGGT TGGGCCGGGC GGGGTATGCG GACGAGCGGT TCAATGCTTA CCTGCAGGCC TCGCACCGGG AGTCGGACGG CTGGCATGCC CGCGCCGGCT ACGAGGCCGA CTACCTGAAC GGCAAGCTGC AGTACTACCT CGACGACTAC AGCGATATCA CCTTCAACTT TGAGCACACG GACCGGTTTC GCGATGAAAC CGGCACGGTC CGTGGCCGCA GCCAGGCGGA GCGCGATCCG CGTAACCGGG AGGCGCAGCG CGGCTCCTAC ACCCGTAACT TCCACCTGGA CCTGGCGCGC TACTCGGTCA CCTACATGCG CGACCTGCGC GCCGCCGGCG AACTGACCCT GAGCGGTTAC CGCTTCACCG ACGAGACCTG GAACTGGTAC GCGCCCATGC GCTACGACGG CGACGGCAAC GCGGTGAACG ACGCCGATCT CTACGAGAGC CGCAGCGACA AGGAGCAGGT GCAACGCGGG CTCAAGGCCG AGTGGCGCAA CGACGGCCAG CGGTTCGCCG GGCTGTTGGG GCTGGACCTG CGCGAGAACA CCTACGAGCA GGAGAACCAT TACATCAATG ACAACAAGCC GAGTCCGTCG CCCTTTGCGC CGGTGCGGGA GGCGGGGACG CGGAGCGCGG ACCACGAGAC CGATGAGACC ATCCGCGCCG CCTATGGCGA GTTGAAGTGG CGGCTGGCCC CGCGCTGGAC CCTGACCGGA AACGCCCGCT TCGATCACAC CCGCCTGGAG CACAGCGACC ACATGGAGGG CCGGTCGCTG GACCGCAGCT TCAACGTCTG GTCCTGGCGG GCGGGCAGCG CCTTTCAGGC CAGTGACCGG CTGACCTTCT TTGCCAACGC CTCCACCGGG TTCCGCAACC CCACCGTGGG CCAGTTGTTC GCTGGTGGCT TTGAGGACGA CACCGCCGGC AACCCCGACC TGGACCCGGA GGAGGTGATC AACCTCGAGC TGGGGCTGCG TGCCCGGACC CGCTGGCTGG GGCAGCCGGT GCGCGCCGAG CTGACCCTGT TTCAGATGGA CCGGGACGAC TTCATCCTCC GCCAGGCCGG GCAATACGCC ACCACCGCCG AGGAACAGGA GGCCCGATAC GAGAACATCG GCGGTGCCCG CCACCGCGGT CTTGAACTGG CCCTGAGCGG CGAGGCGGGG CAGCGGGTGA GCTGGGGGGC GGCCTACACC CTGCTCGATG CCCGGTTCAC CGATTACGAC 
AATTTCAACC TGGCGCTGGG GAACGCCCGC GGGGAGTTCC TGGGCGAGTG CGACCAGGTC ACCCTGGACG ACCCGCGCAG CCAGTACTGC GTGGAACGCC ACGACAACAG CGGCAACCGC ATCCCGCGGG TACCGCGCCA CACCCTCAAC CTGCTGGTGG ATTTCCACCT GACCCCGGCC TTCACCCTGA CCACGGAGAG CCACACGGTC TCCTCCTGGT TCGCAGATGA ACTCAACGAG TTCGAACTCT CCGGTCACAC CGTCTTCAAC CTGGTGGGCA ACTACCGCCA GCGCCTGGGC CGGGTGGATC TGCGCGCGTT CCTGCGCGTG GACAACGTCT TCGACCAGTG GCACTACGAG CGCGCCTCGG CCTTCTACGA CAACAACCAG GACGGCGAGT ACGACTGGGA GGACGTCACC TTCCTGGTGA ACCCCGGGCG CACCTGGACC GCCGGTCTGG AGGCGCGCTT CTGA
|
Protein sequence | MINAAHRRIR PPLPALGATA ALLTIQPLAT TDAGEPPELP PIRVESTTID DRFAGDETNP TATSFVRGDE VDAAAAEHIE QVLRRIPGLT ADTRTGSDDA VKIKLRGVEG QRYMGEDPGV AIIIDGVPVK EDTGRVNIDM DNIESIRVIR GSASYLYGED ALAGAVVITT KRGADMAGFR TEGALGSHGT QRWLGRAGYA DERFNAYLQA SHRESDGWHA RAGYEADYLN GKLQYYLDDY SDITFNFEHT DRFRDETGTV RGRSQAERDP RNREAQRGSY TRNFHLDLAR YSVTYMRDLR AAGELTLSGY RFTDETWNWY APMRYDGDGN AVNDADLYES RSDKEQVQRG LKAEWRNDGQ RFAGLLGLDL RENTYEQENH YINDNKPSPS PFAPVREAGT RSADHETDET IRAAYGELKW RLAPRWTLTG NARFDHTRLE HSDHMEGRSL DRSFNVWSWR AGSAFQASDR LTFFANASTG FRNPTVGQLF AGGFEDDTAG NPDLDPEEVI NLELGLRART RWLGQPVRAE LTLFQMDRDD FILRQAGQYA TTAEEQEARY ENIGGARHRG LELALSGEAG QRVSWGAAYT LLDARFTDYD NFNLALGNAR GEFLGECDQV TLDDPRSQYC VERHDNSGNR IPRVPRHTLN LLVDFHLTPA FTLTTESHTV SSWFADELNE FELSGHTVFN LVGNYRQRLG RVDLRAFLRV DNVFDQWHYE RASAFYDNNQ DGEYDWEDVT FLVNPGRTWT AGLEARF
|
| |
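The sequences above are printed in space-separated blocks of ten characters, so they need to be normalized before any computation. The following is a minimal sketch of that step together with a GC-fraction calculation of the kind summarized in the GC content field. The helper names are hypothetical, and the sample covers only the first 30 bases of the gene sequence, so its result is not expected to equal the 67% reported for the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks of the gene sequence from the record above.
sample = clean_sequence("ATGATCAATG CTGCACACAG GCGGATCCGG")
print(len(sample), round(gc_fraction(sample), 3))  # 30 0.567
```

Passing the complete Gene sequence field through `clean_sequence` and then `gc_fraction` would give the gene-wide value to compare against the GC content field.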