Gene Information |
Locus tag | EcSMS35_1720 |
Symbol | |
ID | 6144855 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1728170 |
End bp | 1730272 |
Gene Length | 2103 bp |
Protein Length | 700 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641616596 |
Product | TonB-dependent receptor |
Protein accession | YP_001743774 |
Protein GI | 170680750 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | [TIGR01783] TonB-dependent siderophore receptor |
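
The coordinate and length fields in this record are linked by simple arithmetic, sketched below. This is an illustrative check, not part of the source database: the helper names `gene_length` and `protein_length` are hypothetical. It assumes the start and end coordinates are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon. The values in this record are consistent with both assumptions.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1728170, 1730272)
print(length)                  # 2103
print(protein_length(length))  # 700
```

Both results match the Gene Length and Protein Length fields listed above.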
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.000201404 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAGATTT TTTCCGTCCG ACAGACCGTT TTGCCCGCAC TGCTTGTCCT TTCCCCCGTT GTTTTTGCCG CTGATGAACA GACCATGATT GTCAGTGCCG CGCCGCAGGT GGTTTCAGAA CTGGATACGC CAGCAGCAGT AAGCGTGGTG GATGGCGAGG AGATGCGCCT GGCAACACCG CGCATTAACT TGTCTGAATC ACTGACTGGC GTGCCTGGTT TGCAGGTACA AAACCGGCAG AACTATGCGC AAGATTTACA GCTGTCGATT CGCGGATTTG GCTCCCGCTC CACTTACGGA ATTCGCGGTA TTCGCCTGTA TGTGGACGGT ATTCCCGCCA CCATGCCCGA CGGGCAAGGG CAAACATCCA ACATCGATTT AAGCAGTGTG CAAAATGTGG AAGTGCTGCG TGGCCCCTTC TCTGCTCTGT ATGGCAACGC GTCTGGCGGT GTAATGAATG TCATCACCCA GACCGGACAA CAGCCACCAA CCATTGAAGC CAGTAGTTAC TACGGCAGTT TTGGCAGCTG GCGCTATGGG CTGAAAGCAA CAGGCGCAAC GGGCGACGGC ACACAGCCTG GCGATGTCGA TTACACCGTC TCAACCACGC GTTTTACGAC CCACGGCTAT CGTGACCATA GTGGCGCACA GAAAAATTTA GCCAATGCCA AACTGGGCGT TCGCATTGAT GAAGCCAGCA AATTAAGCCT GATTTTCAAT AGTGTGGATA TCAAAGCAGA TGACCCAGGT GGGCTAACCG AAGCAGAATG GAAGGCGAAT CCGCAACAAG CGCCACGCGC TGAACAGTAC GACACGCGAA AAACCATCAA GCAAACTCAG GCTGGGTTGC GCTATGAACG TAGCCTGAGC GCGCAAGATG ATATGAGTGT AATGATGTAT GCCGGAGAGC GAGAAACGAC CCAGTACCAG TCAATCCCGA TGGCCCCTCA ACTTAATCCG TCGCATGCGG GCGGCGTGAT TACTCTGCAA CGCCATTATC AGGGGATAGA CAGCCGCTGG ACACACCATG GCGAGCTGGG TGTTCCGGTC ACGTTCACTA CTGGCCTGAA CTACGAAAAC ATGAGTGAAA ACCGCAAGGG CTACAATAAC TTCCGCCTGA ATAGCGGCGT GCCGGAATAC GGGCAAAAAG GTGAGTTACG TCGCGACGAA CGCAATCTGA TGTGGAATGT CGATCCCTAT TTACAGACAC AGTGGCAGCT GAGCGAAAAA CTGTCGCTGG ATGCTGGCGT GCGCTACAGC TCCGTATGGT TTGATTCCAA CGACCATTAC GTTACTCCGG GTAACGGCGA TGACAGCGGT GATGCCAGTT ATCACAAATG GCTACCTGCC GGTTCGTTAA AATATGCAAT GAACGATGCC TGGAATATCT ATCTGGCAGC CGGGCGTGGT TTTGAAACGC CGACGATTAA TGAACTGTCT TATCGCGCTG ATGGGCAAAG CGGTATGAAC TTTGGTTTAA AACCATCTAC CAACGATACA ATTGAGATCG GCAGTAAAAC GCGTATTGGT GATGGGCTGC TGAGTCTCGC ATTGTTCCAG ACCGACACCG ATGATGAAAT TGTTGTCGAT AGCAGTAGCG GTGGGCGTAC CACATATAAA AATGCCGGAA AGACCCGTCG TCAAGGCGCT GAACTGGCAT GGGATCAACG TTTCGCGGGA GATTTTCGCG TAAAAGCGTC CTGGACCTGG CTTGATGCGA CCTATCGCAG CAATGTGTGC AATGAACAGG ATTGTAACGG TAACCGGATG CCAGGGATCG CCCGTAATAT GGGCTTTGCA 
TCGATAGGTT ATGTACCGGA AGAAGGCTGG TATGCAGGCA CGGAAGCGCG ATATATGGGC GATATTATGG CAGATGATGA AAATACGGCC AAAGCGCCGT CTTATACTCT CGTCGGCTTA TTCACCGGGT ATAAATACAA TTACCACAAT TTGACTGTGG ATTTATTTGG TCGTGTCGAT AATTTATTCG ATAAAGGATA CGTTGGTTCT GTCATTGTCA ATGAGTCAAA CGGTCGATAT TACGAACCTG CGCCCGGGCG AAATTATGGT GTCGGCATGA ATATTGCGTG GCGATTTGAG TAA
|
Protein sequence | MKIFSVRQTV LPALLVLSPV VFAADEQTMI VSAAPQVVSE LDTPAAVSVV DGEEMRLATP RINLSESLTG VPGLQVQNRQ NYAQDLQLSI RGFGSRSTYG IRGIRLYVDG IPATMPDGQG QTSNIDLSSV QNVEVLRGPF SALYGNASGG VMNVITQTGQ QPPTIEASSY YGSFGSWRYG LKATGATGDG TQPGDVDYTV STTRFTTHGY RDHSGAQKNL ANAKLGVRID EASKLSLIFN SVDIKADDPG GLTEAEWKAN PQQAPRAEQY DTRKTIKQTQ AGLRYERSLS AQDDMSVMMY AGERETTQYQ SIPMAPQLNP SHAGGVITLQ RHYQGIDSRW THHGELGVPV TFTTGLNYEN MSENRKGYNN FRLNSGVPEY GQKGELRRDE RNLMWNVDPY LQTQWQLSEK LSLDAGVRYS SVWFDSNDHY VTPGNGDDSG DASYHKWLPA GSLKYAMNDA WNIYLAAGRG FETPTINELS YRADGQSGMN FGLKPSTNDT IEIGSKTRIG DGLLSLALFQ TDTDDEIVVD SSSGGRTTYK NAGKTRRQGA ELAWDQRFAG DFRVKASWTW LDATYRSNVC NEQDCNGNRM PGIARNMGFA SIGYVPEEGW YAGTEARYMG DIMADDENTA KAPSYTLVGL FTGYKYNYHN LTVDLFGRVD NLFDKGYVGS VIVNESNGRY YEPAPGRNYG VGMNIAWRFE
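
The gene and protein sequences above can be cross-checked by translating the gene sequence and computing its GC content. The following is a minimal sketch, not the database's own procedure. The function names are hypothetical. It assumes that the spaces in the displayed sequences are only 10-character grouping, and that the amino acid assignments of translation table 11 are the same as those of the standard genetic code (the two tables differ in their permitted start codons, which does not matter here because the CDS starts with ATG).

```python
from itertools import product

# Codon table built in TCAG order; assumed valid for translation table 11,
# whose amino acid assignments match the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove grouping whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGAAGATTT TTTCCGTCCG ACAGACCGTT"))  # MKIFSVRQTV
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after removing its spaces) gives an end-to-end check, and `gc_content` can be compared against the GC content field.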