Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4674 |
Symbol | |
ID | 6144848 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4771451 |
End bp | 4772422 |
Gene Length | 972 bp |
Protein Length | 323 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 641619490 |
Product | TRAP transporter solute receptor DctP family protein |
Protein accession | YP_001746598 |
Protein GI | 170681376 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component |
TIGRFAM ID | [TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 0.841471 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATAA AAGTATCCGC TGGCATTATC GGTGCTGTTC TTATGTTATC CGCAAGCCAG TCCTGGGCAG TGACATTAAA ACTGAGTCAT AATCAGGATA AGTCTCATCC TGTTCATAAA GCGATGGAGT TCTTTGCGAA AAAGAGCAAA GAGTACTCTA ACGGTGATAT TACTATTCGT ATTTATCCAA ATGGAACATT GGGTACTCAA CGAGAAACAA TGGAGCTGAT TCGTTCTGGC GCTATTCCAC TGGTAAAAAC CAACGCGGCA GAAATGGAAG CATTTGAAAA TTCCTATAAA TTATTTAGCC TGCCTTATTT GTTCCGCGAT CGTGATCATT ATTATCAGGT CATGCAGGGC GATATCGGGA GAAAAATCCT CGACTCAACG AAAAGCAAAG GTTATTTCGG GCTGACTTTT TATGATGGAG GCGCCCGCAG TTTCTACGGC AATAAACCAG TACTGAAACC AGACGATCTC AAAGGCATGA AAGTCCGTGT CCAGCCAAGT CCTGGCGCAG TTGAAATGAT CAAAGTCATG GGCGGTAACC CGACGCCACT GGATTACGGC GAGTTGTATA CAGCCTTACA GCAGGGTGTG GTCGATATGG CAGAAAACAG CGTGATGGCG CTGACCACCA TGCGTCACGG TGAAGTGGCA AAATCCTTCA GCCTTGACGA ACACACTATG GTTCCCGATG TGGTTCTGAT GAGCAATGCT GCGTTTGATA AACTTAGCCC GGAAAATCAG GCAGTTATAT TAAAAGCAGC TAAAGAATCA ATGAGCTACA TGAAAGACTT GTGGAGCGAG GAAGAGAAAC AAGAATTTGC AAAACTGGAT AAAATGGGCG TGAAAGTCTA CCAGGTAGAT AAAGCTCCGT TTATCGAGAA AGTACAGCCA ATGTACGCAA ACTTCGCTAA GGACAACCCA GCCCTTGCCC CAATGCTGGC TGATATTCAG GCAGCTAAGT AA
|
Protein sequence | MKIKVSAGII GAVLMLSASQ SWAVTLKLSH NQDKSHPVHK AMEFFAKKSK EYSNGDITIR IYPNGTLGTQ RETMELIRSG AIPLVKTNAA EMEAFENSYK LFSLPYLFRD RDHYYQVMQG DIGRKILDST KSKGYFGLTF YDGGARSFYG NKPVLKPDDL KGMKVRVQPS PGAVEMIKVM GGNPTPLDYG ELYTALQQGV VDMAENSVMA LTTMRHGEVA KSFSLDEHTM VPDVVLMSNA AFDKLSPENQ AVILKAAKES MSYMKDLWSE EEKQEFAKLD KMGVKVYQVD KAPFIEKVQP MYANFAKDNP ALAPMLADIQ AAK
|
| |