Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_4765 |
Symbol | |
ID | 5586156 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 4758793 |
End bp | 4759764 |
Gene Length | 972 bp |
Protein Length | 323 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640928376 |
Product | TRAP transporter solute receptor DctP family protein |
Protein accession | YP_001465704 |
Protein GI | 157158149 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component |
TIGRFAM ID | [TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.844835 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATAA AAGTATCCGC TGGCATTATC GGTGCTGTTC TTATGTTATC CGCAAGCCAG TCCTGGGCAG TGACATTAAA ACTGAGTCAT AATCAGGATA AGTCTCATCC TGTTCATAAA GCGATGGAGT TCTTTGCGAA AAAGAGCAAA GAGTACTCTA ACGGTGATAT TACTATTCGT ATTTATCCAA ATGGAACATT GGGTACTCAA CGAGAAACAA TGGAGCTGAT TCGTTCTGGC GCTATTCCCC TGGTAAAAAC CAATGCGGCA GAAATGGAAG CATTTGAAAA TTCCTATAAA TTATTTAGCC TGCCTTATTT GTTCCGCGAT CGTGATCATT ATTATCAGGT CATGCAGGGC GATATCGGGA GAAAAATCCT CGACTCAACG AAAAGCAAAG GTTATTTCGG GCTGACTTTT TATGATGGAG GCGCCCGCAG TTTCTATGGC AATAAACCAG TACTGAAACC AGACGATCTG AAAGGCATGA AAGTCCGTGT CCAGCCAAGC CCTGGCGCAG TTGAAATGAT CAAAGTCATG GGCGGTAACC CGACGCCACT GGATTACGGC GAGTTGTATA CAGCCTTACA GCAGGGTGTG GTCGATATGG CAGAAAACAG CGTGATGGCG CTGACCACCA TGCGTCACGG TGAAGTGGCA AAATCCTTCA GCCTTGACGA ACACACTATG GTTCCCGATG TGGTTCTGAT GAGCAATGCT GCGTTTGATA AACTTAGCCC GGAAAATCAG GCAGTTATAT TAAAAGCAGC TAAAGAATCA ATGAGCTATA TGAAAGACTT GTGGAGCGAG GAAGAGAAAC AAGAATTTGC GAAACTGGAT AAAATGGGCG TGAAAGTCTA CCAGGTAGAT AAAGCTCCGT TTATCGAGAA AGTACAGCCA ATGTACGCAA ACTTCGCTAA GGACAACCCA GCCCTTGCCC CAATGCTGGC TGATATTCAG GCAGCTAAGT AA
|
Protein sequence | MKIKVSAGII GAVLMLSASQ SWAVTLKLSH NQDKSHPVHK AMEFFAKKSK EYSNGDITIR IYPNGTLGTQ RETMELIRSG AIPLVKTNAA EMEAFENSYK LFSLPYLFRD RDHYYQVMQG DIGRKILDST KSKGYFGLTF YDGGARSFYG NKPVLKPDDL KGMKVRVQPS PGAVEMIKVM GGNPTPLDYG ELYTALQQGV VDMAENSVMA LTTMRHGEVA KSFSLDEHTM VPDVVLMSNA AFDKLSPENQ AVILKAAKES MSYMKDLWSE EEKQEFAKLD KMGVKVYQVD KAPFIEKVQP MYANFAKDNP ALAPMLADIQ AAK
|
| |