Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3307 |
Symbol | |
ID | 6147460 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3382660 |
End bp | 3383964 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 641618136 |
Product | TRAP transporter, DctM subunit |
Protein accession | YP_001745286 |
Protein GI | 170682721 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1593] TRAP-type C4-dicarboxylate transport system, large permease component |
TIGRFAM ID | [TIGR00786] TRAP transporter, DctM subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.512903 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.00482175 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGACTTTG AATATATCTA CCCCGTCTTA ATTTTATTTG GCAGTTTTGC CGTCATGCTG GCAATCGGTG TGCCAATTAC TTTTGCGATT GGTCTTTCTT CGCTGTTATC TATTATTACT GCCTTACCAC CCGATGCCGC CATTTCTGTG ATTTCGCAAA AGATGACTGT GGGGCTGGAT GGCTTTACGC TATTAGCCAT TCCCTTCTTC GTGTTAGCCG GAAACATTAT GAATACCGGT GGTATAGCCA GACGACTGGT TAACCTGGCG CAAGCATTAG TTGGGCGTCT TCCTGGCTCA CTGGCTCATT GTAATATCCT CGCGAATACG CTGTTTGGTG CGATTTCAGG TTCAGCCGTT GCGTCAGCCG CCGCGGTAGG TGGAATTATG TCACCACTGC AAGAAAAAGA GGGCTATGAT CCGGCGTTTT CAGCAGCGGT TAATATTGCC TCTGCCCCCA TTGGCCTGAT GATCCCACCG AGCAATGTGT TGATTGTTTA TTCCCTCGCC AGTGGCGGGA CTTCTGTTGC TGCGTTGTTC TTAGCCGGAT ACTTGCCAGG CATTCTCACC GCTGCTGCTT TAATGTTTGT GGCGGCACTT TATGCGCGAC GTAACCATTA TCCGGTGGCC GAACGTATCA ATTTTCATCA ATTTTTGCAG GTATTCCGCG AATCAATTCC CAGCCTGATG CTTATTTTTA TCATTATTGG CGGTATTATC GCAGGGGTAT TCACGCCCAC GGAAGCATCG GCAATTGCGG TAATTTATAG TTTAGCCCTG GCGATGATTT ACCGGGAAAT CACTTTTAAG AAGCTCAATG ATATTCTGTT AGATTCGGTA GTAACCAGTT CAATTGTTCT GTTACTGGTA GGCTGCTCGA TGGGGATGTC ATGGGCCATG ACGAATGCTG ATGTTCCTGA GTTGATCAAC GAACTGATTA CCAGTGTTTC GGATAACAAA TGGGTTATTC TGTTTATCAT CAATATCATT CTGTTGATCG TCGGTACCTT TATGGATATC ACACCGGCGA TCCTGATATT TACGCCTATT TTCCTGCCGA TCGCTCAGCA TCTGGGAATA GATCCCATCC ACTTCGGTAT TATTATGGTG TTCAACTTGA CCATTGGCCT TTGTACACCA CCCGTTGGCA CCATTCTGTT TGTTGGTTGC AGCATTGGTA AGGTCAGTAT CGACAGGGCA ATAAAACCAT TACTGCCGAT GTTTCTGGCA TTGTTTGTAG TAATGGCAAT TATTTGTTAT TTCCCGCAGC TTAGTCTGAT GCTGCCAGGA TTATTTTCGA CCTGA
|
Protein sequence | MDFEYIYPVL ILFGSFAVML AIGVPITFAI GLSSLLSIIT ALPPDAAISV ISQKMTVGLD GFTLLAIPFF VLAGNIMNTG GIARRLVNLA QALVGRLPGS LAHCNILANT LFGAISGSAV ASAAAVGGIM SPLQEKEGYD PAFSAAVNIA SAPIGLMIPP SNVLIVYSLA SGGTSVAALF LAGYLPGILT AAALMFVAAL YARRNHYPVA ERINFHQFLQ VFRESIPSLM LIFIIIGGII AGVFTPTEAS AIAVIYSLAL AMIYREITFK KLNDILLDSV VTSSIVLLLV GCSMGMSWAM TNADVPELIN ELITSVSDNK WVILFIINII LLIVGTFMDI TPAILIFTPI FLPIAQHLGI DPIHFGIIMV FNLTIGLCTP PVGTILFVGC SIGKVSIDRA IKPLLPMFLA LFVVMAIICY FPQLSLMLPG LFST
|
| |