Gene EcSMS35_3307 details

Gene Information

Locus tag: EcSMS35_3307
Symbol:
ID: 6147460
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3382660
End bp: 3383964
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 46%
IMG OID: 641618136
Product: TRAP transporter, DctM subunit
Protein accession: YP_001745286
Protein GI: 170682721
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1593] TRAP-type C4-dicarboxylate transport system, large permease component
TIGRFAM ID: [TIGR00786] TRAP transporter, DctM subunit
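
The coordinate and length fields above can be cross-checked with a few lines of arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3382660
end_bp = 3383964
gene_length_bp = 1305
protein_length_aa = 434

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which does not
# contribute a residue to the protein.
codons = gene_length_bp // 3
residues = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a whole number of codons:", gene_length_bp % 3 == 0)
print("residues match protein length:", residues == protein_length_aa)
```

Under these assumptions all three checks hold for this record: the span is 1305 bp, which is 435 codons, giving 434 residues plus the stop codon.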


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.512903
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value: 0.00482175
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGGACTTTG AATATATCTA CCCCGTCTTA ATTTTATTTG GCAGTTTTGC CGTCATGCTG 
GCAATCGGTG TGCCAATTAC TTTTGCGATT GGTCTTTCTT CGCTGTTATC TATTATTACT
GCCTTACCAC CCGATGCCGC CATTTCTGTG ATTTCGCAAA AGATGACTGT GGGGCTGGAT
GGCTTTACGC TATTAGCCAT TCCCTTCTTC GTGTTAGCCG GAAACATTAT GAATACCGGT
GGTATAGCCA GACGACTGGT TAACCTGGCG CAAGCATTAG TTGGGCGTCT TCCTGGCTCA
CTGGCTCATT GTAATATCCT CGCGAATACG CTGTTTGGTG CGATTTCAGG TTCAGCCGTT
GCGTCAGCCG CCGCGGTAGG TGGAATTATG TCACCACTGC AAGAAAAAGA GGGCTATGAT
CCGGCGTTTT CAGCAGCGGT TAATATTGCC TCTGCCCCCA TTGGCCTGAT GATCCCACCG
AGCAATGTGT TGATTGTTTA TTCCCTCGCC AGTGGCGGGA CTTCTGTTGC TGCGTTGTTC
TTAGCCGGAT ACTTGCCAGG CATTCTCACC GCTGCTGCTT TAATGTTTGT GGCGGCACTT
TATGCGCGAC GTAACCATTA TCCGGTGGCC GAACGTATCA ATTTTCATCA ATTTTTGCAG
GTATTCCGCG AATCAATTCC CAGCCTGATG CTTATTTTTA TCATTATTGG CGGTATTATC
GCAGGGGTAT TCACGCCCAC GGAAGCATCG GCAATTGCGG TAATTTATAG TTTAGCCCTG
GCGATGATTT ACCGGGAAAT CACTTTTAAG AAGCTCAATG ATATTCTGTT AGATTCGGTA
GTAACCAGTT CAATTGTTCT GTTACTGGTA GGCTGCTCGA TGGGGATGTC ATGGGCCATG
ACGAATGCTG ATGTTCCTGA GTTGATCAAC GAACTGATTA CCAGTGTTTC GGATAACAAA
TGGGTTATTC TGTTTATCAT CAATATCATT CTGTTGATCG TCGGTACCTT TATGGATATC
ACACCGGCGA TCCTGATATT TACGCCTATT TTCCTGCCGA TCGCTCAGCA TCTGGGAATA
GATCCCATCC ACTTCGGTAT TATTATGGTG TTCAACTTGA CCATTGGCCT TTGTACACCA
CCCGTTGGCA CCATTCTGTT TGTTGGTTGC AGCATTGGTA AGGTCAGTAT CGACAGGGCA
ATAAAACCAT TACTGCCGAT GTTTCTGGCA TTGTTTGTAG TAATGGCAAT TATTTGTTAT
TTCCCGCAGC TTAGTCTGAT GCTGCCAGGA TTATTTTCGA CCTGA
 
Protein sequence
MDFEYIYPVL ILFGSFAVML AIGVPITFAI GLSSLLSIIT ALPPDAAISV ISQKMTVGLD 
GFTLLAIPFF VLAGNIMNTG GIARRLVNLA QALVGRLPGS LAHCNILANT LFGAISGSAV
ASAAAVGGIM SPLQEKEGYD PAFSAAVNIA SAPIGLMIPP SNVLIVYSLA SGGTSVAALF
LAGYLPGILT AAALMFVAAL YARRNHYPVA ERINFHQFLQ VFRESIPSLM LIFIIIGGII
AGVFTPTEAS AIAVIYSLAL AMIYREITFK KLNDILLDSV VTSSIVLLLV GCSMGMSWAM
TNADVPELIN ELITSVSDNK WVILFIINII LLIVGTFMDI TPAILIFTPI FLPIAQHLGI
DPIHFGIIMV FNLTIGLCTP PVGTILFVGC SIGKVSIDRA IKPLLPMFLA LFVVMAIICY
FPQLSLMLPG LFST
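
The gene and protein sequences above can be compared by translating the nucleotide rows codon by codon. The following is a minimal sketch under stated assumptions, not the method used to produce this record. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in which start codons it permits, and this sketch does not handle alternative start codons). The `translate` helper and the constant names are hypothetical. Only the first and last rows of the gene sequence are included, to keep the example short.

```python
from itertools import product

# Codon table built in the conventional TCAG order.
# Assumption: amino-acid assignments of the standard code, which
# translation table 11 also uses.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace, stopping at a stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Walk complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence, copied from this record.
first_row = "ATGGACTTTG AATATATCTA CCCCGTCTTA ATTTTATTTG GCAGTTTTGC CGTCATGCTG"
last_row = "TTCCCGCAGC TTAGTCTGAT GCTGCCAGGA TTATTTTCGA CCTGA"

print(translate(first_row))
print(translate(last_row))
```

Because each full row holds 60 bases (20 codons), every row starts on a codon boundary and can be translated on its own. The first row yields the opening 20 residues of the protein sequence, and the last row yields its final 14 residues before the TGA stop codon.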