Gene EcSMS35_3356 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3356 
Symbol 
ID6142707 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3433542 
End bp3435005 
Gene Length1464 bp 
Protein Length487 aa 
Translation table11 
GC content55% 
IMG OID641618185 
Productanion transporter 
Protein accessionYP_001745335 
Protein GI170681701 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0471] Di- and tricarboxylate transporters 
TIGRFAM ID[TIGR00785] anion transporter 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000525621 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACCTT CCACTGAATG GTGGCGATAC CTTGCGCCGC TGGCGGTCAT CGCCATTATT 
GCTCTAATTC CGGTTCCCGC AGGGCTGGAG AGTCATACCT GGCTCTACTT TGCTGTTTTT
ACTGGCGTGA TCGTTGGACT GATCCTCGAA CCCGTGCCGG GTGCCGTGGT GGCGATGGTG
GGTATCTCCA TTATCGCCAT TCTCTCTCCC TGGCTGCTGT TCAGCCCGGA GCAGCTCGCC
CAGCCAGGTT TTAAGTTCAC CGCCAAATCC CTCTCCTGGG CGGTTTCTGG TTTTTCTAAT
TCGGTTATCT GGCTGATTTT CGCCGCCTTT ATGTTTGGCA CAGGCTATGA AAAAACCGGG
CTGGGGCGAC GTATCGCGCT GATTCTGGTG AAAAAGATGG GGCATCGCAC GCTGTTTCTC
GGCTATGCGG TGATGTTCTC CGAGCTTATC CTGGCACCTG TAACACCGTC CAACTCGGCG
CGTGGTGCGG GGATTATCTA TCCCATCATC CGTAACCTGC CACCGCTCTA TCAATCACAA
CCAAACGACA GCAGTTCGCG CAGCATTGGC TCGTACATCA TGTGGATGGG GATTGTTGCC
GACTGCGTGA CCAGCGCCAT TTTCCTGACG GCGATGGCAC CTAACTTGCT GTTAATTGGC
CTGATGAAAA GCGCATCTCA CGCCACACTG AGTTGGGGCG ACTGGTTCCT CGGGATGTTG
CCGCTCAGTA TTTTACTGGT TCTGCTGGTT CCCTGGCTGG CTTACGTGCT GTACCCGCCG
GTACTGAAGT CTGGTGATCA GGTGCCGCGC TGGGCAGAGA CGGAACTGCA GGCAATGGGC
CCGCTCTGTT CGCGTGAAAA ACGGATGCTG GGGCTGATGG TAGGCGCGCT GGTGCTGTGG
ATTTTCGGCG GTGATTATAT CGATGCCGCG ATGGTCGGTT ACAGCGTGGT GGCGCTGATG
CTGCTTCTGC GCATTATCAG TTGGGACGAC ATTGTCAGTA ATAAAGCCGC GTGGAACGTT
TTCTTCTGGC TGGCCTCGCT TATCACCCTC GCTACCGGAC TCAACAACAC CGGTTTTATT
AGCTGGTTTG GCAAACTGTT AGCAGGCAGC TTAAGCGGTT ATTCGCCAAC GATAGTGATG
GTGGCGTTGA TTGTGGTGTT TTATCTACTG CGCTACTTTT TCGCCAGCGC CACGGCGTAT
ACCTCCGCTC TCGCGCCGAT GATGATTGCC GCCGCGCTGG CGATGCCGGA AATCCCGCTG
CCGGTATTCT GCCTGATGGT TGGCGCGGCA ATTGGTCTGG GGAGCATTCT TACGCCATAC
GCCACCGGAC CCAGCCCGAT TTACTACGGT AGTGGTTATC TGCCAACGGT GGATTACTGG
CGACTGGGGG CAATTTTTGG GCTGATATTC CTCGTATTGC TGGTGATTAC CGGCTTACTG
TGGATGCCCG TGGTGTTGCT TTAA
 
Protein sequence
MKPSTEWWRY LAPLAVIAII ALIPVPAGLE SHTWLYFAVF TGVIVGLILE PVPGAVVAMV 
GISIIAILSP WLLFSPEQLA QPGFKFTAKS LSWAVSGFSN SVIWLIFAAF MFGTGYEKTG
LGRRIALILV KKMGHRTLFL GYAVMFSELI LAPVTPSNSA RGAGIIYPII RNLPPLYQSQ
PNDSSSRSIG SYIMWMGIVA DCVTSAIFLT AMAPNLLLIG LMKSASHATL SWGDWFLGML
PLSILLVLLV PWLAYVLYPP VLKSGDQVPR WAETELQAMG PLCSREKRML GLMVGALVLW
IFGGDYIDAA MVGYSVVALM LLLRIISWDD IVSNKAAWNV FFWLASLITL ATGLNNTGFI
SWFGKLLAGS LSGYSPTIVM VALIVVFYLL RYFFASATAY TSALAPMMIA AALAMPEIPL
PVFCLMVGAA IGLGSILTPY ATGPSPIYYG SGYLPTVDYW RLGAIFGLIF LVLLVITGLL
WMPVVLL