Gene EcSMS35_0793 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0793 
Symbol 
ID6142643 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp793898 
End bp795331 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content50% 
IMG OID641615681 
Productanion transporter 
Protein accessionYP_001742873 
Protein GI170683868 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0471] Di- and tricarboxylate transporters 
TIGRFAM ID[TIGR00785] anion transporter 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.575691 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAAGA AATCGTTATG GAAGCTAATT CTGATATTAG CAATCCCATG TATTATTGGC 
TTTATGCCAG CTCCAGCAGG ATTAAGCGAA CTGGCGTGGG TGCTTTTTGG TATTTACCTG
GCGGCCATTG TGGGGCTGGT TATCAAGCCT TTCCCGGAAC CTGTCGTACT GTTAATTGCC
GTCGCCGCAT CGATGGTAGT GGTGGGTAAC TTATCAGGTG GGGAATTTAA AACCACCGCT
GTATTAAGCG GTTACTCTTC CGGTACCACC TGGCTGGTGT TTTCTGCGTT TACTTTAAGC
GCCGCGTTTG TAACCACAGG CTTAGGTAAA CGTATTGCCT ATCTGCTGAT TGGTAAAATT
GGTAGCACTA CCCTGGGTCT GGGTTACGTT ACGGTATTTC TCGATCTGGT ACTGGCTCCG
GCAACACCGT CTAACACCGC GCGTGCGGGC GGCATCGTGT TACCGATCAT CAACAGCGTG
GCAGTGGCTT TGGGATCAGA ACCGGAAAAA AGTCCGCGTC GTGTTGGACA TTACCTGATG
ATGTCCATTT ACATGGTCAC CAAAACCACC AGCTATATGT TCTTTACCGC AATGGCGGGG
AACATTCTGG CGCTGAAAAT GATCAACGAC ATTCTGCACC TGCAAATTAG CTGGGGTGGA
TGGGCGCTAG CCGCCGGATT GCCTGGCATC ATTATGCTGC TGGTCACCCC GCTGGTGATT
TACACCATGT ATCCGCCAGA AATTAAGAAG GTGGATAACA AAACCATCGC CAAAGCGGGC
CTTGCCGAAC TGGGACCGAT GAAAATCCGC GAAAAAATGC TGCTCGGTGT CTTCGTGCTG
GCGCTGCTGG GCTGGATTTT CAGTAAGTCA CTGGGGGTTG ATGAATCCAC CGTGGCAATC
GTTGTTATGG CGACTATGCT GCTGCTGGGT ATCGTTACCT GGGAAGACGT GGTTAAAAAT
AAAGGCGGCT GGAATACCTT AATCTGGTAC GGCGGTATTA TCGGCTTAAG CTCCTTATTA
TCGAAAGTTA AATTCTTCGA ATGGTTAGCT GAAGTCTTTA AAAATAACCT GGCATTTGAT
GGTCACGGTA ACGTTGCTTT CTTCGTTATT ATTTTCCTCA GCATCATCGT GCGTTATTTC
TTCGCTTCCG GTAGTGCCTA TATCGTTGCC ATGTTACCGG TATTTGCCAT GCTGGCGAAC
GTCTCCGGCG CGCCGTTAAT GTTAACCGCG CTGGCACTGT TGTTCTCTAA CTCCTATGGC
GGCATGGTTA CTCACTATGG CGGCGCGGCA GGTCCGGTCA TCTTTGGCGT GGGTTACAAC
GATATTAAAT CCTGGTGGTT GGTCGGTGCG GTACTGACGA TATTAACCTT CCTGGTGCAT
ATCACCCTCG GCGTGTGGTG GTGGAATATG CTGATCGGCT GGAACATGCT GTAA
 
Protein sequence
MNKKSLWKLI LILAIPCIIG FMPAPAGLSE LAWVLFGIYL AAIVGLVIKP FPEPVVLLIA 
VAASMVVVGN LSGGEFKTTA VLSGYSSGTT WLVFSAFTLS AAFVTTGLGK RIAYLLIGKI
GSTTLGLGYV TVFLDLVLAP ATPSNTARAG GIVLPIINSV AVALGSEPEK SPRRVGHYLM
MSIYMVTKTT SYMFFTAMAG NILALKMIND ILHLQISWGG WALAAGLPGI IMLLVTPLVI
YTMYPPEIKK VDNKTIAKAG LAELGPMKIR EKMLLGVFVL ALLGWIFSKS LGVDESTVAI
VVMATMLLLG IVTWEDVVKN KGGWNTLIWY GGIIGLSSLL SKVKFFEWLA EVFKNNLAFD
GHGNVAFFVI IFLSIIVRYF FASGSAYIVA MLPVFAMLAN VSGAPLMLTA LALLFSNSYG
GMVTHYGGAA GPVIFGVGYN DIKSWWLVGA VLTILTFLVH ITLGVWWWNM LIGWNML