Gene EcSMS35_0631 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0631 
Symbol 
ID6143864 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp644517 
End bp645947 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content53% 
IMG OID641615523 
Productsodium:sulfate symporter family protein 
Protein accessionYP_001742729 
Protein GI170680678 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0471] Di- and tricarboxylate transporters 
TIGRFAM ID[TIGR00785] anion transporter 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones64 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCGCCCC TGGTGGTGAT GGGTGTCATG TTTCTTATCC CTGTACCTGA CGGTATGCCG 
CCGCAGGCGT GGCATTACTT TGCAGTGTTT GTGGCAATGA TTGTCGGCAT GATCCTCGAG
CCAATTCCGG CAACGGCGAT CAGTTTTATT GCGGTTACTA TTTGCGTTAT TGGCAGTAAT
TACCTGCTCT TTGATGCCAA AGAATTAGCT GACCCAGCGT TTAATGCGCA AAAACAGGCG
CTGAAATGGG GGCTGGCAGG TTTCTCCAGC ACCACCGTCT GGCTGGTGTT TGGCGCATTT
ATTTTTGCGC TGGGCTATGA AGTCACCGGT CTGGGCCGTC GTATCGCCCT TTTCCTGGTG
AAATTCATGG GTAAACGCAC GCTGACGCTG GGTTACGCGA TTGTCATTAT CGACATTCTG
CTGGCACCGT TTACACCGTC CAACACCGCG CGTACCGGGG GTACGGTTTT TCCGGTCATT
AAAAACCTGC CGCCGCTGTT TAAATCATTC CCGAACGATC CGTCCGCGCG TCGTATTGGC
GGCTATTTGA TGTGGATGAT GGTCATTAGT ACCAGTCTGA GTTCGTCCAT GTTTGTCACC
GGTGCGGCAC CAAACGTGCT GGGTCTGGAG TTCGTCAGCA AAATCGCAGG GATCCAGATT
AGCTGGCTGC AATGGTTCCT GAGCTTCCTG CCGGTCGGTA TTATTTTGCT GATCGTTGCT
CCGTGGCTCT CCTATGTGCT GTACAAGCCG GAAGTGACTC ACAGTGCCGA AGTGGCAGCA
TGGGCCGGTG ATGAACTGAA AACGATGGGT GCATTGAGCC GTAAAGAGTG GACCCTGATA
GGTCTGGTGC TGCTGAGCTT AGGCTTATGG GTATTCGGCG GCGAAATGAT CGACGCCACG
GCGGTAGGTC TGCTGGCGGT TTCGCTGATG CTGGCCCTGC ACGTTGTACC GTGGAAAGAC
ATTACCCGCT ACAACAGCGC CTGGAACACA CTGGTCAACC TGGCAACGCT GGTTGTTATG
GCGAACGGTT TGACCCGCTC TGGTTTTATC GACTGGTTCG CTAGCACCAT GAGCACGCAC
CTGGAAGGCT TCTCACCGAA CGCAACGGTA ATTGTACTGG TTCTGGTGTT CTACTTTGCA
CACTACCTGT TTGCCAGCCT GTCTGCGCAC ACCGCAACCA TGCTGCCGGT TATTCTGGCC
GTCGGTAAAG GTATTCCGGG CGTACCAATG GAACAACTGT GTATCCTGCT GGTGCTGTCT
ATCGGTATCA TGGGCTGTCT GACGCCGTAT GCAACCGGTC CTGGGGTGAT TATTTACGGC
TGTGGCTATG TGAAATCAAA AGATTACTGG CGTCTTGGCG CAATCTTCGG GGTGATTTAC
ATCTCTATGC TGCTGTTGGT TGGCTGGCCG ATTCTCGCCA TGTGGAACTA A
 
Protein sequence
MAPLVVMGVM FLIPVPDGMP PQAWHYFAVF VAMIVGMILE PIPATAISFI AVTICVIGSN 
YLLFDAKELA DPAFNAQKQA LKWGLAGFSS TTVWLVFGAF IFALGYEVTG LGRRIALFLV
KFMGKRTLTL GYAIVIIDIL LAPFTPSNTA RTGGTVFPVI KNLPPLFKSF PNDPSARRIG
GYLMWMMVIS TSLSSSMFVT GAAPNVLGLE FVSKIAGIQI SWLQWFLSFL PVGIILLIVA
PWLSYVLYKP EVTHSAEVAA WAGDELKTMG ALSRKEWTLI GLVLLSLGLW VFGGEMIDAT
AVGLLAVSLM LALHVVPWKD ITRYNSAWNT LVNLATLVVM ANGLTRSGFI DWFASTMSTH
LEGFSPNATV IVLVLVFYFA HYLFASLSAH TATMLPVILA VGKGIPGVPM EQLCILLVLS
IGIMGCLTPY ATGPGVIIYG CGYVKSKDYW RLGAIFGVIY ISMLLLVGWP ILAMWN