Gene EcHS_A0663 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0663 
Symbol 
ID5595372 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp681197 
End bp682660 
Gene Length1464 bp 
Protein Length487 aa 
Translation table11 
GC content51% 
IMG OID640919844 
Productsodium:sulfate symporter family protein 
Protein accessionYP_001457426 
Protein GI157160108 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0471] Di- and tricarboxylate transporters 
TIGRFAM ID[TIGR00785] anion transporter 


Plasmid Coverage information

Num covering plasmid clones55 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTTTAG CAAAAGATAA TATATGGAAA CTATTGGCCC CACTGGTGGT GATGGGTGTC 
ATGTTTCTTA TCCCTGTCCC CGACGGTATG CCACCACAGG CATGGCATTA CTTCGCTGTG
TTTGTGGCAA TGATTGTCGG CATGATCCTC GAGCCAATTC CGGCAACAGC GATCAGTTTT
ATTGCGGTTA CTATTTGCGT TATTGGCAGT AATTACCTGC TCTTTGATGC CAAAGAATTA
GCTGACCCAG CGTTTAATGC GCAAAAACAG GCGCTGAAAT GGGGTCTGGC TGGTTTTTCC
AGCACCACGG TATGGCTGGT ATTTGGCGCA TTTATTTTTG CATTAGGGTA TGAAGTTTCC
GGGTTAGGTC GTCGCATTGC CCTTTTCCTG GTGAAATTCA TGGGCAAACG CACGCTGACG
TTGGGTTATG CGATTGTCAT TATCGACATT CTGCTGGCAC CGTTTACACC GTCCAACACC
GCGCGTACCG GGGGTACGGT TTTTCCGGTC ATTAAAAACC TGCCGCCGCT GTTTAAATCA
TTCCCCAACG ATCCGTCCGC GCGTCGTATT GGCGGCTATT TGATGTGGAT GATGGTCATT
AGTACCAGTC TGAGTTCGTC CATGTTTGTC ACCGGTGCGG CACCAAACGT GCTGGGTCTG
GAGTTCGTCA GCAAAATTGC CGGTATCCAG ATTAGCTGGT TGCAGTGGTT CCTCTGCTTC
CTGCCGGTTG GGGTTATCTT GCTTATCATT GCGCCGTGGC TTTCCTACGT GCTGTACAAA
CCGGAAATCA CACACAGTGA AGAAGTGGCA ACCTGGGCGG GTGATGAACT AAAAACCATG
GGTGCGCTGA CACGCAGAGA GTGGACGCTG ATTGGCCTTG TATTGCTCAG CTTAGGTTTG
TGGGTATTTG GCAGTGAAGT CATTAATGCT ACTGCGGTTG GTCTGCTGGC AGTTTCGCTA
ATGCTGGCTC TGCACGTTGT GCCGTGGAAA GACATTACCC GCTATAACAG CGCATGGAAC
ACGCTGGTCA ACCTGGCAAC TCTGGTTGTG ATGGCTAACG GCCTGACTCG TTCTGGTTTT
ATTGACTGGT TCGCCGGTAC CATGAGTACG CACCTGGAAG GATTCTCACC AAACGCAACG
GTGATTGTAC TGGTTCTGGT GTTCTACTTT GCACACTACC TGTTTGCCAG CCTGTCTGCG
CACACCGCAA CCATGCTGCC GGTTATTCTG GCCGTCGGTA AAGGTATTCC GGGCGTACCA
ATGGAACAAC TGTGTATCCT GCTGGTGCTG TCTATCGGTA TCATGGGCTG TCTGACGCCG
TATGCAACCG GTCCTGGGGT GATTATTTAC GGCTGTGGCT ATGTGAAATC AAAAGATTAC
TGGCGTCTTG GCGCAATCTT CGGGGTGATT TACATCTCTA TGTTGCTGTT GGTTGGCTGG
CCGATTCTCG CCATGTGGAA CTAA
 
Protein sequence
MSLAKDNIWK LLAPLVVMGV MFLIPVPDGM PPQAWHYFAV FVAMIVGMIL EPIPATAISF 
IAVTICVIGS NYLLFDAKEL ADPAFNAQKQ ALKWGLAGFS STTVWLVFGA FIFALGYEVS
GLGRRIALFL VKFMGKRTLT LGYAIVIIDI LLAPFTPSNT ARTGGTVFPV IKNLPPLFKS
FPNDPSARRI GGYLMWMMVI STSLSSSMFV TGAAPNVLGL EFVSKIAGIQ ISWLQWFLCF
LPVGVILLII APWLSYVLYK PEITHSEEVA TWAGDELKTM GALTRREWTL IGLVLLSLGL
WVFGSEVINA TAVGLLAVSL MLALHVVPWK DITRYNSAWN TLVNLATLVV MANGLTRSGF
IDWFAGTMST HLEGFSPNAT VIVLVLVFYF AHYLFASLSA HTATMLPVIL AVGKGIPGVP
MEQLCILLVL SIGIMGCLTP YATGPGVIIY GCGYVKSKDY WRLGAIFGVI YISMLLLVGW
PILAMWN