Gene EcSMS35_1598 details


Gene Information

Locus tag: EcSMS35_1598
Symbol: tqsA
ID: 6146931
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1587543
End bp: 1588577
Gene Length: 1035 bp
Protein Length: 344 aa
Translation table: 11
GC content: 50%
IMG OID: 641616475
Product: putative transport protein
Protein accession: YP_001743653
Protein GI: 170683134
COG category: [R] General function prediction only
COG ID: [COG0628] Predicted permease
TIGRFAM ID:
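
The Start bp, End bp, Gene Length, and Protein Length fields can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming it ends with exactly one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Coordinates taken from the record above.
length = gene_length(1587543, 1588577)
print(length, protein_length(length))
```

For this record the sketch gives 1035 bp and 344 aa, which agrees with the listed Gene Length and Protein Length.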


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 60
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAAAGC CGATCATCAC GCTCAATGGC CTAAAAATCG TCATTATGTT GGGAATGCTG 
GTCATTATTC TCTGCGGTAT CCGTTTTGCC GCCGAGATCA TCGTGCCGTT TATTCTCGCA
TTATTTATTG CTGTTATTCT TAACCCGCTG GTGCAACACA TGGTCCGCTG GCGAGTGCCG
CGTGTACTGG CGGTGTCGAT TTTGATGACC ATCATCGTGA TGGCGATGGT GTTGCTGTTA
GCTTATCTGG GTTCCGCGCT CAACGAGTTG ACGCGGACGT TACCGCAATA TCGCAACTCT
ATTATGACGC CGCTGCAAGC GATTGAACCG TTGTTGCAAC GCGTAGGGAT TGATGTCTCA
GTTGACCAAC TGGCGCATTA CATTGATCCG AACGCGGCAA TGACGTTGCT CACCAACTTA
TTGACGCAGT TATCTAATGC CATGTCATCA ATCTTTTTAT TGCTGCTGAC GGTGCTGTTT
ATGTTGCTCG AAGTGCCACA ATTGCCCGGA AAATTTCAGC AAATGATGGT GCGTCCGGTT
GAAGGGATGG CGGCGATTCA GCGTGCGATT GACAGTGTGT CTCATTATCT GGTGCTGAAA
ACAGCCATCA GCATCATCAC CGGCCTGGTC GCCTGGGCGA TGCTCGCCGC ACTCGATGTT
CGCTTCGCTT TTGTCTGGGG ATTGCTGGCC TTTGCGCTTA ATTACATCCC TAATATTGGT
TCAGTTCTCG CGGCAATCCC CCCTATCGCT CAGGTACTGG TGTTTAATGG CTTCTACGAA
GCGTTGCTGG TGCTGGCGGG ATATCTGCTA ATTAATCTGG TCTTCGGCAA TATTCTGGAG
CCGCGCATCA TGGGACGTGG GCTGGGGCTT TCCACATTGG TGGTATTTTT GTCGTTGATT
TTTTGGGGAT GGTTGTTAGG ACCGGTGGGT ATGCTGCTTT CCGTGCCGTT AACAATTATT
GTCAAAATTG CGCTTGAACA AACAACGGGA GGTCAAAGCA TCGCCGTTCT GTTAAGCGAT
CTCAACAAAG AGTGA
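
The gene sequence is printed in blocks of 10 bases, so the spacing has to be removed before the sequence can be measured or compared with the GC content field. The following is a rough sketch of that cleanup and of a GC calculation; the helper names are hypothetical, and only the first line of the sequence is used as a short demonstration.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used only as a short demonstration.
first_line = "ATGGCAAAGC CGATCATCAC GCTCAATGGC CTAAAAATCG TCATTATGTT GGGAATGCTG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full sequence text instead of `first_line` would allow a comparison with the Gene Length and GC content fields in the Gene Information section.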
 
Protein sequence
MAKPIITLNG LKIVIMLGML VIILCGIRFA AEIIVPFILA LFIAVILNPL VQHMVRWRVP 
RVLAVSILMT IIVMAMVLLL AYLGSALNEL TRTLPQYRNS IMTPLQAIEP LLQRVGIDVS
VDQLAHYIDP NAAMTLLTNL LTQLSNAMSS IFLLLLTVLF MLLEVPQLPG KFQQMMVRPV
EGMAAIQRAI DSVSHYLVLK TAISIITGLV AWAMLAALDV RFAFVWGLLA FALNYIPNIG
SVLAAIPPIA QVLVFNGFYE ALLVLAGYLL INLVFGNILE PRIMGRGLGL STLVVFLSLI
FWGWLLGPVG MLLSVPLTII VKIALEQTTG GQSIAVLLSD LNKE
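
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own method. It assumes the standard codon assignments, which translation table 11 shares with the standard code (table 11 differs only in its permitted start codons); alternative start codons are not handled, which does not matter here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons enumerated over T, C, A, G
# (third base varying fastest); "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGGCAAAGC CGATCATCAC GCTCAATGGC"))  # MAKPIITLNG
```

The output matches the first 10 residues of the protein sequence listed above.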