Gene EcSMS35_3758 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3758 
Symbol 
ID6145708 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3823613 
End bp3824662 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content51% 
IMG OID641618584 
Producthypothetical protein 
Protein accessionYP_001745724 
Protein GI170679689 
COG category[R] General function prediction only 
COG ID[COG0628] Predicted permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAACCC CTCAACCCGA TAAAACGGGC ATGCACATTC TGCTCAAGCT GGCCTCGCTG 
GTGGTGATCC TCGCGGGAAT TCACGCAGCG GCAGATATCA TTGTGCAGCT GTTACTGGCG
CTGTTTTTTG CCATCGTCCT CAACCCGCTC GTCACCTGGT TTATTCGTCG GGGAGTACAA
CGCCCCGTTG CCATTACGAT TGTGGTGGTG GTGATGCTGA TCGCACTAAC CGCGCTGGTC
GGCGTACTGG CGGCATCGTT TAACGAATTT ATCTCTATGC TGCCGAAGTT TAATAAGGAG
CTGACGCGCA AACTTTTTAA ATTGCAGGAG ATGTTGCCTT TTCTTAATTT GCATATGTCG
CCGGAGCGAA TGCTGCAGCG GATGGACTCG GAAAAAATGG TTACCTTCAC CACGGCGCTA
ATGACCGGGC TTTCCGGGGC AATGGCGAGC GTGCTTTTGC TGGTGATGAC CGTAGTTTTC
ATGCTGTTTG AAGTGCGCCA CGTCCCTTAC AAAATGCGTT TTGCGTTGAA TAATCCACAG
ATTCACATCG CCGGACTACA CCGCGCCTTA AAAGGTGTTT CGCACTATCT TGCATTGAAG
ACACTGCTAA GTTTATGGAC AGGCGTAATC GTCTGGCTGG GGCTGGCGCT AATGGGCGTA
CAGTTTGCGC TGATGTGGGC AGTACTGGCG TTTTTGCTCA ACTACGTGCC CAATATCGGC
GCGGTAATTT CCGCCGTACC GCCAATGATT CAGGTGCTGC TGTTTAATGG CGTTTACGAA
TGTATTCTGG TCGGCGCATT GTTTTTAGTG GTCCATATGG TCATCGGCAA TATTTTAGAA
CCACGGATGA TGGGCCATCG CCTGGGGATG TCCACCATGG TGGTATTTCT TTCATTGTTA
ATTTGGGGAT GGCTGCTCGG CCCGGTAGGG ATGCTACTTT CGGTACCATT AACCAGCGTG
TGTAAAATCT GGATGGAAAC CACCAAAGGC GGTAGCAAAC TGGCGATTTT ACTGGGACCG
GGCAGACCGA AAAGTCGATT ACCGGGATGA
 
Protein sequence
METPQPDKTG MHILLKLASL VVILAGIHAA ADIIVQLLLA LFFAIVLNPL VTWFIRRGVQ 
RPVAITIVVV VMLIALTALV GVLAASFNEF ISMLPKFNKE LTRKLFKLQE MLPFLNLHMS
PERMLQRMDS EKMVTFTTAL MTGLSGAMAS VLLLVMTVVF MLFEVRHVPY KMRFALNNPQ
IHIAGLHRAL KGVSHYLALK TLLSLWTGVI VWLGLALMGV QFALMWAVLA FLLNYVPNIG
AVISAVPPMI QVLLFNGVYE CILVGALFLV VHMVIGNILE PRMMGHRLGM STMVVFLSLL
IWGWLLGPVG MLLSVPLTSV CKIWMETTKG GSKLAILLGP GRPKSRLPG