Gene EcSMS35_1576 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1576 
Symboladd 
ID6144345 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1559430 
End bp1560431 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content54% 
IMG OID641616453 
Productadenosine deaminase 
Protein accessionYP_001743631 
Protein GI170684142 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG1816] Adenosine deaminase 
TIGRFAM ID[TIGR01430] adenosine deaminase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.859905 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones69 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTGATA CCACCCTGCC ATTAACTGAT ATCCATCGCC ACCTTGATGG CAACATTCGT 
CCCCAGACCA TTCTTGAACT TGGCCGCCGG TATAATATCT CGCTTCCTGC ACAATCCCTG
GAAACACTGA TTCCCCACGT TCAGGTCATT GCCAACGAAC CCGATCTTGT GAGCTTTCTG
ACCAAACTTG ACTGGGGCGT TAAAGTTCTC GCCTCTCTTG ATGCCTGTCG CCGCGTGGCA
TTTGAAAACA TTGAAGATGC AGCCCGTAAC GGCCTGCACT ATGTCGAGCT GCGTTTTTCA
CCGGGCTACA TGGCAATGGC ACATCAGCTG CCTGTAGCGG GTGTTGTCGA AGCGGTGATC
GATGGCGTAC GTGAAGGTTG CCGCACCTTT GGTGTGCAGG CGAAGCTTAT CGGCATTATG
AGCCGGACCT TCGGTGAAGC CGCCTGTCAG CAAGAGCTGG AGGCCTTTTT AGCCCACCGT
GACCAGATTA CCGCACTTGA TTTAGCCGGT GATGAACTTG GTTTCCCGGG AAGTCTGTTC
CTTTCTCACT TCAACCGCGC GCGTGATGCG GACTGGCATA TTACCGTCCA TGCAGGCGAA
GCTGCCGGGC CGGAAAGCAT CTGGCAGGCG ATTCGTGAAC TGGGGGCGGA ACGTATTGGA
CATGGCGTAA AAGCCATTGA AGATCGGGCG CTGATGGATT TTCTCGCCGA GCAGCAAATT
GGTATTGAAT CCTGTCTGAC CTCCAATATT CAGACCAGCA CCGTGGCAGA GCTGGCGGCA
CATCCGCTGA AAATGTTCCT TGAGCATGGC ATTCGTGCCA GCATTAACAC TGACGATCCC
GGCGTACAGG GAGTGGATAT CATTCACGAA TATACCGTTG CCGCGCCGGC TGCTGGGTTA
TCCCGCGAGC AAATCCGCCA GGCGCAGATT AATGGTCTGG AAATGGCTTT CCTCAGCGCA
GAGGAAAAAC GCGCACTGCG AGAAAAAGTC GCTGCGAAGT AA
 
Protein sequence
MIDTTLPLTD IHRHLDGNIR PQTILELGRR YNISLPAQSL ETLIPHVQVI ANEPDLVSFL 
TKLDWGVKVL ASLDACRRVA FENIEDAARN GLHYVELRFS PGYMAMAHQL PVAGVVEAVI
DGVREGCRTF GVQAKLIGIM SRTFGEAACQ QELEAFLAHR DQITALDLAG DELGFPGSLF
LSHFNRARDA DWHITVHAGE AAGPESIWQA IRELGAERIG HGVKAIEDRA LMDFLAEQQI
GIESCLTSNI QTSTVAELAA HPLKMFLEHG IRASINTDDP GVQGVDIIHE YTVAAPAAGL
SREQIRQAQI NGLEMAFLSA EEKRALREKV AAK