Gene EcolC_2007 details

Gene Information

Locus tag: EcolC_2007
Symbol:
ID: 6068080
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 2213441
End bp: 2214442
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table: 11
GC content: 54%
IMG OID: 641601421
Product: adenosine deaminase
Protein accession: YP_001724980
Protein GI: 170020026
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG1816] Adenosine deaminase
TIGRFAM ID: [TIGR01430] adenosine deaminase
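
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so, but the numbers agree with that reading) and that the CDS ends in a single stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2213441
end_bp = 2214442
gene_length_bp = 1002
protein_length_aa = 333

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: one codon per residue plus one terminal stop codon.
codons = span // 3
consistent = (
    span == gene_length_bp
    and span % 3 == 0
    and codons - 1 == protein_length_aa
)

print(span, codons, consistent)  # prints: 1002 334 True
```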


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00173018
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.960599
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTGATA CCACCCTGCC ATTAACTGAT ATCCATCGCC ACCTTGATGG CAACATTCGT 
CCCCAGACCA TTCTTGAACT TGGCCGCCAG TATAATATCT CGCTTCCTGC ACAATCCCTG
GAAACACTGA TTCCCCACGT TCAGGTCATT GCCAACGAAC CCGATCTGGT GAGCTTTCTG
ACCAAACTTG ACTGGGGCGT TAAAGTTCTC GCCTCTCTTG ATGCCTGTCG CCGCGTGGCA
TTTGAAAACA TTGAAGATGC AGCCCGTCAC GGCCTGCACT ATGTCGAGCT GCGTTTTTCA
CCAGGCTACA TGGCAATGGC ACATCAGCTG CCTGTAGCGG GTGTTGTCGA AGCGGTGATC
GATGGCGTAC GTGAAGGTTG CCGCACCTTT GGTGTGCAGG CGAAGCTTAT CGGCATTATG
AGCCGGACCT TCGGCGAAGC CGCCTGTCAG CAAGAGCTGG AGGCCTTTTT AGCCCACCGT
GACCAGATTA CCGCACTTGA TTTAGCCGGT GATGAACTTG GTTTCCCGGG AAGTCTGTTC
CTTTCTCACT TCAACCGCGC GCGTGATGCG GGCTGGCATA TTACCGTCCA TGCAGGCGAA
GCTGCCGGGC CGGAAAGCAT CTGGCAGGCG ATTCGTGAAC TGGGTGCGGA GCGTATTGGA
CATGGCGTAA AAGCCATTGA AGATCGGGCG CTGATGGATT TTCTCGCCGA GCAACAAATT
GGTATTGAAT CCTGTCTGAC CTCCAATATT CAGACCAGCA CCGTAGCAGA GCTGGCTGCA
CATCCGCTGA AAACGTTCCT TGAGCATGGC ATTCGTGCCA GCATTAACAC TGACGATCCC
GGCGTACAGG GAGTGGATAT CATTCACGAA TATACCGTTG CCGCGCCAGC TGCTGGGTTA
TCCCGCGAGC AAATCCGCCA GGCACAGATT AATGGTCTGG AAATGGCTTT CCTCAGCGCT
GAGGAAAAAC GCGCACTGCG AGAAAAAGTC GCCGCGAAGT AA
 
Protein sequence
MIDTTLPLTD IHRHLDGNIR PQTILELGRQ YNISLPAQSL ETLIPHVQVI ANEPDLVSFL 
TKLDWGVKVL ASLDACRRVA FENIEDAARH GLHYVELRFS PGYMAMAHQL PVAGVVEAVI
DGVREGCRTF GVQAKLIGIM SRTFGEAACQ QELEAFLAHR DQITALDLAG DELGFPGSLF
LSHFNRARDA GWHITVHAGE AAGPESIWQA IRELGAERIG HGVKAIEDRA LMDFLAEQQI
GIESCLTSNI QTSTVAELAA HPLKTFLEHG IRASINTDDP GVQGVDIIHE YTVAAPAAGL
SREQIRQAQI NGLEMAFLSA EEKRALREKV AAK
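
The sequences above are printed in space-separated blocks of ten, so they need cleaning before any computation. The following is a minimal sketch, not the database's own tooling: it strips the block spacing, computes a GC fraction, and translates a reading frame. It relies on the sense-codon assignments of translation table 11 being the same as those of the standard genetic code (the two tables differ in permitted start codons). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first row of the gene sequence is used as input.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Table 11 uses the same
# sense-codon assignments as the standard code; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(blocked: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate frame 0 codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence listed above.
first_row = "ATGATTGATA CCACCCTGCC ATTAACTGAT ATCCATCGCC ACCTTGATGG CAACATTCGT"
dna = clean(first_row)
print(len(dna), translate(dna))  # prints: 60 MIDTTLPLTDIHRHLDGNIR
```

The translated prefix matches the first twenty residues of the protein sequence listed above. Passing the full gene sequence through `clean` and then `gc_fraction` gives a value to compare against the GC content field.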