Gene ECH74115_5095 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5095 
Symbolade 
ID6969467 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4734200 
End bp4735966 
Gene Length1767 bp 
Protein Length588 aa 
Translation table11 
GC content55% 
IMG OID643388770 
Productcryptic adenine deaminase 
Protein accessionYP_002273196 
Protein GI209400507 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG1001] Adenine deaminase 
TIGRFAM ID[TIGR01178] adenine deaminase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.0272837 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAATT CCATTAACCA TAAATTTCAT CACATTAGCC GGGCGGAATA CCAGGAATTG 
TTAGCCGTTT CCCGTGGCGA CGCTGTTGCC GATTATATTA TTGATAATGT CTCTATTCTC
GACCTGATTA ATGGCGGAGA AATTTCCGGC CCAATTGTGA TTAAAGGGCG TTACATTGCC
GGTGTTGGCG CAGAATACAC TGATGCTCCG GCTTTGCAGC GGATTGATGC CCACGGCGCA
ACGGCGGTGC CAGGGTTTAT TGATGCTCAC CTGCATATTG AATCCAGCAT GATGACGCCG
GTCACCTTTG AAACCGCCAC CCTGCCTCGC GGCCTGACAA CTGTTATTTG CGATCCTCAT
GAAATCGTCA ACGTGATGGG CGAAGCCGGT TTCGCCTGGT TTGCCCGCTG TGCCGAACAG
GCAAGGCAAA ACCAGTACTT ACAGGTCAGC TCTTGCGTAC CCGCTCTGGA AGGCTGCGAT
GTTAACGGTG CCAGTTTTAC CCTTGAACAG ATGCTCGCCT GGCGGGACCA TCCGCAGGTT
ACCGGCCTTG CAGAAATGAT GGACTACCCT GGCGTAATTA GCGGGCAGAA TGCGCTGCTC
GATAAACTGG ATGCATTTCG CCACCTGACG CTGGACGGTC ACTGCCCGGG TTTGGGTGGT
AAAGAACTTA ACGCCTATAT TGCTGCGGGT ATTGAAAACT GCCACGAAAG TTATCAGCTG
GAAGAAGGAC GCCGGAAATT ACAACTCGGC ATGTCGTTGA TGATCCGCGA AGGGTCCGCT
GCCCGCAATC TCAACGCACT GGCACCGTTG ATCAACGAAT TTAACAGCCC GCAATGCATG
CTCTGTACTG ATGACCGTAA CCCGTGGGAG ATCGCCCATG AAGGACACAT CGATGCCTTA
ATTCGCCGCC TGATCGAACA ACACAATGTG CCGCTGCATG TGGCATATCG CGTCGCCAGC
TGGTCGACGG CGCGCCACTT TGGTCTGAAT CACCTCGGCT TACTGGCACC TGGTAAGCAG
GCTGATATCG TCCTGTTGAG CGATGCGCGT AAGGTCACGG TGCAGCAGGT ACTGGTGAAA
GGCGAACCGA TTGACGCGCA AACCTTACAG GCGGAAGAGT CGGCGAAACT GGCACAATCC
GCTCCGCCAT ATGGCAACAC CATTGCCCGC CAGCCAGTTT CCGCCAGCGA CTTTGCCCTG
CAATTTACGC CCGGAAAACG CTATCGGGTC ATTGACGTCA TCCATAACGA ATTGATTACG
CACTCCCACT CCAGCGTCTA CAGCGAAAAT GGTTTTGAGC GCAATGATGT GTGCTTTATT
GCCGTACTTG AGCGTTACGG GCAACGTCTG GCTCCGGCTT GTGGTTTGCT CGGCGGCTTT
GGTCTGAATG AAGGCGCGCT GGCTGCGACG GTCAGCCATG ACAGCCATAA TATTGTGGTG
ATCGGTCGCA GTGCCGAAGA GATGGCGCTA GCGGTCAATC AGGTGATTCA GGATGGCGGC
GGGTTGTGCG TGGTACGTAA CGGCCAGGTC CAAAGCCATC TGCCGTTACC CATTGCCGGG
CTGATGAGCA CCGACACGGC GCAGTCGCTG GCGGAGCAAA TTGACGCCTT GAAAGCCGCC
GCCCGTGAAT GCGGTCCGTT ACCCGATGAG CCGTTTATTC AGATGGCGTT TCTTTCCCTG
CCAGTGATCC CCGCGCTGAA ACTAACCAGC CAGGGGCTGT TTGATGGCGA GAAGTTTGCC
TTCACCACGC TGGAAGTCAC TGAATAA
 
Protein sequence
MNNSINHKFH HISRAEYQEL LAVSRGDAVA DYIIDNVSIL DLINGGEISG PIVIKGRYIA 
GVGAEYTDAP ALQRIDAHGA TAVPGFIDAH LHIESSMMTP VTFETATLPR GLTTVICDPH
EIVNVMGEAG FAWFARCAEQ ARQNQYLQVS SCVPALEGCD VNGASFTLEQ MLAWRDHPQV
TGLAEMMDYP GVISGQNALL DKLDAFRHLT LDGHCPGLGG KELNAYIAAG IENCHESYQL
EEGRRKLQLG MSLMIREGSA ARNLNALAPL INEFNSPQCM LCTDDRNPWE IAHEGHIDAL
IRRLIEQHNV PLHVAYRVAS WSTARHFGLN HLGLLAPGKQ ADIVLLSDAR KVTVQQVLVK
GEPIDAQTLQ AEESAKLAQS APPYGNTIAR QPVSASDFAL QFTPGKRYRV IDVIHNELIT
HSHSSVYSEN GFERNDVCFI AVLERYGQRL APACGLLGGF GLNEGALAAT VSHDSHNIVV
IGRSAEEMAL AVNQVIQDGG GLCVVRNGQV QSHLPLPIAG LMSTDTAQSL AEQIDALKAA
ARECGPLPDE PFIQMAFLSL PVIPALKLTS QGLFDGEKFA FTTLEVTE