Gene EcolC_0034 details

Gene Information

Locus tag: EcolC_0034
Symbol:
ID: 6068468
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 35635
End bp: 37401
Gene Length: 1767 bp
Protein Length: 588 aa
Translation table: 11
GC content: 55%
IMG OID: 641599438
Product: cryptic adenine deaminase
Protein accession: YP_001723048
Protein GI: 170018094
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG1001] Adenine deaminase
TIGRFAM ID: [TIGR01178] adenine deaminase
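
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the CDS ends with a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and not part of the source record.

```python
from typing import Tuple


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a trailing stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive range, so add 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the last codon is the stop codon
    return gene_len, protein_len


# Values taken from the record: Start bp 35635, End bp 37401
print(cds_lengths(35635, 37401))  # (1767, 588)
```

Under these assumptions the result agrees with the listed Gene Length (1767 bp) and Protein Length (588 aa).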


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATAATT CTATTAACCA TAAATTTCAT CACATTAGCC GGGCTGAATA CCAGGAATTG 
TTAGCCGTTT CCCGTGGCGA CGCTGTTGCC GATTATATTA TTGATAATGT CTCTATTCTC
GACCTGATCA ATGGCGGAGA AATTTCCGGC CCAATTGTGA TTAAAGGACG TTACATTGCC
GGTGTTGGCG CAGAATACGC TGATGCTCCG GCTTTGCAGC GGATTGATGC TCGCGGCGCA
ACGGCGGTGC CAGGGTTTAT TGATGCTCAC CTGCATATTG AATCCAGCAT GATGACGCCG
GTCACTTTTG AAACCGCTAC CCTGCCGCGC GGCCTGACGA CCGTTATTTG CGACCCTCAT
GAAATCGTCA ACGTGATGGG AGAAGCCGGA TTCGCCTGGT TTGCCCGCTG TGCCGAACAG
GCAAGGCAAA ACCAGTACTT ACAGGTCAGC TCTTGCGTAC CCGCCCTGGA AGGCTGCGAT
GTTAACGGTG CCAGTTTTAC CCTTGAACAG ATGCTCGCCT GGCGGGACCA TCCGCAGGTT
ACCGGCCTTG CAGAAATGAT GGACTACCCT GGCGTAATTA GCGGGCAGAA TGCGCTGCTC
GATAAACTGG ATGCATTTCG CCACCTGACG CTGGACGGTC ACTGCCCGGG TTTGGGTGGT
AAAGAACTTA ACGCCTATAT TACTGCGGGT ATTGAAAACT GCCACGAAAG TTATCAGCTG
GAAGAAGGAC GCCGGAAATT ACAACTCGGC ATGTCGTTGA TGATCCGCGA AGGGTCCGCT
GCCCGCAATC TCAACGCGCT GGCACCGTTG ATCAACGAAT TTAACAGCCC GCAATGCATG
CTCTGTACCG ATGACCGTAA CCCGTGGGAG ATCGCCCATG AAGGACACAT CGATGCCTTA
ATTCGCCGCC TGATCGAACA ACACAATGTG CCGCTGCATG TGGCATATCG CGTCGCCAGC
TGGTCGACGG CGCGCCACTT TGGTCTGAAT CACCTCGGCT TACTGGCACC CGGCAAGCAG
GCCGATATCG TCCTGTTGAG CGATGCGCGT AAGGTCACGG TGCAGCAGGT ACTGGTGAAA
GGCGAGCCGA TTGATGCGCA AACCTTACAG GCGGAAGAGT CGGCGAGACT GGCACAATCC
GCTCCGCCAT ATGGCAACAC CATTGCCCGC CAGCCAGTTT CCGCCAGCGA CTTTGCCCTG
CAATTTACGC CCGGAAAACG CTATCGGGTC ATTGACGTCA TCCATAACGA ATTGATTACG
CACTCCCACT CCAGCGTCTA CAGCGAAAAT GGTTTTGATC GCGATGATGT GAGCTTTATT
GCCGTACTTG AGCGTTACGG GCAACGGCTG GCTCCGGCTT GTGGTTTGCT TGGCGGCTTT
GGACTGAATG AAGGTGCGCT GGCTGCGACG GTCAGCCATG ACAGCCATAA TATTGTGGTG
ATCGGTCGCA GTGCCGAAGA GATGGCGCTG GCGGTCAATC AGGTGATTCA GGATGGCGGC
GGGCTGTGCG TGGTACGTAA CGGCCAGGTC CAAAGTCATC TGCCGTTGCC CATTGCCGGC
CTGATGAGCA CCGACACGGC GCAGTCGCTG GCGGAACAGA TTGACGCCTT GAAAGCCGCC
GCCCGTGAAT GCGGTCCGTT ACCCGATGAG CCGTTTATTC AGATGGCGTT TCTTTCTCTG
CCAGTGATCC CCGCGCTAAA ACTAACCAGT CAGGGGCTAT TTGATGGCGA GAAGTTTGCC
TTCACTACGC TGGAAGTCAC GGAATAA
 
Protein sequence
MNNSINHKFH HISRAEYQEL LAVSRGDAVA DYIIDNVSIL DLINGGEISG PIVIKGRYIA 
GVGAEYADAP ALQRIDARGA TAVPGFIDAH LHIESSMMTP VTFETATLPR GLTTVICDPH
EIVNVMGEAG FAWFARCAEQ ARQNQYLQVS SCVPALEGCD VNGASFTLEQ MLAWRDHPQV
TGLAEMMDYP GVISGQNALL DKLDAFRHLT LDGHCPGLGG KELNAYITAG IENCHESYQL
EEGRRKLQLG MSLMIREGSA ARNLNALAPL INEFNSPQCM LCTDDRNPWE IAHEGHIDAL
IRRLIEQHNV PLHVAYRVAS WSTARHFGLN HLGLLAPGKQ ADIVLLSDAR KVTVQQVLVK
GEPIDAQTLQ AEESARLAQS APPYGNTIAR QPVSASDFAL QFTPGKRYRV IDVIHNELIT
HSHSSVYSEN GFDRDDVSFI AVLERYGQRL APACGLLGGF GLNEGALAAT VSHDSHNIVV
IGRSAEEMAL AVNQVIQDGG GLCVVRNGQV QSHLPLPIAG LMSTDTAQSL AEQIDALKAA
ARECGPLPDE PFIQMAFLSL PVIPALKLTS QGLFDGEKFA FTTLEVTE
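
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how the GC content and the protein translation could be recomputed from the gene sequence. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard one, which is assumed here to be sufficient because translation table 11 uses the same codon-to-amino-acid assignments and differs only in the permitted start codons; alternative start codons are not handled in this sketch.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq: str) -> str:
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
first_blocks = "ATGAATAATT CTATTAACCA TAAATTTCAT"
print(translate(first_blocks))  # MNNSINHKFH
```

The translated prefix matches the first block of the listed protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the reported GC content and protein sequence to be checked in the same way.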