Gene EcHS_A2352 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A2352 
Symbolada 
ID5590978 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp2350322 
End bp2351386 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content57% 
IMG OID640921478 
Productregulatory protein Ada 
Protein accessionYP_001459013 
Protein GI157161695 
COG category[F] Nucleotide transport and metabolism
[L] Replication, recombination and repair 
COG ID[COG0350] Methylated DNA-protein cysteine methyltransferase
[COG2169] Adenosine deaminase 
TIGRFAM ID[TIGR00589] O-6-methylguanine DNA methyltransferase 


Plasmid Coverage information

Num covering plasmid clones44 
Plasmid unclonability p-value0.970088 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAACG CCACATGCTT AACTGACGAT CAACGCTGGC AATCTGTCTT AGCCCGCGAC 
CCGAATGCCG ACGGCGAATT CGTTTTCGCC GTGCGCACCA CGGGTATCTT TTGCCGTCCG
TCTTGCCGCG CCAGACATGC CTTGCGGGAA AATGTCTCCT TCTACGCAAA TGCCAGCGAG
GCACTCGCCG CCGGATTTCG CCCCTGCAAA CGTTGTCAGC CAGACAAAGC CAATCCCCGG
CAACATCGGT TGGATAAAAT CACCCACGTG TGTCGACTGC TGGAACAGGA AACGCCTGTA
ACGCTGGAAG CCTTAGCCGA CCATGTGGCG ATGAGTCCGT TCCATCTGCA TCGGTTGTTT
AAAGCTACTA CCGGAATGAC GCCTAAAGCC TGGCAACAGG CCTGGCGCGC TCGCCGTTTG
CGCGAATCGC TGGCGAAAGG GGAGAGCGTG ACGACGTCTA TTCTTAACGC CGGATTCCCC
GACAGCAGCA GTTACTATCG CAAAGCTGAC GAAACGCTGG GCATGACGGC TAAACAATTC
CGTCACGGTG GCGAAAATCT GGCGGTGCGT TACGCGCTGG CTGATTGTGA GCTGGGTCGT
TGCCTGGTGG CAGAAAGCGA GCGGGGGATT TGCGCGATAT TGCTGGGCGA TGATGACGCG
ACACTAATCA GCGAGTTGCA GCAGATGTTT CCCGCTGCCG ACAACGCGCC TGCCGATCTG
ATGTTTCAGC AACATGTGCG TGAAGTGATC GCCAGCCTCA ATCAACGCGA TACGCCGCTG
ACGTTACCGC TGGACATTCG CGGCACTGCT TTTCAGCAAC AAGTCTGGCA GGCACTGCGC
ACGATACCTT GCGGTGAAAC CGTCAGTTAT CAGCAACTGG CTAACGCCAT CGGCAAACCG
AAAGCGGTAC GGGCCGTTGC CAGCGCCTGT GCCGCCAACA AGCTGGCTAT CGTAATACCC
TGTCATCGAG TGGTCCGTGG TGATGGCACA CTTTCCGGTT ACCGCTGGGG CGTGTCGCGT
AAAGCGCAAC TGCTGCGCCG CGAAGCTGAA AATGAGGAGA GGTAA
 
Protein sequence
MKNATCLTDD QRWQSVLARD PNADGEFVFA VRTTGIFCRP SCRARHALRE NVSFYANASE 
ALAAGFRPCK RCQPDKANPR QHRLDKITHV CRLLEQETPV TLEALADHVA MSPFHLHRLF
KATTGMTPKA WQQAWRARRL RESLAKGESV TTSILNAGFP DSSSYYRKAD ETLGMTAKQF
RHGGENLAVR YALADCELGR CLVAESERGI CAILLGDDDA TLISELQQMF PAADNAPADL
MFQQHVREVI ASLNQRDTPL TLPLDIRGTA FQQQVWQALR TIPCGETVSY QQLANAIGKP
KAVRAVASAC AANKLAIVIP CHRVVRGDGT LSGYRWGVSR KAQLLRREAE NEER