Gene EcSMS35_2362 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2362 
Symbolada 
ID6146437 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2393061 
End bp2394125 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content57% 
IMG OID641617235 
Productregulatory protein Ada 
Protein accessionYP_001744407 
Protein GI170684032 
COG category[F] Nucleotide transport and metabolism
[L] Replication, recombination and repair 
COG ID[COG0350] Methylated DNA-protein cysteine methyltransferase
[COG2169] Adenosine deaminase 
TIGRFAM ID[TIGR00589] O-6-methylguanine DNA methyltransferase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.960218 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAACG CCACATGCTT AACAGACGAT CAACGCTGGC AATCTGTCTT AGCCCGCGAC 
CCGAATGCCG ACGGCGAATT CGTTTTCGCC GTGCGCACCA CAGGTATCTT TTGCCGTCCG
TCTTGCCGCG CCAGACATGC CTTGCGGGAA AATGTCTCCT TCTACGCAAA TGCCAGCGAG
GCACTCGCCG CCGGATTTCG CCCCTGCAAA CGTTGTCAGC CAGACAAAGC CAATGCCCAG
CAACATCGCC TTGATAAAAT TACCCACGCC TGCCGACTTC TGGAACAGGA AACGCCTGTA
ACGCTGGAAG CCTTAGCCGA CCATGTGGCG ATGAGTCCGT TCCATCTGCA TCGGTTGTTT
AAAGCGACTA CCGGAATGAC GCCTAAAGCC TGGCAACAGG CCTGGCGCGC TCGCCGTTTG
CGCGAATCGC TGGCGAAAGG GGAGAGCGTG ACGACGTCTA TTCTTAACGC CGGATTCCCC
GACAGCAGCA GTTACTACCG CAAAGCTGAC GAAACGCTGG GCATGACGGC TAAACAATTC
CGTCACGGTG GCGAAAATCT GGCGGTGCGT TACGCGCTGG CTGATTGTGA GCTGGGCCGT
TGCCTGGTGG CAGAAAGTGA GCGGGGGATT TGCGCGATAT TACTGGGCGA TGATGACGCG
ACACTAATCA GCGAGTTACA GCAGATGTTT CCCGCTGCCG ACAGCGCGCC TGCCGATCTG
ACGTTTCAGC AACATGTGCG TGAAGTGATC GCCAGCCTCA ATCAACGCGA TACGCCGCTG
ACGTTACCGC TGGACATTCG CGGCACTGCT TTTCAGCAAC AAGTCTGGCA GGCACTGCGC
ACGATACCTT GCGGTGAAAC CGTCAGTTAT CAGCAACTGG CTAATGCCAT CGGCAAACCG
AAAGCGGTAC GGGCTGTTGC CAGCGCCTGT GCCGCCAACA AGCTAGCTAT CGTTATACCT
TGTCATCGGG TGGTCCGTGG TGATGGCACA CTTTCCGGTT ACCGCTGGGG CGTGTCGCGC
AAAGCGCAAC TGCTGCGCCG CGAAGCTGAG AATGAGGAGA GGTAA
 
Protein sequence
MKNATCLTDD QRWQSVLARD PNADGEFVFA VRTTGIFCRP SCRARHALRE NVSFYANASE 
ALAAGFRPCK RCQPDKANAQ QHRLDKITHA CRLLEQETPV TLEALADHVA MSPFHLHRLF
KATTGMTPKA WQQAWRARRL RESLAKGESV TTSILNAGFP DSSSYYRKAD ETLGMTAKQF
RHGGENLAVR YALADCELGR CLVAESERGI CAILLGDDDA TLISELQQMF PAADSAPADL
TFQQHVREVI ASLNQRDTPL TLPLDIRGTA FQQQVWQALR TIPCGETVSY QQLANAIGKP
KAVRAVASAC AANKLAIVIP CHRVVRGDGT LSGYRWGVSR KAQLLRREAE NEER