Gene ECD_02716 details


Gene Information

Locus tag: ECD_02716
Symbol: guaD
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21(DE3)
Kingdom: Bacteria
Replicon accession: CP001509
Strand:
Start bp: 2856627
End bp: 2857946
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 49%
IMG OID:
Product: guanine deaminase
Protein accession: ACT44533
Protein GI: 253978863
COG category:
COG ID:
TIGRFAM ID:
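
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. Both are assumptions that happen to reproduce the values reported here, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues for an unspliced CDS, not counting the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values copied from the Gene Information fields of this record.
length = gene_length_bp(2856627, 2857946)
print(length, protein_length_aa(length))
```

With the coordinates from this record, the result agrees with the reported Gene Length (1320 bp) and Protein Length (439 aa).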


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.712282
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGTCAG GAGAACACAC GTTAAAAGCG GTACGAGGCA GTTTTATTGA TGTCACCCGT 
ACGGTCGATA ACCCGGAAGA GATTGCCTCT GCGCTGCGGT TTATTGAGGA TGGTTTATTA
CTCATTAAAC AGGGAAAAGT GGAATGGTTT GGCGAATGGG AAAACGGAAA GCATCAAATT
CCTGACACCA TTCGCGTGCG CGACTATCGC GGCAAACTGA TAGTACCGGG CTTTGTCGAT
ACACATATCC ATTATCCGCA AAGTGAAATG GTGGGGGCCT ATGGTGAGCA ATTGCTGGAG
TGGTTGAATA AACACACCTT CCCTACTGAA CGGCGTTATG AGGATTTAGA GTACGCCCGC
GAAATGTCGG CGTTCTTCAT CAAGCAGCTT TTACGTAACG GAACCACCAC GGCGCTGGTG
TTTGGCACTG TTCATCCGCA ATCCGTTGAT GCGCTGTTTG AAGCCGCCAG TCATATCAAT
ATGCGTATGA TTGCCGGTAA GGTGATGATG GACCGTAACG CACCGGATTA TCTGCTCGAC
ACTGCCGAAA GCAGCTATCA CCAAAGCAAA GAACTGATTG AACGCTGGCA CAAAAATGGT
CGTCTGCTAT ATGCGATTAC GCCACGCTTC GCCCCGACCT CATCTCCTGA ACAGATGGCG
ATGGCGCAAC GCCTGAAAGA AGAATATCCG GATACGTGGG TACATACCCA TCTCAGTGAA
AACAAAGATG AAATTGCCTG GGTGAAATCG CTTTATCCTG ACCATGATGG TTATCTGGAT
GTTTACCATC AGTACGGCCT GACCGGTAAA AACTGTGTCT TTGCTCACTG CGTCCATCTC
GAAGAAAAAG AGTGGGATCG TCTCAGCGAA ACCAAATCCA GCATTGCTTT CTGTCCGACC
TCCAACCTTT ACCTCGGCAG CGGCTTATTC AACTTGAAAA AAGCATGGCA GAAGAAAGTT
AAAGTGGGCA TGGGAACGGA TATCGGTGCC GGAACCACTT TCAACATGCT GCAAACGCTG
AACGAAGCCT ACAAAGTATT GCAATTACAA GGCTATCGCC TCTCGGCATA TGAAGCGTTT
TACTTGGCCA CGCTCGGCGG AGCGAAATCT CTGGGCCTTG ACGATTTGAT TGGCAACTTT
TTACCTGGCA AAGAGGCTGA TTTCGTGGTG ATGGAACCCA CCGCCACTCC GCTACAGCAG
CTGCGCTATG ACAACTCTGT TTCTTTAGTC GACAAATTGT TCGTGATGAT GACGTTGGGC
GATGACCGTT CGATCTACCG CACCTACGTT GATGGTCGTC TGGTGTACGA ACGCAACTAA
 
Protein sequence
MMSGEHTLKA VRGSFIDVTR TVDNPEEIAS ALRFIEDGLL LIKQGKVEWF GEWENGKHQI 
PDTIRVRDYR GKLIVPGFVD THIHYPQSEM VGAYGEQLLE WLNKHTFPTE RRYEDLEYAR
EMSAFFIKQL LRNGTTTALV FGTVHPQSVD ALFEAASHIN MRMIAGKVMM DRNAPDYLLD
TAESSYHQSK ELIERWHKNG RLLYAITPRF APTSSPEQMA MAQRLKEEYP DTWVHTHLSE
NKDEIAWVKS LYPDHDGYLD VYHQYGLTGK NCVFAHCVHL EEKEWDRLSE TKSSIAFCPT
SNLYLGSGLF NLKKAWQKKV KVGMGTDIGA GTTFNMLQTL NEAYKVLQLQ GYRLSAYEAF
YLATLGGAKS LGLDDLIGNF LPGKEADFVV MEPTATPLQQ LRYDNSVSLV DKLFVMMTLG
DDRSIYRTYV DGRLVYERN
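
The protein sequence corresponds to the gene sequence read in frame from the first base. As a minimal sketch, the snippet below translates the first and last 60-base lines of the gene sequence using the amino acid assignments of the standard genetic code, which translation table 11 shares. The `translate` helper is illustrative and is not part of the source database; alternative start codons, which table 11 also permits, are not handled.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
# Translation table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence in this record; each line holds
# 60 bases, so every line starts on a codon boundary.
first_line = "ATGATGTCAG GAGAACACAC GTTAAAAGCG GTACGAGGCA GTTTTATTGA TGTCACCCGT"
last_line = "GATGACCGTT CGATCTACCG CACCTACGTT GATGGTCGTC TGGTGTACGA ACGCAACTAA"

print(translate(first_line))  # MMSGEHTLKAVRGSFIDVTR
print(translate(last_line))   # DDRSIYRTYVDGRLVYERN
```

The two outputs match the first 20 and last 19 residues of the protein sequence, and the final codon TAA is the stop.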