Gene B21_02678 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_02678 
SymbolguaD 
ID8116387 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp2855835 
End bp2857154 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content49% 
IMG OID644848874 
Producthypothetical protein 
Protein accessionYP_003000447 
Protein GI251786143 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID[TIGR02967] guanine deaminase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.600174 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGTCAG GAGAACACAC GTTAAAAGCG GTACGAGGCA GTTTTATTGA TGTCACCCGT 
ACGGTCGATA ACCCGGAAGA GATTGCCTCT GCGCTGCGGT TTATTGAGGA TGGTTTATTA
CTCATTAAAC AGGGAAAAGT GGAATGGTTT GGCGAATGGG AAAACGGAAA GCATCAAATT
CCTGACACCA TTCGCGTGCG CGACTATCGC GGCAAACTGA TAGTACCGGG CTTTGTCGAT
ACACATATCC ATTATCCGCA AAGTGAAATG GTGGGGGCCT ATGGTGAGCA ATTGCTGGAG
TGGTTGAATA AACACACCTT CCCTACTGAA CGGCGTTATG AGGATTTAGA GTACGCCCGC
GAAATGTCGG CGTTCTTCAT CAAGCAGCTT TTACGTAACG GAACCACCAC GGCGCTGGTG
TTTGGCACTG TTCATCCGCA ATCCGTTGAT GCGCTGTTTG AAGCCGCCAG TCATATCAAT
ATGCGTATGA TTGCCGGTAA GGTGATGATG GACCGTAACG CACCGGATTA TCTGCTCGAC
ACTGCCGAAA GCAGCTATCA CCAAAGCAAA GAACTGATTG AACGCTGGCA CAAAAATGGT
CGTCTGCTAT ATGCGATTAC GCCACGCTTC GCCCCGACCT CATCTCCTGA ACAGATGGCG
ATGGCGCAAC GCCTGAAAGA AGAATATCCG GATACGTGGG TACATACCCA TCTCAGTGAA
AACAAAGATG AAATTGCCTG GGTGAAATCG CTTTATCCTG ACCATGATGG TTATCTGGAT
GTTTACCATC AGTACGGCCT GACCGGTAAA AACTGTGTCT TTGCTCACTG CGTCCATCTC
GAAGAAAAAG AGTGGGATCG TCTCAGCGAA ACCAAATCCA GCATTGCTTT CTGTCCGACC
TCCAACCTTT ACCTCGGCAG CGGCTTATTC AACTTGAAAA AAGCATGGCA GAAGAAAGTT
AAAGTGGGCA TGGGAACGGA TATCGGTGCC GGAACCACTT TCAACATGCT GCAAACGCTG
AACGAAGCCT ACAAAGTATT GCAATTACAA GGCTATCGCC TCTCGGCATA TGAAGCGTTT
TACTTGGCCA CGCTCGGCGG AGCGAAATCT CTGGGCCTTG ACGATTTGAT TGGCAACTTT
TTACCTGGCA AAGAGGCTGA TTTCGTGGTG ATGGAACCCA CCGCCACTCC GCTACAGCAG
CTGCGCTATG ACAACTCTGT TTCTTTAGTC GACAAATTGT TCGTGATGAT GACGTTGGGC
GATGACCGTT CGATCTACCG CACCTACGTT GATGGTCGTC TGGTGTACGA ACGCAACTAA
 
Protein sequence
MMSGEHTLKA VRGSFIDVTR TVDNPEEIAS ALRFIEDGLL LIKQGKVEWF GEWENGKHQI 
PDTIRVRDYR GKLIVPGFVD THIHYPQSEM VGAYGEQLLE WLNKHTFPTE RRYEDLEYAR
EMSAFFIKQL LRNGTTTALV FGTVHPQSVD ALFEAASHIN MRMIAGKVMM DRNAPDYLLD
TAESSYHQSK ELIERWHKNG RLLYAITPRF APTSSPEQMA MAQRLKEEYP DTWVHTHLSE
NKDEIAWVKS LYPDHDGYLD VYHQYGLTGK NCVFAHCVHL EEKEWDRLSE TKSSIAFCPT
SNLYLGSGLF NLKKAWQKKV KVGMGTDIGA GTTFNMLQTL NEAYKVLQLQ GYRLSAYEAF
YLATLGGAKS LGLDDLIGNF LPGKEADFVV MEPTATPLQQ LRYDNSVSLV DKLFVMMTLG
DDRSIYRTYV DGRLVYERN