Gene CPS_4872 details


Gene Information

Locus tag: CPS_4872
Symbol: guaD
ID: 3520762
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Colwellia psychrerythraea 34H
Kingdom: Bacteria
Replicon accession: NC_003910
Strand:
Start bp: 5166705
End bp: 5168054
Gene Length: 1350 bp
Protein Length: 449 aa
Translation table: 11
GC content: 41%
IMG OID: 637287311
Product: guanine deaminase
Protein accession: YP_271511
Protein GI: 71280297
COG category: [F] Nucleotide transport and metabolism; [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID: [TIGR02967] guanine deaminase
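
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked against each other. The short sketch below does this under two assumptions that the record itself does not state: that the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5166705
end_bp = 5168054

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon, so it adds no residue.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, gene_length % 3, protein_length)  # 1350 0 449
```

Under these assumptions the result agrees with the reported 1350 bp gene length and 449 aa protein length.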


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.673098
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGAACA ACACTGCAAA CACTTCTTCA ACGAGTGAAG CAATAAGTAA AGCTACAAGT 
ACCGGCCGCA AAGCATATCG TGGCGAAGTA CTACACTTTT TAGCGGATCC TGCCAAAGTA
TCAGAAGAAG AGAGTTATCA GTACTTTGAA GATGGACTAT TAGTCATCAA CCATGGCTTA
GTTGAAGCCG TCGGTAACGC CAAGGATTTA CTGAAAACGT TACCCGCCGA CGTTGTCGTT
ACCCAATATG ACAATGGCCT AATCATGCCT GGTTTTATTG ATACGCATGT ACATTATGCA
CAATCCGAAA TGGTCGCTTC TTACGGCGAA CAATTACTCG AGTGGTTAGA AAACTATACC
TTCCCTGAAG AAAAAAAATT TGCTGATCTT GAACACGGTA AACGTGTTGC TGAATTTTTC
TTAAGCCAAT TATTAGATGC TGGTACCACC ACAGCATTGG TCTTTGGCAC AGTACATAAA
GAATCTGTTG AAGCTTTTTT TACCGTCGCT CAACAGAAAA AATTACGCAT GATTTGCGGT
AAAGTGTTGA TGAATCAAAA CTGTCCTGAT GATTTATCAG ATACCGTTGA ATCAGGTTAC
GCCGACAGTA AAGCGCTCAT TGAAAAATGG CATAACACTG ACAGATTACA ATATGCGGTA
ACGCCACGTT TTGCACCGAC TTGCTCAACG GAACAACTGA ATAAAGCCGG TGAGTTATTA
AAAGAATATC CTAGTGTTTA TTTACATACC CATTTATCTG AAAACAAAGA TGAAATTGCA
TGGGTGAGTG AATTATTCCC TGACAGTGAC GGTTACCTTG ATGTGTACGA TAAAAGCAGT
CTATTAGGTC GCCGTAGTGT TTTTGCTCAC GGTGTACATT TGCACGATCA TGAGTGTCAG
CGCTTAAGTG AGACCAATTC AGCCATTGCT TTTTGCCCAA CCTCAAACTT ATTTTTAGGT
AGCGGTTGTT TCAACTTAAA GCAAGCTGAA GAATTTGATG TGAATGTCGG CTTAGGTACT
GATATTGGTG CCGGTAGCAG TTTCTCTATG TTAACCACAC TCAACGAAGG TTATAAAACT
CAGCAATTAC GTGGTGATAA ATTAAGCCCC TACAAATCAT TATATTTAGC GACCTTAGGG
GGCGCTATTG CCTTAGATTT AGAAGGGACT ATTGGTAACT TTATTCAAGG CGCTGAAGCT
GACTTTATCG TGCTTGATTA TCAAGCAACA CCTTTAATGG ATGTACGCAT CAAACGCTGT
ACAACCTTAA CTGAAAAATT ATTCGTGTTG AGCATGCTAG GTGACGATAG ACACGTTAAA
GCGACGCACA TCATGGGCGA AAAAGTTTAA
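
The gene sequence is laid out in space-separated blocks of ten bases, so it needs to be cleaned before any length or composition check. The following is a minimal sketch, not the database's own method: `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the sequence is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied as displayed in this record.
first_line = "ATGTCGAACA ACACTGCAAA CACTTCTTCA ACGAGTGAAG CAATAAGTAA AGCTACAAGT"
seq = clean_sequence(first_line)

print(len(seq), round(gc_fraction(seq), 3))
```

Passing the whole gene sequence through the same two functions gives a length and GC fraction that can be compared with the Gene Length and GC content fields of the record.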
 
Protein sequence
MSNNTANTSS TSEAISKATS TGRKAYRGEV LHFLADPAKV SEEESYQYFE DGLLVINHGL 
VEAVGNAKDL LKTLPADVVV TQYDNGLIMP GFIDTHVHYA QSEMVASYGE QLLEWLENYT
FPEEKKFADL EHGKRVAEFF LSQLLDAGTT TALVFGTVHK ESVEAFFTVA QQKKLRMICG
KVLMNQNCPD DLSDTVESGY ADSKALIEKW HNTDRLQYAV TPRFAPTCST EQLNKAGELL
KEYPSVYLHT HLSENKDEIA WVSELFPDSD GYLDVYDKSS LLGRRSVFAH GVHLHDHECQ
RLSETNSAIA FCPTSNLFLG SGCFNLKQAE EFDVNVGLGT DIGAGSSFSM LTTLNEGYKT
QQLRGDKLSP YKSLYLATLG GAIALDLEGT IGNFIQGAEA DFIVLDYQAT PLMDVRIKRC
TTLTEKLFVL SMLGDDRHVK ATHIMGEKV
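
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below builds the codon table by hand instead of relying on a library; it uses the amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs only in the start codons it permits). `translate` is a hypothetical helper, and the two inputs are the first three and last three blocks of the gene sequence as displayed in this record.

```python
from itertools import product

# Codon order follows the conventional T, C, A, G layout (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGTCGAACA ACACTGCAAA CACTTCTTCA"))  # MSNNTANTSS
print(translate("GCGACGCACA TCATGGGCGA AAAAGTTTAA"))  # ATHIMGEKV
```

Both results match the first and last residues of the protein sequence listed above.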