Gene BURPS668_A1698 details


Gene Information

Locus tag: BURPS668_A1698
Symbol:
ID: 4888650
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 1649346
End bp: 1650779
Gene Length: 1434 bp
Protein Length: 477 aa
Translation table: 11
GC content: 62%
IMG OID: 640131636
Product: amidohydrolase family protein
Protein accession: YP_001062693
Protein GI: 126445486
COG category: [F] Nucleotide transport and metabolism; [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID:
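
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon; the function names are illustrative and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1649346, 1650779)
print(length, protein_length_aa(length))  # prints: 1434 477
```

Under these assumptions the values agree with the Gene Length (1434 bp) and Protein Length (477 aa) fields.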


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGGGGCAA CCACCTTAAT TACGGGCGGT ATCGTCGTAT GTGGAGACGA TGCGGGTACG 
GTTATAACCG ACGGAGCCGT GCTCGTGGAC GGCAACTGCA TTGCGGATGT GGGAGACACT
GCGGCCGTGG CGACACGGCA CCCCGCACCC GATCGTATTC TTGACGCGGC CGGGAAAATC
GTCGCGCCCG GTTTCGTGAG CAGCCACAAT CATCTCGGGT TCGTGTTGTT TCGCAGTATG
GCGGAGGATG TCGGTGCAAC ACCGACGCCG ACGATGTTCT TGCCGATGCG CGCTTTGGTG
ACGACCCACG AGCGGATCGT ATTCGCCCGG CTTGGCGTAC TCGAGTTGCT GCGCGGCGGC
GTTACGACGA TTCTCGCGAT GGAGGAAGAT GCCACGGCCA TTGCTCCTGT GCTCGTCGAA
TCCGGAATGC GCGCGCGAAT CGCACTGATG ACGAACGATG TCGATGTGCC GAGCCTGCTG
GAGGGAAAGA CCGCGTTCGA CCCGAATCTT ACTCGACATG CGCTGCGACA AAGCGAGGAG
CTGATCGTTC AATGGCCCGG AGGCGCATCG GAACGGATCG GCGCCATGAT CGGCGCGAAC
ATGATGCTCA GCTGCTCGGA CGCCTTGCTG CGCGGCCTAG GCGAGCTTGC CGCCCGGCAT
GGCGTCGGTG TCACGTGTCA TGTCGGGCTA GGTGACTATG AGGTCGAGTT GAGCCACGCG
CTGTATGGGA AGGGCCCATT CGCGTTGCTG CGGGACGTCG GCTTTCTCGA ATCTGATTGC
GTCGCGGCAC ATTGCCATCA CGTGGCGGAC GACGATCTCG CGATTTTGCG CAAGGCGAAG
GCGGCGGTGG CGCATTGCCC GGTGCTTAAT GCATTGCGCG GCGCCGCCGC GCCAGTCCAG
CGGATGATTG ACGAGGGGCT TTCCGTTGCG CTCGGCCTAG ACAATTATTT CGCAGACTGC
TTCGACGCGA TGCGCATGTA CGTCTCGGTC GCACGCATGC GTGAGCGTGA TGCCACGCTG
TTCCGGTCTC GCGATGCACT CCGGGCGGCG ACGCTAGACG CCGCACGCGC GCTGCACATG
GACGACCAGA TCGGGTCGAT CGAGATCGGG AAAAAGGCAG ATATTCAGCT CATCGACGTG
ACGCGCCCGG GGATCGTGCC CATTATCGAT CCCGTAGGGA CGCTCGTCTA TCACGCTCAT
GCTGGCGATG TCAGCACCGT ATTGATCGAC GGGGTCGTGG TATTGCTCGA CGGGCGTGTC
ATACCCATCG ACGAGCCGAC AGCGCTTGAG GAAGGACAGC GGCTTGGCGC ACTCGCCTGG
CACCGTTTCG CCGCGCACTA TCCCGAGAAG CTTGGCGTAT TGCAATGGAC TCCCGATTCG
ATTCCGAATC TGTCTGCAGC CTGGCGGTCG ACCGCCTCAG GCGCAAATCA TTGA
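
A GC content figure such as the one reported in the Gene Information fields can be recomputed from a sequence printed in this 10-base block layout. The following is a minimal sketch, assuming GC content is the fraction of G and C among the A/C/G/T bases; `gc_content` is a hypothetical helper, and the record does not state how its own value was derived or rounded.

```python
def gc_content(seq: str) -> float:
    """Fraction of G/C among A/C/G/T bases; spaces and newlines are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide bases found")
    return sum(b in "GC" for b in bases) / len(bases)


# First 10-base block of the gene sequence above.
print(f"{gc_content('TTGGGGGCAA'):.0%}")  # prints: 60%
```

Passing the full gene sequence, with its spaces and line breaks left in, gives the gene-wide value.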
 
Protein sequence
MGATTLITGG IVVCGDDAGT VITDGAVLVD GNCIADVGDT AAVATRHPAP DRILDAAGKI 
VAPGFVSSHN HLGFVLFRSM AEDVGATPTP TMFLPMRALV TTHERIVFAR LGVLELLRGG
VTTILAMEED ATAIAPVLVE SGMRARIALM TNDVDVPSLL EGKTAFDPNL TRHALRQSEE
LIVQWPGGAS ERIGAMIGAN MMLSCSDALL RGLGELAARH GVGVTCHVGL GDYEVELSHA
LYGKGPFALL RDVGFLESDC VAAHCHHVAD DDLAILRKAK AAVAHCPVLN ALRGAAAPVQ
RMIDEGLSVA LGLDNYFADC FDAMRMYVSV ARMRERDATL FRSRDALRAA TLDAARALHM
DDQIGSIEIG KKADIQLIDV TRPGIVPIID PVGTLVYHAH AGDVSTVLID GVVVLLDGRV
IPIDEPTALE EGQRLGALAW HRFAAHYPEK LGVLQWTPDS IPNLSAAWRS TASGANH
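
The gene begins with TTG, yet the protein begins with M. Under translation table 11 (bacterial, archaeal and plant plastid code), TTG is one of the alternative initiation codons, and an initiation codon is read as methionine. The sketch below illustrates this with a hand-built codon table; `translate_cds` is a hypothetical helper, not the tool used to produce this record.

```python
BASES = "TCAG"
# Amino acids in TCAG codon order; table 11 shares these assignments
# with the standard code and differs in its set of start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading a first-position start codon as M."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        if pos == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate_cds("TTGGGGGCAA CCACCTTAAT TACGGGCGGT"))  # prints: MGATTLITGG
```

The output matches the first ten residues of the protein sequence listed above.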