Gene Bcen_4038 details

Gene Information

Locus tag: Bcen_4038
Symbol:
ID: 4096144
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia cenocepacia AU 1054
Kingdom: Bacteria
Replicon accession: NC_008061
Strand:
Start bp: 1207098
End bp: 1208339
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 66%
IMG OID: 638017332
Product: N-isopropylammelide isopropylaminohydrolase
Protein accession: YP_623900
Protein GI: 107026389
COG category: [F] Nucleotide transport and metabolism
              [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID:
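
The coordinates and lengths listed above are consistent with each other, which a little arithmetic confirms. The following is a minimal sketch in Python, assuming 1-based inclusive coordinates (a common convention for such records, not stated here) and a CDS that ends in a single stop codon not counted in the protein length. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS that ends in a single stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(1207098, 1208339)
print(length_bp)                  # 1242
print(protein_length(length_bp))  # 413
```

Both values match the Gene Length and Protein Length fields of the record.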


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACCTGT TCAACGTACG TCTGCGCGGC CGCGACGGCC TGTTCACGAT CGGCGTCGAT 
GCCGGCAAGA TCGCGCGGAT CGATGCGCAA ACCGCGCCGA TCGCGTCGAC GAACCCCGAC
CATATCGACG GCGGCGGCCG TCTCGCGATT GCGCCGCTCG TCGAGCCGCA TATCCACCTC
GATGCCGTGC TGACGGCCGG CGAACCCGCG TGGAACATGA GCGGCACGCT GTTCGAAGGC
ATCGAGCGCT GGGCCGAGCG CAAGGCGACG ATCACGCACG AGGACACCAA AGCGCGCGCG
CATGCGGCGA TCGGCATGCT GCGCGACCAC GGGATCCAGC ACGTGCGCAC GCACGTCGAC
GTGACCGATC CGACGTTGGC CGCGCTGAAG GCGATGCTGG AAGTGAAGGA CGAGGCGCGC
GGGCTGATCG ATCTGCAGAT CGTCGCGTTC CCGCAGGAGG GCATCGAATC GTTCGACGGC
GGCCGCGCGC TGATGGAGCA GGCGATCGAG CTCGGCGCGG ACGTCGTCGG CGGAATTCCG
CATTTCGAGA ACACGCGGGA GCAGGGCGTC AGCTCGATCC GCTTTCTGAT GGATCTCGCG
GAACGCACGG GCTGCCTCGT CGACGTGCAC TGCGACGAAA CCGACGATCC GCATTCGCGC
TTTCTCGAAG TGCTCGCGGA AGAAGCGCGC GTGCGCGGCA TGGGCGCGCG GGTCACCGCG
AGCCACACGA CCGCGATGGG TTCGTACGAC AACGCGTACT GCTCGAAGCT GTTCCGGCTG
CTGAAGCGGG CGGGACTGAA CTTCATCTCG TGCCCGACCG AGAGCATTCA CCTGCAAGGG
CGCTTCGACA CGTTTCCGAA GCGGCGCGGC GTCACGCGCG TCGCGGAACT CGACCGGGCC
GGCATCAACG TGTGCTTCGG GCAGGATTCG ATCAAGGACC CGTGGTATCC GCTCGGCAAC
GGCAACATCC TGCGCGTGCT CGATGCGGGC CTTCATATCT GTCACATGAT GGGTTACCAG
GACCTGCAGC GCTGCCTCGA CTTCGTGACC GACCACAGCG CGACGACGAT GCATCTCGGC
GAGGGCTACG GCATCGAGAT CGGGCGTCCG GCGAATCTCG TCGTGCTCGA CGCGGACAGC
GATTACGAAG CCGTACGCCG GCAGGCGAAG GCCACGCTGT CGATGCGCCA CGGGAAGGTC
ATCATGCGGC GTGAGCCGGA GCGCATCACG TATCCGGATT GA
 
Protein sequence
MNLFNVRLRG RDGLFTIGVD AGKIARIDAQ TAPIASTNPD HIDGGGRLAI APLVEPHIHL 
DAVLTAGEPA WNMSGTLFEG IERWAERKAT ITHEDTKARA HAAIGMLRDH GIQHVRTHVD
VTDPTLAALK AMLEVKDEAR GLIDLQIVAF PQEGIESFDG GRALMEQAIE LGADVVGGIP
HFENTREQGV SSIRFLMDLA ERTGCLVDVH CDETDDPHSR FLEVLAEEAR VRGMGARVTA
SHTTAMGSYD NAYCSKLFRL LKRAGLNFIS CPTESIHLQG RFDTFPKRRG VTRVAELDRA
GINVCFGQDS IKDPWYPLGN GNILRVLDAG LHICHMMGYQ DLQRCLDFVT DHSATTMHLG
EGYGIEIGRP ANLVVLDADS DYEAVRRQAK ATLSMRHGKV IMRREPERIT YPD
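
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch in Python, assuming the standard codon assignments (translation table 11 uses the same assignments for elongation and differs from the standard table only in the permitted start codons). Only the first and last blocks of the gene sequence are used, to keep the example short. `clean`, `translate`, and `gc_fraction` are hypothetical helper names, not part of any database tooling.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


first_block = "ATGAACCTGT TCAACGTACG TCTGCGCGGC"
last_block = "ATCATGCGGC GTGAGCCGGA GCGCATCACG TATCCGGATT GA"
print(translate(first_block))  # MNLFNVRLRG
print(translate(last_block))   # IMRREPERITYPD
```

The two outputs match the start and end of the protein sequence above. Passing the full gene sequence to `gc_fraction` should give a value comparable to the listed GC content, though that has not been verified here.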