Gene BURPS1106A_2040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_2040 
Symbol 
ID4900535 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp2022135 
End bp2023520 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content69% 
IMG OID640135270 
Productputative D-hydantoinase 
Protein accessionYP_001066305 
Protein GI126453113 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.417545 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGATT TCGAGCAGGT GGTGCGCGGC CGGCTGGTTG ACGCGCAGCA GATTGTCGAG 
GACGGCTGGC TCGCAATTCG GGGCGGCCGG ATCGCGGCGC GCGGCGCGGG GGCGCCTCCG
GCGGCGCGCG ACTTGATCGA CGCGCGCGGG CAGTGGGTGC TGCCGGGTGT CGTCGACGGC
CAGGTGCACG CGGGCAGCCA GGCGAACCAC GAAGGGCTCG GGCGTGCGTC GCGCGCGGCG
GCGGCGGGCG GAGTGACCGT GATGGTCGAC ATGCCGTACG ACGATCCGGA ACCTGTCGCG
TCGCGGGCGC AGCTCGATCG CAAGATCGCG GAAGCCGAGC GCGATTGCCA CGTCGACATC
GCGCTGTACG GCACGCTCAA CGCAAAGCAC GGCCTCGACG CGGCGGCCGG GCTGATCGAC
GGCGGCGTCT GCGCGTTCAA GTTCTCGATG TTCGAGGCGA CGCCCGGCCG GTTCCCGCGT
GTCGACGAGG ACGTGTTGTA CGACGCATTC CGGCTGGTCG CCCCGTCGGG CCTCGCGTGC
GGCGTGCACA ACCAGATGCA GGACCTCACG CGCAAGAATA TCGCGCGGAT GATCGAGGCC
GGCGACACGG GCTGGGATGC GTTCCTGCGC GCGCATCCAC CGCTGATCGA GAACCTGGCG
ACCGCGCTGA TCTACGAGAT CGGCGCGGAG ACGGGCGCCC GCGCGCACGC GGTGCACGTG
TCGACCTCGC GCGGCTTCGA GCTGTGCAAC ATGTTCCGGC GCGCCGGCCA TCACGCGAGC
ATCGAAACCT GCGTGCAGTA CCTGATGCTC GATCACGAAA CGCATACGAA ACGCTTCGGC
GCGAAGACGA AGCACTACCC GCCGATTCGC CCGCGCGCGG AGCAGGAATT GCTGTGGACG
CATGTCGCGC GCGGCGAGTG CACGTTCGTG TCGTCGGATC ACGTGAGCTG GGAGCTCAAA
CGCAAGGGGG ACGCCAACGT GTTCCGCAAC GCGTCGGGCG GTCCGGGGCT CGAAACGTTG
CTGCCGGCGT TCTGGACCGG CTGCGAGCAG CATGGCATCG CGCCGACGCG GGTCGCCGAG
CTGCTGGCGA CGAATCCGGC GCGGCACTTC CTGCTCGACG ATCGCAAGGG GTCGCTCGAC
GTCGGCGCCG ACGCGGATTT CGTGATCCTC ACGCCCGAAC GCTACGCGTT CGATCCGTCG
TGCAGCCTGT CGGCCGTGCA GTGGAGCGCG TTCGAGGGCA TGGAATTCGC GGTGCGCATC
GCCGCCACAT ATTGTCGCGG CGCGCTCGTG TACGACGGCG CACGCATCGT CAATCCGGCG
GGCTCGGGCC GCTTCCTGAA GCCGCATGGC AGCCGGCCGA TCGTCACGCA ACCGGAGCGC
GCATGA
 
Protein sequence
MSDFEQVVRG RLVDAQQIVE DGWLAIRGGR IAARGAGAPP AARDLIDARG QWVLPGVVDG 
QVHAGSQANH EGLGRASRAA AAGGVTVMVD MPYDDPEPVA SRAQLDRKIA EAERDCHVDI
ALYGTLNAKH GLDAAAGLID GGVCAFKFSM FEATPGRFPR VDEDVLYDAF RLVAPSGLAC
GVHNQMQDLT RKNIARMIEA GDTGWDAFLR AHPPLIENLA TALIYEIGAE TGARAHAVHV
STSRGFELCN MFRRAGHHAS IETCVQYLML DHETHTKRFG AKTKHYPPIR PRAEQELLWT
HVARGECTFV SSDHVSWELK RKGDANVFRN ASGGPGLETL LPAFWTGCEQ HGIAPTRVAE
LLATNPARHF LLDDRKGSLD VGADADFVIL TPERYAFDPS CSLSAVQWSA FEGMEFAVRI
AATYCRGALV YDGARIVNPA GSGRFLKPHG SRPIVTQPER A