Gene BURPS1106A_3112 details

Gene Information

Locus tag: BURPS1106A_3112
Symbol: ureC
ID: 4902574
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 3031725
End bp: 3033431
Gene Length: 1707 bp
Protein Length: 568 aa
Translation table: 11
GC content: 68%
IMG OID: 640136338
Product: urease subunit alpha
Protein accession: YP_001067350
Protein GI: 126452548
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0804] Urea amidohydrolase (urease) alpha subunit
TIGRFAM ID: [TIGR01792] urease, alpha subunit
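
The coordinates and lengths listed above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below is only an illustration: the variable names are hypothetical, and it assumes inclusive, 1-based coordinates and a single terminal stop codon that is not counted in the protein length.

```python
# Values copied from the gene record; variable names are illustrative only.
start_bp = 3031725
end_bp = 3033431

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1707
print(protein_length)  # 568
```

Both results match the Gene Length and Protein Length fields of the record.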


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACATTAC GTTTGAGCCG CCGCGCGTAC GCGGAAATGT TCGGGCCGAC GACGGGCGAC 
CGCATCCGCC TCGCGGACAC CGAGCTGCTG ATCGAAGTCG AGCGCGACCA CACGCTCTAC
GGCGAGGAAG TGAAGTTCGG CGGCGGCAAG GTGATCCGCG ACGGCATGGG CCAATCGCAG
CTCCCCGCGG CCGACGTGGC GGACACCGTG ATCACGAACG CGGTGATTCT CGATCACTGG
GGCATCGTGA AGGCGGACAT CGCGATCAAG CACGGCCGCA TCGCGGCGAT CGGCAAGGCG
GGCAATCCGG ACATCCAGCC GGGCGTGACG ATCGCGATCG GCGCGGCGAC CGAGATCATC
GCGGGCGAAG GCCTGATCGT GACGGCGGGC GGCATCGATA CGCACATCCA CTTCATCAGC
CCGCAGCAGA TCGACGAAGC GCTCGCATCG GGCGTGACGA CGATGATCGG CGGCGGCACG
GGCCCCGCGA CCGGCACCAA CGCGACGACC TGCACGCCGG GGCCGTGGCA CATGGAGCGG
ATGCTGCAGG CGGCCGACGG CTGGCCGATC AATCTCGGCT TTCTCGGCAA GGGCAACGCG
AGCCGGCCGC AGCCGCTCGT CGAGCAGATC GAGGCGGGCG CGATCGGCCT GAAGCTGCAC
GAGGATTGGG GCACGACGCC CGCCGCGATC GACAACTGCC TGACGGTGGC CGACGACACC
GACACGCAGG TCGCGATCCA CACCGATACG CTGAACGAGG CCGGCTTCGT CGAGGCGACG
GTCGCCGCGT TCAAGGGCCG CACGATCCAC ACGTACCACA CCGAGGGCGC GGGCGGCGGC
CATGCGCCCG ACATCCTGAA GGTGTGCGGC GAGGCGAACG TGCTGCCTTC ATCGACGAAC
CCGACGCGCC CGTACACGAT CAACACGCTC GACGAACACC TCGACATGCT GATGGTCTGC
CATCACCTCG ATCCGTCGAT CGCCGAGGAT CTCGCGTTCG CCGAATCGCG GATTCGCCGC
GAGACGATCG CGGCCGAGGA CATCCTGCAC GACCTCGGCG CGCTGTCGAT GCTGTCGTCC
GATTCGCAGG CGATGGGCCG CGTCGGCGAA GTGATCATCC GCACGTGGCA GACCGCGCAC
AAGATGAAGG TGCAGCGCGG CGCGCTCACC GGCGACGGCG CGCGCAACGA CAACTTCCGC
GCGAAGCGCT ACGTCGCGAA ATACACGATC AATCCGGCGC TCACGCACGG CATCGCGCAC
GAGGTCGGCT CGATCGAGCC GGGCAAATGG GCGGACCTCG TGCTGTGGGA GCCCGCGTTC
TTCGGGGTCA AGCCGGCGAT GATCGTCAAG GGCGGCATGA TCGCCGTCGC GCAGATGGGC
GATCCGAATG CGTCGATCCC GACGCCGCAG CCCGTGCATT ACCGCGAGAT GTTCGCCACC
CGCGGCGGCG CGCTCGCGCG CACGTCGCTC ACGTTCGTGT CGCAGCTCGC GCTCGATGCG
GGCATCAGCG CGCGCTACGG GCTCGCGAAG CGGCTCGTGC CGGTGCGCGG CTGCCGCACG
GTGACCAAGC GCGACATGAT CCACAACGCA TGGCAACCGG CCATCCGCGT CGACCCCGAA
ACCTACGACG TCGTCGCCGA CGGCGCGCTG CTCACCTGCG AGCCCGCCGC CGTGCTGCCG
ATGGCGCAAC GCTACTTCCT GTTCTGA
 
Protein sequence
MTLRLSRRAY AEMFGPTTGD RIRLADTELL IEVERDHTLY GEEVKFGGGK VIRDGMGQSQ 
LPAADVADTV ITNAVILDHW GIVKADIAIK HGRIAAIGKA GNPDIQPGVT IAIGAATEII
AGEGLIVTAG GIDTHIHFIS PQQIDEALAS GVTTMIGGGT GPATGTNATT CTPGPWHMER
MLQAADGWPI NLGFLGKGNA SRPQPLVEQI EAGAIGLKLH EDWGTTPAAI DNCLTVADDT
DTQVAIHTDT LNEAGFVEAT VAAFKGRTIH TYHTEGAGGG HAPDILKVCG EANVLPSSTN
PTRPYTINTL DEHLDMLMVC HHLDPSIAED LAFAESRIRR ETIAAEDILH DLGALSMLSS
DSQAMGRVGE VIIRTWQTAH KMKVQRGALT GDGARNDNFR AKRYVAKYTI NPALTHGIAH
EVGSIEPGKW ADLVLWEPAF FGVKPAMIVK GGMIAVAQMG DPNASIPTPQ PVHYREMFAT
RGGALARTSL TFVSQLALDA GISARYGLAK RLVPVRGCRT VTKRDMIHNA WQPAIRVDPE
TYDVVADGAL LTCEPAAVLP MAQRYFLF
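
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The following is a minimal sketch, not the database's own method: `translate` is a hypothetical helper, and the codon table is the standard one, used here on the understanding that translation table 11 assigns the same amino acids to sense codons and differs only in the start codons it permits. Only the first three and last three 10-base blocks of the gene sequence are used.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, ignoring whitespace, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence.
first_block = "ATGACATTAC GTTTGAGCCG CCGCGCGTAC"
# Last 27 bases; these start in frame because the preceding 1680 bases are a multiple of 3.
last_block = "ATGGCGCAAC GCTACTTCCT GTTCTGA"

print(translate(first_block))  # MTLRLSRRAY
print(translate(last_block))   # MAQRYFLF
```

The two outputs agree with the first ten and last eight residues of the protein sequence listed above.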