Gene Bphy_1389 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBphy_1389 
Symbol 
ID6242880 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia phymatum STM815 
KingdomBacteria 
Replicon accessionNC_010622 
Strand
Start bp1561457 
End bp1562479 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content66% 
IMG OID642593172 
ProductHhH-GPD family protein 
Protein accessionYP_001857617 
Protein GI186476147 
COG category[L] Replication, recombination and repair 
COG ID[COG0122] 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.844993 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.0527923 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAACGG CCACGAAGAC GCCGGCTAAA CGGGCCGCGT CTCAATCTGA CGCGGCATCT 
AAAGTAAGGG CTGTGCGTGC GCGAAGCGGC GTCAGCGCGA AGGGCGCTGC GAAGGTGGCA
GTGAAATCGG CGTCTGCGAA GGGTGCCAAG GCGGGTGCGC TGGCGAGGCC CGCGTCGGTG
CGTTCTAAGG GCAACGGCGC CGAGGCGCCC GAGGCGGCGG TGAAGCCGTC GCGCGCCCGG
ACTGCGCACT TGAAAGGCAA CGGCTCGCTG CCTGCCGAAC TGGCGGGGGA CCTGCGGGAG
CTCGCACACG ACGCGCAGGA AGCGGAGCCG GTGCGCAAGC CGCGCGCGGC GGTGGCGCCT
GCCGAAAGCG CGGACGCGGT GCAGGCCGCG CAGCCGTCTG GCGTCACGCG TCCCGCGTAT
TGGGATAAGG CGTGCGCCGA TCTCGTCAAA CGCGACCGCA TTCTCAAGAA GCTGATTCCG
AAATTCGGCC CGGTGCATCT ATCGAGCCGC GCCGATCCGT TCGTCACGCT CGCGCGTTCG
GTGATTGGTC AACAGATTTC CGTGGCCTCT GCCCAGGCGA TGTGGCAGCG TATCGTCGCG
GCATGTCCGA AGCTCGCGCC GCAGCAGATC ATCAAGCTCG GTCAGGATGA TCTGATGGGC
TGCGGCGTGT CGAAGCGCAA GGCCGAGTAC ATTCTCGATC TCGCGCATCA CTTCGTCTCG
GGCGCTTTAC ACGTCGGCAA ATGGACGTCG ATGGAAGACG AGGACGTGAT CGCCGAGTTG
ACGCAGATCC GCGGCATCAG CCGCTGGACA GCGGAGATGT TCCTGATATT CGACCTGTCG
CGTCCGGACG TTCTGCCGCT CGACGATCCG AACCTGATCC ACGCAATCAG TCAGAACTAT
TTCAGCGGGG AACCGGTGAC ACGCAGCGAG GCGCGGGAAG TCGCTGCGAA CTGGGAGCCG
TGGCGCACCG TCGCGACCTG GTATATGTGG CGCAGCCTAG ACCCAGCGCC TGCCGGCAAC
TGA
 
Protein sequence
MATATKTPAK RAASQSDAAS KVRAVRARSG VSAKGAAKVA VKSASAKGAK AGALARPASV 
RSKGNGAEAP EAAVKPSRAR TAHLKGNGSL PAELAGDLRE LAHDAQEAEP VRKPRAAVAP
AESADAVQAA QPSGVTRPAY WDKACADLVK RDRILKKLIP KFGPVHLSSR ADPFVTLARS
VIGQQISVAS AQAMWQRIVA ACPKLAPQQI IKLGQDDLMG CGVSKRKAEY ILDLAHHFVS
GALHVGKWTS MEDEDVIAEL TQIRGISRWT AEMFLIFDLS RPDVLPLDDP NLIHAISQNY
FSGEPVTRSE AREVAANWEP WRTVATWYMW RSLDPAPAGN