Gene BURPS1106A_3006 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_3006 
Symbol 
ID4900631 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp2943324 
End bp2944469 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content71% 
IMG OID640136232 
Productexonuclease DNA polymerase III subunit epsilon 
Protein accessionYP_001067249 
Protein GI126455164 
COG category[L] Replication, recombination and repair 
COG ID[COG0847] DNA polymerase III, epsilon subunit and related 3'-5' exonucleases 
TIGRFAM ID[TIGR00573] exonuclease, DNA polymerase III, epsilon subunit family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00547496 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCGCCT CGCCGCCGCC GCATCCCGCG ATCGACACGC CGCTCGCGTT CGTCGACCTC 
GAAACGACCG GCGGATCGGC CGCCGAGCAT CGCATCACCG AAATCGGCGT TGTCGTCGTG
AACGCGAACG GCGTATCGAC ATGGACGACG CTCGTCGATC CGCAGCAGCC GATTCCCCCG
TTCATCCAGC AGCTCACGGG TATCACCGAC GCGATGGTGC GCGGCGCGCC GACGTTTGCC
GACATTGCGG GCGCATTGTT CGAGCGGCTC GACGGCAAAC TTTTCGTCGC GCACAACGCG
AGCTTCGACC GAGGCTTTCT GCGCGCGGAG TTCGAGCGAG CGGGCATCGC ATTCAATCCC
GACGTGCTTT GCACGGTGCG GCTGTCGCGC GCGCTTTTCC CGCGCGAGTC GCGCCATGGG
CTCGACGCGC TGATCGAGCG GCACGCGCTC GCGCCGTCGG CACGCCACCG GGCGCTCGCC
GACGCGGATC TCATCTGGCA GTTCTGGCAA AAGTTGCACG CCGTGATACC GGCCGAGCAA
CTGAGCGAGC AGATCGTGCG CACGACGCGC CGGTTCAGGC TCGCGGGGGC GTTGACGGAA
GCGCATCTGG AAAGCGCGCC CGCCGGCTGT GGCGTCTACG CGCTGTTCGG CGACGGCGAC
GCGCCGCTCT ATGTCGGCCG AAGCGTGCGG GTTCGCCAGC GGCTGCGCGC GCTGCTGACG
GGGGAGCGGC GCTCGTCGAA GGAAACACGG CTCGCGCAGC TCGTGCGGCG GGTCGAATGG
CGCGAGACGG GCGGCGAGCT CGGCGCGCTG CTTGCCGAGG CGGACTGGAT CGCGTCGCTT
GCGCCGTCGT TCAACCGGCG GTCGGACCGC GGCGCGACGG GCGATGCGCA TTGGCCGTTC
GGCGGGCCGG TCGCGTTCGA GGAGCGCGGC GAATCGCGTG TTTTTCATGT GATCGATCAG
TGGCGCTACG TCGGCGCGGC ATCGTCGATC GAGCGGGCGG CGACGCTCGC GGCCGACGCG
CGCGCGGCGG GCGAAGGCGC GCGGAGCGCC GCGCCGGCGG TGCGCCGCAT TCTGCAGACG
CATCTCGCGC GCGGGCTTCA ACTGATTCCG ATTCCGCTCG CGGGCGCCGC GCCTGCCGCC
GCCTAA
 
Protein sequence
MSASPPPHPA IDTPLAFVDL ETTGGSAAEH RITEIGVVVV NANGVSTWTT LVDPQQPIPP 
FIQQLTGITD AMVRGAPTFA DIAGALFERL DGKLFVAHNA SFDRGFLRAE FERAGIAFNP
DVLCTVRLSR ALFPRESRHG LDALIERHAL APSARHRALA DADLIWQFWQ KLHAVIPAEQ
LSEQIVRTTR RFRLAGALTE AHLESAPAGC GVYALFGDGD APLYVGRSVR VRQRLRALLT
GERRSSKETR LAQLVRRVEW RETGGELGAL LAEADWIASL APSFNRRSDR GATGDAHWPF
GGPVAFEERG ESRVFHVIDQ WRYVGAASSI ERAATLAADA RAAGEGARSA APAVRRILQT
HLARGLQLIP IPLAGAAPAA A