Gene BURPS668_2940 details


Gene Information

Locus tag: BURPS668_2940
Symbol:
ID: 4881675
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 2896109
End bp: 2897254
Gene Length: 1146 bp
Protein Length: 381 aa
Translation table: 11
GC content: 71%
IMG OID: 640128868
Product: exonuclease DNA polymerase III subunit epsilon
Protein accession: YP_001059957
Protein GI: 126438667
COG category: [L] Replication, recombination and repair
COG ID: [COG0847] DNA polymerase III, epsilon subunit and related 3'-5' exonucleases
TIGRFAM ID: [TIGR00573] exonuclease, DNA polymerase III, epsilon subunit family
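
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon (the gene sequence in this record ends in TAA). The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
length = gene_length(2896109, 2897254)
print(length, protein_length(length))  # prints "1146 381"
```

Both results agree with the Gene Length and Protein Length fields of the record.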


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000147558
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCGCCT CGCCGCCGCC GCATCCCGCG ATCGACACGC CGCTCGCGTT CGTCGACCTC 
GAAACCACCG GCGGATCGGC CGCCGAGCAT CGCATCACCG AAATCGGCGT TGTCGTCGTG
AACGCGAACG GCGTATCGAC ATGGACGACG CTCGTCGATC CGCAGCAGCC GATTCCCCCG
TTCATCCAGC AGCTCACGGG TATCACCGAC GCGATGGTGC GCGGCGCGCC GACGTTTGCC
GACATTGCGG GCGCATTGTT CGAGCGGCTC GACGGCAAAC TTTTCGTCGC GCACAACGCG
AGCTTCGACC GAGGCTTTCT GCGCGCGGAG TTCGAGCGAG CGGGCATCGC ATTCAATCCC
GACGTGCTTT GCACGGTGCG GCTGTCGCGC GCGCTTTTCC CGCGCGAGTC GCGCCATGGG
CTCGACGCGC TGATCGAGCG GCACGCGCTC GCGCCGTCGG CACGCCACCG GGCGCTCGCC
GACGCGGATC TCATCTGGCA GTTCTGGCAA AAGTTGCACG CCGTGATACC GGCCGAGCAA
CTGAGCGAGC AGATCGTGCG CACGACGCGC CGGTTCAGGC TCGCGGGGGC GTTGACGGAA
GCGCATCTGG AAAGCGCGCC CGCCGGCTGT GGCGTCTACG CGCTGTTCGG CGACGGCGAC
GCGCCGCTCT ATGTCGGCCG AAGCGTGCGG GTTCGCCAGC GGCTGCGCGC GCTGCTGACG
GGGGAGCGGC GCTCGTCGAA GGAAACACGG CTCGCGCAGC TCGTGCGGCG GGTCGAATGG
CGCGAGACGG GCGGCGAGCT CGGCGCGCTG CTTGCCGAGG CGGACTGGAT CGCGTCGCTT
GCGCCGTCGT TCAACCGGCG GTCGGACCGC AGCGCGACGG GCGATGCGCA TTGGCCGTTC
GGCGGGCCGG TCGCGTTCGA GGAGCGCGGC GAATCGCGTG TTTTTCATGT GATCGATCAG
TGGCGCTACG TCGGCGCGGC ATCGTCGATC GAGCGGGCGG CGACGCTCGC GGCCGACGCG
CGCGCGGCGG GCGAAGGCGC GGGGAGCGCC GCGCCGGCGG TGCGCCGCAT TCTGCAGACG
CATCTCGCGC GCGGGCTTCA ACTGATTCCG ATTCCGCTCG CGGGCGCCGC GCCTGCCGCC
GCCTAA
 
Protein sequence
MSASPPPHPA IDTPLAFVDL ETTGGSAAEH RITEIGVVVV NANGVSTWTT LVDPQQPIPP 
FIQQLTGITD AMVRGAPTFA DIAGALFERL DGKLFVAHNA SFDRGFLRAE FERAGIAFNP
DVLCTVRLSR ALFPRESRHG LDALIERHAL APSARHRALA DADLIWQFWQ KLHAVIPAEQ
LSEQIVRTTR RFRLAGALTE AHLESAPAGC GVYALFGDGD APLYVGRSVR VRQRLRALLT
GERRSSKETR LAQLVRRVEW RETGGELGAL LAEADWIASL APSFNRRSDR SATGDAHWPF
GGPVAFEERG ESRVFHVIDQ WRYVGAASSI ERAATLAADA RAAGEGAGSA APAVRRILQT
HLARGLQLIP IPLAGAAPAA A
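
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how the gene sequence relates to the protein sequence, the code below strips the block spacing, computes GC content, and translates codons. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in which codons may act as start codons). The helper names `clean`, `gc_content`, and `translate` are hypothetical, and only the first line of the gene sequence is used as input.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...); "*" marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq_text: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq_text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence from the record above.
first_line = clean(
    "ATGTCCGCCT CGCCGCCGCC GCATCCCGCG ATCGACACGC CGCTCGCGTT CGTCGACCTC"
)
print(translate(first_line))  # prints "MSASPPPHPAIDTPLAFVDL"
```

The printed residues match the first two blocks of the protein sequence in the record. Passing the full cleaned gene sequence to `gc_content` would give the value to compare with the GC content field.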