Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1710b_3040 |
Symbol | cho |
ID | 3689143 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1710b |
Kingdom | Bacteria |
Replicon accession | NC_007434 |
Strand | - |
Start bp | 3353642 |
End bp | 3354787 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637729495 |
Product | DNA polymerase/helicase |
Protein accession | YP_334417 |
Protein GI | 76808562 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0847] DNA polymerase III, epsilon subunit and related 3'-5' exonucleases |
TIGRFAM ID | [TIGR00573] exonuclease, DNA polymerase III, epsilon subunit family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000000476773 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGCCT CGCCGCCGCC GCATCCCGCG ATCGACACGC CGCTCGCGTT CGTCGACCTC GAAACCACCG GCGGATCGGC CGCCGAGCAT CGCATCACCG AAATCGGCGT TGTCGTCGTG AACGCGAACG GCGTATCGAC ATGGACGACG CTCGTCGATC CGCAGCAGCC GATTCCCCCG TTCATCCAGC AGCTCACGGG TATCACCGAC GCGATGGTGC GCGGCGCGCC GACGTTTGCC GACATTGCGG GCGCATTGTT CGAGCGGCTC GACGGCAAAC TTTTCGTCGC GCACAACGCG AGCTTCGACC GAGGCTTTCT GCGCGCGGAG TTCGAGCGAG CGGGCATCGC ATTCAATCCC GACGTGCTTT GCACGGTGCG GCTGTCGCGC GCGCTTTTCC CGCGCGAGTC GCGCCATGGG CTCGACGCGC TGATCGAGCG GCACGCGCTC GCGCCGTCGG CACGCCACCG GGCGCTCGCC GACGCGGATC TCATCTGGCA GTTCTGGCAA AAGTTGCACG CCGTGATACC GGCCGAGCAA CTGAGCGAGC AGATCGTGCG CACGACGCGC CGGTTCAGGC TCGCGGGGGC GTTGACGGAA GCGCATCTGG AAAGCGCGCC CGCCGGCTGT GGTGTCTACG CGCTGTTCGG CGACGGCGAC GCGCCGCTCT ATGTCGGCCG AAGCGTGCGG GTTCGCCAGC GGCTGCGCGC GCTGCTGACG GGGGAGCGGC GCTCGTCGAA GGAAACACGG CTCGCGCAGC TCGTGCGGCG GGTCGAATGG CGCGAGACGG GCGGCGAGCT CGGCGCGCTG CTTGCCGAGG CGGACTGGAT CGCGTCGCTT GCGCCGTCGT TCAACCGGCG GTCGGACCGC GGCGCGACGG GCGATGCGCA TTGGCCGTTC GGCGGGCCGG TCGCGTTCGA GGAGCGCGGC GAATCGCGTG TTTTTCATGT GATCGATCAG TGGCGCTACG TCGGCGCGGC ATCGTCGATC GAGCGGGCGG CGACGCTCGC GGCCGACGCG CGCGCGGCGG GCGAAGGCGC GCGGAGCGCC GCGCCGGCGG TGCGCCGCAT TCTGCAGACG CATCTCGCGC GCGGGCTTCA ACTGATTCCG ATTCCGCTCG CGGGCGCCGC GCCTGCCGCC GCCTAA
|
Protein sequence | MSASPPPHPA IDTPLAFVDL ETTGGSAAEH RITEIGVVVV NANGVSTWTT LVDPQQPIPP FIQQLTGITD AMVRGAPTFA DIAGALFERL DGKLFVAHNA SFDRGFLRAE FERAGIAFNP DVLCTVRLSR ALFPRESRHG LDALIERHAL APSARHRALA DADLIWQFWQ KLHAVIPAEQ LSEQIVRTTR RFRLAGALTE AHLESAPAGC GVYALFGDGD APLYVGRSVR VRQRLRALLT GERRSSKETR LAQLVRRVEW RETGGELGAL LAEADWIASL APSFNRRSDR GATGDAHWPF GGPVAFEERG ESRVFHVIDQ WRYVGAASSI ERAATLAADA RAAGEGARSA APAVRRILQT HLARGLQLIP IPLAGAAPAA A
|
| |