Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcep18194_B2218 |
Symbol | |
ID | 3753984 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia sp. 383 |
Kingdom | Bacteria |
Replicon accession | NC_007511 |
Strand | - |
Start bp | 2542152 |
End bp | 2544905 |
Gene Length | 2754 bp |
Protein Length | 917 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637767065 |
Product | DNA polymerase I |
Protein accession | YP_372973 |
Protein GI | 78063065 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTGAAG AACGAAATCT GGAAGGTAAG ACCCTGCTAT TGGTTGACGG TTCGAGCTAT CTGTATCGGG CTTACCATGC GATGCCTGAT TTGCGTGGCC CTGGCGGGGA GCCGACCGGA GCGCTCTACG GAATCATCAA CATGCTGCGC CGTATGCGCA AGGAAGTCAG TGCAGAGTAT AGCGCTTGCG TGTTCGATGC AAAGGGCAAG ACGTTCCGTG ACGACCTTTA TGCCGACTAT AAGGCAAACC GTCCGTCGAT GCCGCCCGAC CTCGCATTGC AGGTCGAACC GATCCACGGC GCGGTGCGCG CGCTCGGCTG GCCGCTGCTG ATGGTCGAAG GCGTCGAGGC CGACGACGTG ATCGGCACGC TCGCGCGCGA AGCCGAGCGG CACGGGATGA ACGTAGTCGT GTCGACCGGC GACAAGGATC TCGCGCAGCT CGTCACCGAC CACGTCACGC TCGTCAACAC GATGACCAAC GAGACGCTCG ACCGCGACGG TGTGATCGCG AAGTTCGGCG TGCCGCCCGA GCGCATCATC GACTACCTTG CACTGATCGG CGACACCGTC GACAACGTGC CGGGCGTCGA GAAGTGCGGG CCGAAGACGG CCGTGAAATG GCTGGCGCAG TACGACAGCC TCGACGGCGT GATCGAGCAT GCGGGCGACA TCAAGGGCGT GGTCGGCGAC AACCTGCGCC GCGCGCTCGA CTTCCTGCCG CTCGGCCGCA CGCTCGTGAC CGTCGAGACG GCTTGCGATC TCACGCCGCA TCTCGAATCG ATCGAAGCGT CGTTGAAGAG CGACGGCGAA GCGCGCGACC TGATGCGCGA CATCTTCGCG CGCTACGGCT TCAAGACCTG GTTGCGCGAA GTCGACAGCG CGCCGGCGGA AGGCGGCGGC GCCGATGCGC CGGAAGGCGA GCCGGCGCCG GTGGTGGCGG CCGACATCGT GCGCGAATAC GACACGATCC AGACCTGGGA GCAATTCGAC GCGTGGTTCG CGAAGATCGA CGCGGCCGCA CTGACCGCAT TCGACACCGA GACGACCTCG CTCGACCCGA TGCTCGCGCG GCTCGTGGGC CTGTCGTTCT CGGTTGAATC GGGCAAGGCC GCGTACCTGC CGGTCGCACA CCGCGGCCCC GACATGCCCG AGCAGCTTCC GCTCGACGAA GTGCTCGCGC GCCTGAAGCC GTGGCTCGAA TCGGCCGATC GCAAGAAGGT CGGCCAGCAC CTGAAGTACG ACGCGCAGGT GCTCGCGAAC TACGACATCG CGCTGAACGG CATCGAGCAC GACACGTTGC TCGAATCGTA CGTCGTCGAG TCACATCGCA CGCACGACAT GGACAGCCTT GCGCTGCGTC ATCTGGGCGT CAAGACGATC AAGTATGAAG ACGTGGCCGG CAAGGGCGCG AAGCAGATCG GTTTCGACGA AGTCGCGCTC ACTCAGGCCG CCGAATACGC GGCCGAAGAT GCGGACATCA CGCTGCAGCT GCATCACGCG CTGTATCCGC AGGTTGCGCG CGAACCGGGC CTCTTGCACG TGTACCGCGA GATCGAGATG CCCGTGTCGC TCGTGCTGCG CAAGATGGAG CGCACGGGCG TGCTGATCGA CGACGTTCGC CTGCAGGCGC AGAGCACCGA AATCGCGACG CGCCTGATCG AGCTCGAAGC GCAGGCGTAC GAACTGGCGG GCGGTGAATT CAATCTCGGC TCGCCGAAGC AGATCGGGCA GATCTTCTTC GAAAAGCTGC AGTTGCCGGT CGTGAAAAAG ACACCGAGCG GCGCGCCGTC GACCGACGAA GAAGTGCTGC AGAAGCTGGC CGAGGACTAC CCGCTGCCGA AGCTGCTGCT CGAGCATCGC GGGCTGTCGA AGCTGAAGTC GACCTATACC GACAAGCTGC CGCGCATGGT GAACCCTTCC ACGGGCCGCG TGCACACGAA CTACGCGCAG GCCGTCGCCG TCACGGGCCG CCTTGCGTCG AACGATCCGA ATCTTCAGAA CATTCCGGTG CGCACGGCCG AGGGCCGGCG GATCCGCGAG GCGTTCATCG CCTCGCCGGG CCACCGTATC GTGTCGGCCG ATTATTCGCA GATCGAACTG CGGATCATGG CGCACATCTC GGGCGACGCG TCGCTGCTGC GTGCGTTCTC GCAGGGCGAG GATATCCACC GCGCAACGGC CGCCGAGGTG TTCGGCGTGA CGCCGCTGGA GGTCAATTCC GACCAGCGCC GGATTGCGAA GGTGATCAAC TTCGGGCTCA TCTACGGGAT GAGCGCGTTC GGGCTCGCGT CGAACCTCGG CATCACGCGC GATGCGGCGA AGCTCTATAT CGACCGCTAT TTCGCCCGTT ATCCGGGCGT CGCGCAGTAC ATGGAAGACA CGCGCTCGGT GGCGAAGGAG AAGGGCTACG TCGAAACCGT TTTCGGTCGC CGCCTGTGGC TGCCGGAGAT CAACGGCGGC AACGGCCCGC GCCGCCAGGC GGCCGAGCGC GCGGCAATCA ATGCGCCGAT GCAGGGCACG GCGGCCGACC TGATCAAGCT GTCGATGATC GCGGTGGACG ACTGGCTCAC GCGCGACAAG CTGGCGTCGC GGATGATCAT GCAGGTGCAC GATGAACTGG TGCTCGAGGT ACCCGACGGC GAACTGTCGC TGGTGCGCGA GAAACTGCCG GAAATGATGT GCGGCGTGGC GAAGCTGAAG GTGCCGCTGG TCGCCGAAGT GGGCGCCGGT GCGAACTGGG AAGAGGCACA CTGA
|
Protein sequence | MPEERNLEGK TLLLVDGSSY LYRAYHAMPD LRGPGGEPTG ALYGIINMLR RMRKEVSAEY SACVFDAKGK TFRDDLYADY KANRPSMPPD LALQVEPIHG AVRALGWPLL MVEGVEADDV IGTLAREAER HGMNVVVSTG DKDLAQLVTD HVTLVNTMTN ETLDRDGVIA KFGVPPERII DYLALIGDTV DNVPGVEKCG PKTAVKWLAQ YDSLDGVIEH AGDIKGVVGD NLRRALDFLP LGRTLVTVET ACDLTPHLES IEASLKSDGE ARDLMRDIFA RYGFKTWLRE VDSAPAEGGG ADAPEGEPAP VVAADIVREY DTIQTWEQFD AWFAKIDAAA LTAFDTETTS LDPMLARLVG LSFSVESGKA AYLPVAHRGP DMPEQLPLDE VLARLKPWLE SADRKKVGQH LKYDAQVLAN YDIALNGIEH DTLLESYVVE SHRTHDMDSL ALRHLGVKTI KYEDVAGKGA KQIGFDEVAL TQAAEYAAED ADITLQLHHA LYPQVAREPG LLHVYREIEM PVSLVLRKME RTGVLIDDVR LQAQSTEIAT RLIELEAQAY ELAGGEFNLG SPKQIGQIFF EKLQLPVVKK TPSGAPSTDE EVLQKLAEDY PLPKLLLEHR GLSKLKSTYT DKLPRMVNPS TGRVHTNYAQ AVAVTGRLAS NDPNLQNIPV RTAEGRRIRE AFIASPGHRI VSADYSQIEL RIMAHISGDA SLLRAFSQGE DIHRATAAEV FGVTPLEVNS DQRRIAKVIN FGLIYGMSAF GLASNLGITR DAAKLYIDRY FARYPGVAQY MEDTRSVAKE KGYVETVFGR RLWLPEINGG NGPRRQAAER AAINAPMQGT AADLIKLSMI AVDDWLTRDK LASRMIMQVH DELVLEVPDG ELSLVREKLP EMMCGVAKLK VPLVAEVGAG ANWEEAH
|
| |