Gene Bind_2744 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_2744 
Symbol 
ID6200686 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp3127207 
End bp3130338 
Gene Length3132 bp 
Protein Length1043 aa 
Translation table11 
GC content59% 
IMG OID641706693 
ProductDNA polymerase I 
Protein accessionYP_001833805 
Protein GI182679659 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.560241 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGGCT CCGTGTGCGG TCGGGCATTG GCTCCGCTGC GAAACTTTGC CGAAGATCAT 
ATCATGACCC TTGTTCATCG TCCGATCGAA GCCGGCGACC GAGTGTTCCT AGTCGACGGT
TCTTCCTTTG TCTTTCGGGC CTATTTTCAG TCGATCCGGC AAGACGCCAA ATATAATTAT
CGGTCTGACC GTCTGCCGAC TGGCGCGGTG CGCCTGTTTT GCACCAAATT GTTCCAGTTC
ATCCGCGAGG GCGCGGCGGA CCTGAAGCCG ACGCATCTGG CGATCATTTT CGATAAGTCG
GAAAATTCCT TTCGCAAGGA GATTTATCCG CCCTACAAGA GCAATCGCTC GGAGCCGCCG
GAAGATCTCA TCCCGCAATT TCCGTTGATG CGCGCTTCCG TCCGCGCTTT CGGCCTGCAG
CCGGTCGAGC AGGATCGTTA TGAAGCGGAT GACCTGATCG CCACTTATGC GACGCAGGCC
CATGCTCTCG GCGCTGAGGT TCTGATCATT TCCGCTGACA AGGATCTGAT GCAACTGATC
GGGCCGGGCG TTTCCATGTA TGATCCTGCC TCCGGCGAGG CTGGCGCCAA AGGAGCACGG
CAGGAACGGC GGATCGGGCA AGAGGAGGTC GTCGCCTATT TCGGTGTCGG CCCTGACAAA
GTCATCGATG TCCAGGCGTT GGCTGGCGAT TCGACCGATA ATGTTCCAGG CGCGCGCGGC
ATCGGCATCA AGACGGCGGC GCAGCTCATT AATGATTATG GTGATCTCGA TAATCTTTTG
AAGCTCGCCC GAGAGATCAA GCAACCGAAA CGGCGCGAGA TCCTCACTGA ACCGGAAAGC
GTCGGGCTGA TCGAAATCTC CCGCAAGCTT GTCACTCTGG TTCGTGACGT GCCGGTGGAA
ATCCCGCTCG CCGATCTTGG CTTGCATGCG CCGGAACCGC ACAAGCTCGT CGCTTATTTG
AAGGCGCTGG AATTCACCAC TCTGACCAAA CGCGTGGCTG AAGCCTATGG CGTCGATCTT
GCTGCGGTCG AGCCCGATCC GGCCTTTTGC GGCGCGGCCG GCTGGCGCGG CCGCAATGGC
GAGGTGCTCG TTGAAGCCGA GCCAAAAGCG GGTGAGGGGG CGACGGAACC AGCCGGTGCT
GAAGCTTCGG GAGAAGTCGC GACAAGAAGC GGGCCGAAAC GCAATGCGCG TTATGGTGAG
CCTGTTGAAA AAGCAGCTTT GGGCAGCGGA CCGGGTCTCT TCGCCTCCGC CCGCGCCGCC
GAGGGACGTG CTGGGACATT CGAGCACTCG GCCTATCAGA CGATCGTCAC TCTTGAGGCT
CTCGATGCTG TAATCGCGGC GGCTTATGAA GCGGGCCTGC TTGCCATCGA TACCGAAACC
ACCTCGCTCG ATCCCATGCT TGCGGATCTG GCTGGTCTTT CTCTCTGCGC GGAACCGGGA
AAGGCCTTCT ATGTTCCAGT CGGCCATACG AGCGGCGAAG GGCAGGATTT GTTTGAAGGT
AAGGGCCTGC TGCCGGGGCA ATTGCCGATC GGCGTGGTGC TGGAGCGCCT GAAGCCGCTG
CTTGAAGATC CAAGCATCCT CAAGATCGGA CAGAACGTCA AATATGATTG GATCGTGCTG
AAACAGCACG GCGTCGAGAT GCGGCCCTTG GACGATACGC TGCTGCTCTC TTATGTGCTC
GACGCTGGTC TGACCGACCA TGGCATGGAT GTCTTGTCCG AGAAACATCT TGGTCATAAG
CCAATCCCCT TCAGCGCGGT TGCCGGCAGC GGCCGGACTT TCATCGGCTT TGCCCGGGTC
GCGATCGACA AGGCGACGGA ATATGCGGCG GAGGACGCCG ATGTCACCTT GCGGCTCTGG
CGTGTTCTGA AACCGCGTCT GCCTGCGGAA AAGATGACCA CGGTCTATGA GACCTTGGAA
AGGCCGATGA TTGACGTCTT GGCCCGCATG GAGCGCCGGG GCGTTGCGAT TGATCCGCCG
TTGCTTGGCC GGCTTTCAAG CGATTTCGCG CAGGATATGG CGCGCTACGA GGCGGAGATT
TACGAACTCG CCGGGGAAAA ATTCAATCTC GGCTCCCCCA AGCAATTGGG GGATATTCTG
TTCGGCAAGC TTGGCCTGCC GGGCGCCAAG AAGACCGCGA CGGGAGCCTG GTCCACCGCC
GCCGGCGTTC TTGAGGATCT TGCCGAGCAG GGCGTCGACC TTGCCGCCCG CATTCTCGAC
TGGCGGCAGT TGGCCAAGCT CAAATCCACG TACACGGATG CGCTGCCAAA CTATATCAAT
CCTTCGAGCG GGCGCGTGCA TACATCCTAT GCTTTGGCCG CGACAACCAC CGGGCGGCTT
TCGTCCTCGG AGCCCAATCT GCAAAACATC CCGGTCCGCA ATGAGGCAGG ACGCAAGATT
CGGCGCGCTT TTATCGCCCC GCCAGGGCGT AAATTGGTTT CAGCCGATTA CAGCCAGATC
GAATTGCGGC TGCTCGCCCA TATCGCCGAT ATCAAGTCGC TGAAAAACGC CTTTGCTGAA
AACCTCGATA TTCATGCGAT GACCGCCTCG GAAATGTTCG GGGTTCCGGT CGAAGGAATG
CCGAGCGAGA TCCGGCGCCG TGCCAAGGCC ATCAATTTCG GCATCATCTA TGGTATTTCC
GCTTTCGGCC TCGCCAATCA GCTCGCCATC CCCCGTGAGG AAGCGGCCGC TTATATCAAG
AAATATTTTG AGCGTTTCCC CGGCATCCGG GCCTATATGG ACGACACCAA GAAGAAAGCG
CGTGAGAATG GCTATGTCGA AACGATTTTC GGCCGCAAAT GCCATTATCC GCGCATCAAG
GCCTCGAATC CTTCAGAACG CGCTTTGAAT GAGCGCGCGG CGATCAATGC GCCCATTCAA
GGCTCGGCGG CTGATATTAT CCGCCGCGCG ATGATCCATA TGGAAGCCGC GCTCGCGGCG
GAAAAATTAT CCGCGCAAAT GCTGCTGCAA GTGCATGACG AATTGATTTT CGAAGTGCCG
GACGATGAAG TCGAGGCGAC CATTGCGGTT GTGCGCAAGA TCATGGTCGA GGCGCCCTTG
CCGGCGGTCC ACCTGTCCGT GCCCTTGCAG GTTGATGCCA AGGCCGCCCA GAATTGGGAT
GAGGCCCATT GA
 
Protein sequence
MLGSVCGRAL APLRNFAEDH IMTLVHRPIE AGDRVFLVDG SSFVFRAYFQ SIRQDAKYNY 
RSDRLPTGAV RLFCTKLFQF IREGAADLKP THLAIIFDKS ENSFRKEIYP PYKSNRSEPP
EDLIPQFPLM RASVRAFGLQ PVEQDRYEAD DLIATYATQA HALGAEVLII SADKDLMQLI
GPGVSMYDPA SGEAGAKGAR QERRIGQEEV VAYFGVGPDK VIDVQALAGD STDNVPGARG
IGIKTAAQLI NDYGDLDNLL KLAREIKQPK RREILTEPES VGLIEISRKL VTLVRDVPVE
IPLADLGLHA PEPHKLVAYL KALEFTTLTK RVAEAYGVDL AAVEPDPAFC GAAGWRGRNG
EVLVEAEPKA GEGATEPAGA EASGEVATRS GPKRNARYGE PVEKAALGSG PGLFASARAA
EGRAGTFEHS AYQTIVTLEA LDAVIAAAYE AGLLAIDTET TSLDPMLADL AGLSLCAEPG
KAFYVPVGHT SGEGQDLFEG KGLLPGQLPI GVVLERLKPL LEDPSILKIG QNVKYDWIVL
KQHGVEMRPL DDTLLLSYVL DAGLTDHGMD VLSEKHLGHK PIPFSAVAGS GRTFIGFARV
AIDKATEYAA EDADVTLRLW RVLKPRLPAE KMTTVYETLE RPMIDVLARM ERRGVAIDPP
LLGRLSSDFA QDMARYEAEI YELAGEKFNL GSPKQLGDIL FGKLGLPGAK KTATGAWSTA
AGVLEDLAEQ GVDLAARILD WRQLAKLKST YTDALPNYIN PSSGRVHTSY ALAATTTGRL
SSSEPNLQNI PVRNEAGRKI RRAFIAPPGR KLVSADYSQI ELRLLAHIAD IKSLKNAFAE
NLDIHAMTAS EMFGVPVEGM PSEIRRRAKA INFGIIYGIS AFGLANQLAI PREEAAAYIK
KYFERFPGIR AYMDDTKKKA RENGYVETIF GRKCHYPRIK ASNPSERALN ERAAINAPIQ
GSAADIIRRA MIHMEAALAA EKLSAQMLLQ VHDELIFEVP DDEVEATIAV VRKIMVEAPL
PAVHLSVPLQ VDAKAAQNWD EAH