Gene Bind_3417 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_3417 
Symbol 
ID6199005 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp3877727 
End bp3880612 
Gene Length2886 bp 
Protein Length961 aa 
Translation table11 
GC content60% 
IMG OID641707364 
ProductDNA topoisomerase I 
Protein accessionYP_001834463 
Protein GI182680317 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA 
TIGRFAM ID[TIGR01051] DNA topoisomerase I, bacterial 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0414891 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.668216 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGTCG TCATCGTCGA ATCGCCGGCG AAGGCCAAGA CCATCAATAA ATATCTCGGC 
AAGGATTACG AAGTTTTTGC CTCGTTCGGT CATGTCCGCG ATTTGCCGCC CAAGGATGGT
TCGGTCGATC CCGACCACGA CTTCGCCATG CTTTGGGATG TCGACACCAA ATCCGCCAAA
CGGCTCGCCG ACATCGCCAA GGCGGTGAAG GAGGCCGACC GGGTGATTCT CGCCACCGAC
CCTGACCGTG AGGGCGAGGC GATTTCCTGG CATGTGCTGG AAGTGCTCAA GGCCAAAAAG
GTCCTGAAGG ATAAGCCGGT CGAGCGTGTG GTGTTCAATG CGATCACCCA ATCGGCGATT
CTCGACGCCA TGCGGCATCC GCGCGCTATC GACATTGATC TCGTCGATGC CTATCTCGCG
CGCCGGGCGC TTGATTATCT CGTCGGCTTC AATCTTTCGC CGGTGCTTTG GCGCAAATTG
CCGGGGGCGC GTTCGGCTGG CCGCGTGCAA TCGGTCGCCT TGCGTCTCGT CTGCGAGCGC
GAATTGGAGA TCGAACGCTT CGTTCCGCGG GAATATTGGT CGCTCACGGC CTTTCTGCGC
ACGCCCGCTG ATCAGCCCTT TTCCGCGAAA CTCGTCGGCG CGGACGGCAA AAAGATCAAC
CGGCTCGATA TTGGCGCGGG CGCCGAAGCC GAAGCCTTCA AGGCGGCGCT CGAAACGGCC
AAATTCACGG TGGCCAAGGT CGAGGCCAAG CCCGCCAGAC GCAATCCCGC GCCTCCTTTC
ACTACATCGA CCTTGCAACA GGAGGCGGCG CGCAAATTGG GGCTTGCACC AGCGCGCACC
ATGCAATTGG CCCAGCGCCT TTACGAAGGC ATCGATCTCG ATGGCGAGAC GGTGGGCCTC
ATCACTTATA TGCGAACCGA TGGCGTTGAT CTGGCACCGG AAGCGATCAC TGGCGCGCGG
AAAGTGATTG CCGCCGAATA TGGCGATAAA TATGTGCCGC AGGCACCGCG CCGCTATCAG
GTCAAGGCCA AGAATGCCCA GGAGGCGCAT GAGGCGATCC GACCGACCGA TCTTGCCCGC
CTGCCGAAAC ATGTCGCTCG TTTTCTCGAT GCGGAGCAGG CGCGGCTTTA TGATTTGATC
TGGACCCGCA CCATTGCGAG CCAAATGGAA TCGGCCGAAC TGGAGCGGAC CACGGTGGAT
ATTCTGGCCG AGGTTGGTGC GCGCCGGCTC GATCTTCGCG CCACTGGGCA GGTGGTGCGT
TTCGACGGAT TCCTGAAACT CTATCAGGAA GGCCGCGACG ACGAAGAGGA TGAAGAGGGT
GGCCGCCTAC CGGCCATGCA GGTCGGCGAC CCCTTGAAAA AGGACAGGAT CGAGGCGAGC
CAGCATTTCA CCGAGCCGCC ACCGCGCTTT ACCGAGGCGA CGCTCGTCAA GCGCATGGAG
GAACTTGGCA TAGGCCGGCC CTCGACCTAT GCCTCGACGC TCGCCGTCCT GAAAGATCGC
GAATATGTGC GGATCGACAA GAAACGGCTG ATTCCCGAGG ACAAGGGACG GCTCGTTACG
GCTTTTCTCG AAAGCTTCTT TGGCCGCTAT GTCGGCTATG ATTTCACCGC CGATCTGGAA
TCAAGCCTCG ACAAGATTTC CAATCACGAG ATCGATTGGA AACAGGTCCT GCGCGATTTC
TGGGCCGATT TTTCTGGCGC CATCGCCGAC ACCAAGGATT TGCGCACCAC ACAAGTCCTC
GATAGCCTCA ATGAAGTGCT CGGTCCCTAT ATTTTCCCCG ATAAGGGGGA TGGCTCCAAT
CCGCGCGCCT GTCCTTCTTG CGCAAATGGC CAATTGTCGC TGAAGCTCGG CAAATTCGGT
TCTTTCATCG GCTGTTCCAA TTATCCGGAG TGCAAATTCA CCCGCACTCT TTCGGATACG
GGGCCGGAGG GAGGCAACGG CGAAACGGAT CGTCCGGGTG TCAAGGTGCT GGGGGTCGAT
GCCGAAACAG GCGAGGAGAT TTCGCTGCGC GATGGGCGCT TTGGTGCCTA TGTGCAGCGC
GGCGAAGGCG AAAAGCCCAA ACGCGCCTCC TTGCCCAAGA CGATCGCGCC GGCCGATCTG
ACACTCGACA TGGCGCTCGG GCTTCTTTCC CTGCCGCGCG AGGTTGCGCG CCATCCCGAA
ACCCATGAGC CGATTCTGGC GGGCATCGGC CGGTTTGGTC CCTATGTCCA GCATGGCAAG
ACCTATGCGA ATATCGGCAA GGACGAGGAT ATTCTGACCC TTGGCGCCAA TCGCGCCATC
GACCTCATCA TTGCCAAAGA AAGCGGGCTC ACCGGTCGCC GTTTCGGCAA AGGCGAATCC
GCGCCTGCCC GTGTTTTGGG TGATCATCCC GAAGGGGGGC AGGTCACGAT CAAGGCCGGG
CGCTTTGGTC CCTATGTCAA TTACGGCAAG CTCAACGCGA CCTTGCCGAA AGACGCCGAC
CCCACCACAT TGACGCTGGA GGAAGGCTTG GCCTTGCTCG CCGCCAAGGC GAGTGGTCAA
GGGGGAGGAA AAGGCGCGGT GCAGGGCCAA CTCCTCGGCG AGCACCCTTC GGGCGGTCCC
ATTACCGTGC GGGAAGGCCG TTTTGGGCCT TATGTCAATC ACGGCAAGGT CAATGCGACC
TTGAAATCCG GTCTCTCGCC GGAAACTTTG ACGCTCGAAG AGGCCATCCG CCTGATTGAC
GAGAAGGCCG GAGCCGCATC CAAAAAGGCC CCTGCGAAGA AGGCGCCCGC AAAGAAAGCC
TCTGGCAAGA CGACGACCAC CGAGAAAGCG CCCGCGAAAA AGGCTGCAAG CAAGGCAGCA
ACGGCCAAAA CCACCAAGGC CAAAGCGGCG AAATCATCGG AGCCGGATGA AGAACCCCCT
TTTTAA
 
Protein sequence
MNVVIVESPA KAKTINKYLG KDYEVFASFG HVRDLPPKDG SVDPDHDFAM LWDVDTKSAK 
RLADIAKAVK EADRVILATD PDREGEAISW HVLEVLKAKK VLKDKPVERV VFNAITQSAI
LDAMRHPRAI DIDLVDAYLA RRALDYLVGF NLSPVLWRKL PGARSAGRVQ SVALRLVCER
ELEIERFVPR EYWSLTAFLR TPADQPFSAK LVGADGKKIN RLDIGAGAEA EAFKAALETA
KFTVAKVEAK PARRNPAPPF TTSTLQQEAA RKLGLAPART MQLAQRLYEG IDLDGETVGL
ITYMRTDGVD LAPEAITGAR KVIAAEYGDK YVPQAPRRYQ VKAKNAQEAH EAIRPTDLAR
LPKHVARFLD AEQARLYDLI WTRTIASQME SAELERTTVD ILAEVGARRL DLRATGQVVR
FDGFLKLYQE GRDDEEDEEG GRLPAMQVGD PLKKDRIEAS QHFTEPPPRF TEATLVKRME
ELGIGRPSTY ASTLAVLKDR EYVRIDKKRL IPEDKGRLVT AFLESFFGRY VGYDFTADLE
SSLDKISNHE IDWKQVLRDF WADFSGAIAD TKDLRTTQVL DSLNEVLGPY IFPDKGDGSN
PRACPSCANG QLSLKLGKFG SFIGCSNYPE CKFTRTLSDT GPEGGNGETD RPGVKVLGVD
AETGEEISLR DGRFGAYVQR GEGEKPKRAS LPKTIAPADL TLDMALGLLS LPREVARHPE
THEPILAGIG RFGPYVQHGK TYANIGKDED ILTLGANRAI DLIIAKESGL TGRRFGKGES
APARVLGDHP EGGQVTIKAG RFGPYVNYGK LNATLPKDAD PTTLTLEEGL ALLAAKASGQ
GGGKGAVQGQ LLGEHPSGGP ITVREGRFGP YVNHGKVNAT LKSGLSPETL TLEEAIRLID
EKAGAASKKA PAKKAPAKKA SGKTTTTEKA PAKKAASKAA TAKTTKAKAA KSSEPDEEPP
F