Gene Bind_2674 details


Gene Information

Locus tag: Bind_2674
Symbol:
ID: 6200265
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Beijerinckia indica subsp. indica ATCC 9039
Kingdom: Bacteria
Replicon accession: NC_010581
Strand:
Start bp: 3042685
End bp: 3045768
Gene Length: 3084 bp
Protein Length: 1027 aa
Translation table: 11
GC content: 53%
IMG OID: 641706621
Product: HsdR family type I site-specific deoxyribonuclease
Protein accession: YP_001833735
Protein GI: 182679589
COG category: [V] Defense mechanisms
COG ID: [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
TIGRFAM ID: [TIGR00348] type I site-specific deoxyribonuclease, HsdR family
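
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the stated gene length) and that the final codon of the CDS is a stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(3042685, 3045768)
print(length)                     # 3084
print(protein_length_aa(length))  # 1027
```

Both values agree with the Gene Length and Protein Length fields of this record.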


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCAAA AGCCCCATTC TTCCATTGGC CAGACCGAGA AAAAAACCCA GGCGCGCGTC 
GTCGCTCTTT TCCGCGACCA GCTCGGCTAT GATTATCTTG GCAACTGGAT CGATCGCGAA
AACAACCGCA ATATCGAGGA AGCCTATCTC CGGCCCTTTC TCAAAACGCG CGGATATGAC
GAGGCGCTGA TCAGCCGCGC CCTTTATGAA CTCAATAAAG CCGCGACCGA TCAATCCCGC
AGCCTCTACG ATGTCAATCG TTCCGTCTAT GATCTCTTGC GCTATGGCGT GAAGGTAAAG
GCCGATGTGG GCGAGACGAC GCAAACCGTC TGGCTGATCG ATTGGAAGAA TTTTGCCGCC
AACCATTTCG CTATCGCCGA GGAAGTGGCA GTCAAGCCCG CGAGCGCTTC CGACAAAAGC
ACAGCGAAAG CCTGGGGGAA ACGCCCCGAT GTGGTGATTT ATGTCAACGG CATCGCGCTC
GCCGTGCTGG AACTCAAGCG CTCGACGGTT GCGATCTCAC AAGGCATTCG CCAGAATCTC
GACAACCAGA AGAAGGTTTT TATCCAACAC TTCTTCACCA CCATGCAGCT GATCATGGCG
GGCAGTGACA GCGAGGGTCT GCGTTATGGC GTGATCGAAA CGCCCGAGAA ATATTATCTC
GCTTGGAAGG AAGAAAGCCC GGTCCAAAAT CCTCTCGATC GCGCGCTGCT TCAACTCTGT
GAAAAATCGC GTTTCCTGGA ACTGATTCAC GATTTCATTG TCTTCGACGC CGGCACGAAA
AAGACCTGCC GTCAGAACCA ATATTTTGGC GTGCATGCGG CGCAAGACCA TGTTCGCCGC
CGTGAAGGTG GGATCATCTG GCACACGCAA GGCTCAGGCA AAAGCCTGAC CATGGTCTGG
CTCGCGAAAT GGATACGCGA GCATGTGAAG GATGCGCGCG TACTCATCAT CACCGACCGG
ACCGAACTCG ACGAGCAGAT CGAGAAAGTG TTCAAGGGCA TCAATGAGGA GATCGTCCGT
GCCAAAAGTG GCGCCGATCT CGTCGCCAAA CTAAACGACA CGAATCCCTG GCTTCTATGC
TCGCTGATCC ACAAATTCCG TGGCAGGGAA GATGAAGAGG GTGATGTCCC CGGCTATATC
GAGGAAGTCC GCGAGGCGCT GCCGAAGGAT TTCCGCGCCA AGGGCGATAT TTTTGTCTTC
GTGGACGAAT GCCATCGCAC GCAAAGCGGC GCCTTGCACG ACGCTATGAA GGCTATTCTA
CCGAGCGCCA TGTTCATAGG TTTCACCGGC ACGCCACTTT TGAAGGCGGA TAAGAAAACC
AGTCTCGAAA TCTTTGGCCC CTATATCCAT ACCTATAAAT TCGATGAAGC GGTGAGCGAT
GGCGTCGTGC TCGATCTGCG TTATGAGGCG CGCGACATCG ACCAGAACAT CACCTCGCCC
ACCAAGATCG ACCAATGGTT CGCGGCCAAG ACCAAGGGAT TGACCGATCT CGCCAGGGCG
CAGTTGAAAC AGCGCTGGGG CACTTTGAAG AAAGTGCTCT CGAGTCAATC GCGTTTGGAA
AAAATTGTTG AAGACATCCT TCTCGATTTT GCGCTCAAGG ATCGGCTGGC GGGCGGACGC
GGCAACGCGA TGCTCGTTTC ATCCAGCATT TATCAAGCGT GTAAATTCTA TGAATTGTTT
GACAAGACCG AGCTCTCCAG TAAATGCGCG ATCATTACCT CCTATAAGCC GTCACCCAAT
GATCTGAAAG GAGAGGACAG CGGCGAAGGT CTGACGGAGC GGTTGCGGCA ATATGAAATC
TACAACAAAA TGCTTGAGGG CCGAGACCCG GAGACCTTCG AGAAGGAGGT GAAGAAAAAA
TTCATCGATG AACCAGGACA GATGAAGCTT CTAATCGTGG TGGACAAGCT TCTGACCGGC
TTCGATGCAC CTTCAGCGAC CTATCTCTAT ATCGACAAGC ACATGCAGGA TCATGGCCTG
TTCCAGGCCA TTTGCCGCGT GAACCGTCTT GATGGGGATG ATAAGGAATA TGGCTACATC
ATAGACTACA AGGATCTTTT CAAAAGCCTC GAAAGCTCGA TCAAGGATTA TACTGCCGAG
GCTTTTGAAG GCTATGATAA AGCGGATGTC GCCGGCCTTT TATCCGACCG GCTTGAAAAG
GCGCGCGAAA GGCTGGATGA AGCCCTCGAA GCGGTGCGCG CACTCTGTGA ACCTGTTGAA
GCGCCTCGTG ACACAGCGGC TTTCCTTCAA TTCTTCTGCG CGAAAGACAC AGCCGATAAA
ACCGCACTGA AGGAAAACGA ACCGAAACGG CTCACCCTCT ACAAGCTGGT CGCCTCTCTG
CTGCGCGCCT ATGCCGATAT CGCGAACGAA ATGCCAGAAG CGGGCTATAG CGATGCTGAA
ATCGCGGCGA TCAAGGCCGA AGTCGATCAT TTCACGCACG TCCGCGATGA AGTAAAGCTC
GCAAGCGGCG ATTATATCGA TCTCAAAATG TATGAGCCGG CGATGCGGCA CCTGATCGAC
ACCTATATCC GGGCAGACGA AAGCAGGCTG ATTTCCGCCT TCGACGATAT GTCGTTGGTG
CAGATGATCG TCGAGCGCGG AGCGGATGCG GTGGATGCCT TGCCGAAGGG CATTCGCGAA
AACAAGGAAG CCGTCGCCGA AACAATTGAA AACAATGTGC GTAAGCTTAT CATCGAAGAA
ACGCCGATCA ATCCGAAATA TTATGAAACT ATGTCGGATC TTCTGGATGC GCTGATCGTT
CAGCGCAAAC AGCAGGCGAT TGCCTATGAG CAATATCTCG CGGAAATCGT CGCGCTCACG
AAGAAAGCGA AGAATTCAGC CACTGGAGCG ACCTATCCCA CAACGATGAA CACCGCCGCG
AAACGCGCCC TCTATGATAA TCTCGGCAAG GATGAAGCCT TGGCGATAGC GATTGATGCA
GACATCCGTA AGAAAAAGCA AGATGATTGG CGTGGGCATA GGGGTAAGGA GCAAATGGTG
CGCAACATTA TCCGAACACA TCTCACCGAT CCGGCGCTTG TCGATCAGAT CTTCGATCTG
GTGAGAAGCC AACATGAATA TTGA
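
The gene sequence is displayed in space-separated blocks of 10 bases, 60 per line, so it needs to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction for comparison with the GC content field. The helper names are hypothetical, and only the first displayed line is used as sample input.

```python
def clean_sequence(block_text: str) -> str:
    """Join a block-formatted sequence by removing all whitespace."""
    return "".join(block_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(1 for base in dna if base in "GC") / len(dna)


first_line = "ATGAGCCAAA AGCCCCATTC TTCCATTGGC CAGACCGAGA AAAAAACCCA GGCGCGCGTC"
dna = clean_sequence(first_line)
print(len(dna))          # 60
print(gc_fraction(dna))  # GC fraction of this first line only
```

Applying the same two functions to the full sequence would give a value to compare against the GC content reported for the gene.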
 
Protein sequence
MSQKPHSSIG QTEKKTQARV VALFRDQLGY DYLGNWIDRE NNRNIEEAYL RPFLKTRGYD 
EALISRALYE LNKAATDQSR SLYDVNRSVY DLLRYGVKVK ADVGETTQTV WLIDWKNFAA
NHFAIAEEVA VKPASASDKS TAKAWGKRPD VVIYVNGIAL AVLELKRSTV AISQGIRQNL
DNQKKVFIQH FFTTMQLIMA GSDSEGLRYG VIETPEKYYL AWKEESPVQN PLDRALLQLC
EKSRFLELIH DFIVFDAGTK KTCRQNQYFG VHAAQDHVRR REGGIIWHTQ GSGKSLTMVW
LAKWIREHVK DARVLIITDR TELDEQIEKV FKGINEEIVR AKSGADLVAK LNDTNPWLLC
SLIHKFRGRE DEEGDVPGYI EEVREALPKD FRAKGDIFVF VDECHRTQSG ALHDAMKAIL
PSAMFIGFTG TPLLKADKKT SLEIFGPYIH TYKFDEAVSD GVVLDLRYEA RDIDQNITSP
TKIDQWFAAK TKGLTDLARA QLKQRWGTLK KVLSSQSRLE KIVEDILLDF ALKDRLAGGR
GNAMLVSSSI YQACKFYELF DKTELSSKCA IITSYKPSPN DLKGEDSGEG LTERLRQYEI
YNKMLEGRDP ETFEKEVKKK FIDEPGQMKL LIVVDKLLTG FDAPSATYLY IDKHMQDHGL
FQAICRVNRL DGDDKEYGYI IDYKDLFKSL ESSIKDYTAE AFEGYDKADV AGLLSDRLEK
ARERLDEALE AVRALCEPVE APRDTAAFLQ FFCAKDTADK TALKENEPKR LTLYKLVASL
LRAYADIANE MPEAGYSDAE IAAIKAEVDH FTHVRDEVKL ASGDYIDLKM YEPAMRHLID
TYIRADESRL ISAFDDMSLV QMIVERGADA VDALPKGIRE NKEAVAETIE NNVRKLIIEE
TPINPKYYET MSDLLDALIV QRKQQAIAYE QYLAEIVALT KKAKNSATGA TYPTTMNTAA
KRALYDNLGK DEALAIAIDA DIRKKKQDDW RGHRGKEQMV RNIIRTHLTD PALVDQIFDL
VRSQHEY
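
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is illustrative only: it builds the standard codon assignments, which translation table 11 shares with table 1 (table 11 differs in the set of permitted start codons; this gene starts with ATG, so that difference does not matter here). The `translate` function is a hypothetical helper, not a library call.

```python
from itertools import product

# Codon assignments in TCAG order, shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 24 bases of the gene sequence shown above.
print(translate("ATGAGCCAAA AGCCCCATTC TTCCATTGGC"))  # MSQKPHSSIG
print(translate("GTGAGAAGCC AACATGAATA TTGA"))        # VRSQHEY
```

Both fragments match the corresponding ends of the protein sequence, and the final TGA codon is the stop.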