Gene Bind_3201 details

Gene Information

Locus tag: Bind_3201
Symbol:
ID: 6199170
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Beijerinckia indica subsp. indica ATCC 9039
Kingdom: Bacteria
Replicon accession: NC_010581
Strand:
Start bp: 3641131
End bp: 3643998
Gene Length: 2868 bp
Protein Length: 955 aa
Translation table: 11
GC content: 58%
IMG OID: 641707149
Product: TPR repeat-containing protein
Protein accession: YP_001834250
Protein GI: 182680104
COG category: [R] General function prediction only
COG ID: [COG0457] FOG: TPR repeat
TIGRFAM ID:
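
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3641131
end_bp = 3643998
reported_gene_length = 2868      # bp
reported_protein_length = 955    # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length == reported_gene_length)        # True
print(computed_protein_length == reported_protein_length)  # True
```

Under these assumptions both reported lengths agree with the coordinates: 3643998 - 3641131 + 1 = 2868 bp, and 2868 / 3 - 1 = 955 aa.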


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.0757905
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCGGC TCGCCCGCTC AACCATCCAT CGGCCAATCC CTTCCATGGG CCGCTTGCGC 
GGTGTCGAGC ACAAACTGAT GAATGGCAGC CTGTCCCAAG ACTTTAATCA CACCCCGCCC
CAGAATCGCG AGGAGATTCG CGAGAAATCC CAGTTGCAGG CCGTGGTCGC CGAGGCCTTC
CGCCTGCTGA AGGAAGGAGC ACCGGAACGC GCCATCGCTT ATATCGCGCC TTTCAGCCCT
TTGGCCGCCC GCAGCGAGAT CGGCTGCTAC GTCTTCGGCT TGATCTGTTT CAATGCCGAC
GATCCGCGCG ATGCGCTGAG CTGGTTCGAC CGGGCGCTCA ATCTGAAACC CGCTTATCCG
GAGGCCCTCG GCGCCAAGGC GATCATTCTG CAAAGGCTCG GCCGACCCCA GGAGGCCCTC
GAGGTTTTCG AGGCCGCCTG GATGCTGCGT CCCACGGATG TCGAAATCCT CTTCAGCATC
GGTGTCGTCA GACAAAGCCT CGGCCAGATG AAGGAAGCGC TTGACGCTTA TGAACAGGCC
TTGTGCCTGC GCCCCGATTA TTGCGAGGCT CTGACCAATC GCGGCGCCCT GCTCGAGCGA
TTCGGCCGGT TCGCCGAAGC CCTCGAATGT TTCGAAGAGA TTGCCCGCCA GCGCAATGAT
GACAGCGTCA ATCTCTTCAA TATGGGATCC GTCCTGCAAA AGCTTGGGCG CCTCGAGGAC
GCGCTCGCCG CTTATGAAAA GGCTGCCCGC ATCGGCCCGC CTGACCCCGA GACGGAACTC
AATCGCGGCA ATGTCCTGCA GAAACTCTCT CGTTTCGAAG AAGCGATCGC ATGTTACGAT
CAAGCGCTTC TCTACCGTGC CCATTATCCG CAAGCTTTTT ATAATAAAGG CATAGCGCTT
CAGGGCCTCG GCAAACCACA CGAAGCGCTC GCGGCCTATG ACGCCGCGCT TGGTCTCGAG
CCTTCCTATT GCGAGGCCTG GTGCAATCGC GGCAATATCC TGCACGAGCT CAAACGCCTG
CCCGATGCCC TTCTCTCCTA TCGCGAAGCC CTGAAAGTCC GGCCTCATTT CCTACCGGCT
TTGACCAATC GCGCCAATGT TTTGTTGGAG TTGAACCGCT TCGAAGAAGC CCTGCATTCC
TGCACCGAAG CCTTGAAACA TGATCCCAAT CATGCGCGCG CCTTAGGGAT TTCCGGCGCG
ATCCTTCATA AATTATCGCG GTTCCATGAA GCCCTGGAGG CGCTCGACAA AGCCGCCGCG
CTCAATCCAG CCTCACCCGA AGTCGCGCTC AACCGCGGCA ATGTCTTGCA GGAGCTCGGC
CGTCTGCCCG AAGCGATTGC CGCTTATGAA AAAGCTCTTG CTTTAAAAAA CCCCTATCCC
GAAGCTCTGT CGGGCCTGGG CGTCGCCTTG AAGGAACAGG GCCGTTTCAA CGAAGCGCTG
GCTTGTTTCG ATCAGGCACT AGACCTCAAG CCGGATTTTG CCGATGCCCG CAACAACCGC
GCCGGCCTCC TCTTGCTTTA CGGACGTTTT GAGCAAGGGT TTGCCGATTA CGAGAGTCGC
TGGGACAGAT CGAACGCCCC CCGAAAGATT TTCGAATCGA AGCTGCCATA TTGGGAAGGC
GCGCCGTTGC AGGGCCAGAA ACTCATCGTC TTCGATGAAC AAGGCCTGGG GGACCTCATT
CAATTTGCCC GTTATCTGCC GTGCCTCGTC GATGCGGGGG CCGAGGTAAC TTTCCTTGCC
CGCCGATCCA TGCATCGGCT GCTCTCTTCC CTCCAAGGAC CCATTCGACT GATCGCTTCC
GTCGATCCCG AAGAAGACTT CACCTATCAA ATCCCTCTCA TGGGTCTTCC CCGTGCGTTC
GGGACACGTT TGGAGACGAT CCCTGCAGCC GTGCCCTATC TCAAAGCTGA AGTCGATCGT
ATCACCCAAT GGGCGGAAAG GATCGGAGGG CCATCATTCC GCATCGGCAT CTGCTGGAAG
GGCAACCCGC ATATCAATCT GCGGCGCGGC ATGTCACCGG ATCATTTCGC TCCCCTCGCG
GCCCTGCCGA ATGTGCGGCT GTTCAGCCTG ATGCGCGAAT CTTCTCTCAC TGAAGCAGAG
GGATCTCGCA TCCCGGATTT TATCGAGACA CTCGGCCCGG ATTTCGATGC TGGCGATGAT
GCCTTTCTCG ATTGCGCGGC CGTGATGGAC AATCTCGATC TCATCATCAC CTCGGATACA
TCAATCGCGC ATCTCGCGGG CGCTCTCGCC CGACCGGTTT TTCTTGCCTT GAAACAGATT
CCGGATTGGC GCTGGCTGAT GGAGCGTGAA GACTGCCCCT GGTATCCAAC CATGCGTCTT
TTCCGGCAAA AGCAAAATGG CGAGTGGCGA GAGGTTTTCG ACGCCATGAC GCGCGCCGTC
GCGGAAAAGC TCCGTCAGGA GCCGACGCCT GCCACGGGCC ATGATAGCCT TGGCCCAAGC
TCATCCTCAT TGCGACATCC CCTCGCCATT CCCAGTGGCA TTGGAGAGCT GATCGACAAG
ATCACCATTC TGGAAATCAA GGCAAGCCGG ATCGGCGATA CCGATAAACG CGCTCATGTC
GAACATGAGC TTGCCCTGTT GCGGCAATTG CGAATGGAGA ATGGTTTCGA CACGGTGAGT
CTCGCCCCTC TGGAAACAAA ACTCAAGGCC GCCAATCTCA TATTGTGGGA GGCCGAGGAC
GCTTTACGTC AACATGAGGC GGAAGGGAAT TTTGGGGCGA ATTTCATTCA CCTGGCACGC
CAAGTCTACA AAACCAATGA TCAGCGCGCT GCATTAAAAC GAGAGATCAA TATCATTTTC
AACTCGCCGA TCATCGAGGA GAAATCCTAC AATGATCGGC GAGCGTGA
 
Protein sequence
MSRLARSTIH RPIPSMGRLR GVEHKLMNGS LSQDFNHTPP QNREEIREKS QLQAVVAEAF 
RLLKEGAPER AIAYIAPFSP LAARSEIGCY VFGLICFNAD DPRDALSWFD RALNLKPAYP
EALGAKAIIL QRLGRPQEAL EVFEAAWMLR PTDVEILFSI GVVRQSLGQM KEALDAYEQA
LCLRPDYCEA LTNRGALLER FGRFAEALEC FEEIARQRND DSVNLFNMGS VLQKLGRLED
ALAAYEKAAR IGPPDPETEL NRGNVLQKLS RFEEAIACYD QALLYRAHYP QAFYNKGIAL
QGLGKPHEAL AAYDAALGLE PSYCEAWCNR GNILHELKRL PDALLSYREA LKVRPHFLPA
LTNRANVLLE LNRFEEALHS CTEALKHDPN HARALGISGA ILHKLSRFHE ALEALDKAAA
LNPASPEVAL NRGNVLQELG RLPEAIAAYE KALALKNPYP EALSGLGVAL KEQGRFNEAL
ACFDQALDLK PDFADARNNR AGLLLLYGRF EQGFADYESR WDRSNAPRKI FESKLPYWEG
APLQGQKLIV FDEQGLGDLI QFARYLPCLV DAGAEVTFLA RRSMHRLLSS LQGPIRLIAS
VDPEEDFTYQ IPLMGLPRAF GTRLETIPAA VPYLKAEVDR ITQWAERIGG PSFRIGICWK
GNPHINLRRG MSPDHFAPLA ALPNVRLFSL MRESSLTEAE GSRIPDFIET LGPDFDAGDD
AFLDCAAVMD NLDLIITSDT SIAHLAGALA RPVFLALKQI PDWRWLMERE DCPWYPTMRL
FRQKQNGEWR EVFDAMTRAV AEKLRQEPTP ATGHDSLGPS SSSLRHPLAI PSGIGELIDK
ITILEIKASR IGDTDKRAHV EHELALLRQL RMENGFDTVS LAPLETKLKA ANLILWEAED
ALRQHEAEGN FGANFIHLAR QVYKTNDQRA ALKREINIIF NSPIIEEKSY NDRRA
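
The sequences above are displayed in space-separated blocks of 10 characters. The following is a minimal sketch of how such a block-formatted gene sequence might be cleaned and sanity-checked before further use. It assumes the blocks and line breaks are display formatting only, and it treats ATG, GTG and TTG as acceptable start codons, which is a simplification of translation table 11. All function names are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}
# Assumption: only the most common bacterial start codons are accepted here.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean_sequence(text: str) -> str:
    """Remove all whitespace (block gaps, line breaks) and upper-case the result."""
    return "".join(text.split()).upper()


def looks_like_complete_cds(seq: str) -> bool:
    """True if seq is a whole number of codons with a start and a stop codon."""
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in START_CODONS
        and seq[-3:] in STOP_CODONS
    )


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# Toy input in the same block layout; paste the full gene sequence here instead.
toy = "ATGGCC \nTGA"
seq = clean_sequence(toy)
print(seq, looks_like_complete_cds(seq), round(gc_fraction(seq), 3))
```

Applied to the full gene sequence, the cleaned length could be compared with the Gene Length field and the GC fraction with the GC content field of this record.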