Gene Bind_1772 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_1772 
Symbol 
ID6201161 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp2006406 
End bp2009351 
Gene Length2946 bp 
Protein Length981 aa 
Translation table11 
GC content56% 
IMG OID641705761 
ProductWD repeat-containing protein 
Protein accessionYP_001832889 
Protein GI182678743 
COG category[S] Function unknown 
COG ID[COG4916] Uncharacterized protein containing a TIR (Toll-Interleukin 1-resistance) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGAAG ATTCAAAAAG CTCTCGCCGC TTTGCGATCG CCCTCTCGTT TCCAGGGGAA 
CACCGGGAAT ACGTCGAACG AGTTGCCTTG GCATTGCTAC CGGAACTTGG CGGAGAACAA
GGCAAGGCCC GCATCTTCTA TGACGCGTGG CATGAAAGTA AAGTAATCGG ATACAATTCG
AACCGGAAGC TTCAGAAGGT ATACTCAAAA GATTCCGATT TGATTGTTCC ATTTTATTGC
AAAGACTATT TGGAAAAGAA GTGGTGTGGC GTAGAACTGC GCGCCATCGA AGAATTGCTG
TTTGATCAAG AATACGAAAG GGTGCTTCCT TTTCGCTTCG ATATGGTGGA CATCCCTGGA
TCTTTCAAAA CCGACGTATT TCCGATTATT ACGGAGCGTT CGCCGGAAGA CATCGCTCGC
CTGATCCTCG AACGCTACAT GGAACTGCAC GGTGACGAGG TCGTGCAGAC GACTTCGCCC
CATGCCACCC CGGCCCACGC CATCCCCGCT GATATCTCCC GCATCACTGA ATATGCTCCA
ACGCAATTGA TCGGTCGCGA GGCCGAAACT CAGCTCCTCC ACGACGCCTG GGCGAAAGCC
CAAAACAACG AAACGAAGCG CCCCCACCTC CTCACCTTTG TCGCCCTTGG CGGTGAGGGA
AAAACCTCGC TTGTAGCAAA GTGGGCGGCA GAGCTTGCTC ACCTGGATTG GCCAGGGTGC
GACGCGGTGT TTGCATGGTC CTTTTACAGC CAGGGCTCAG AGGGACAGGT TGCGCCCTCC
TCCGAGTTTT TCTTGAAGGA GGCCCTCACC TTTTTCGGTG ACTCTGTGAT GGCGGACAGC
GCGGTGGGCG CGTTCGATAA GGGCCGGCGA CTGGCAGAGC TTGTCGGCGG ACGACAGGCA
CTGCTAATCC TTGATGGTCT GGAACCGCTA CAATACGCGC CGACCTCACC CACACCGGGT
GAGCTTAAGG ACCCTGGCCT CGCCGCGCTG CTTAAAGGGC TCGCCACCAA AAACTACGGC
CTATGCGTGG TCACCACCCG CTATTCAATC CCCAACTTAC GGGCCTTTTG GCAGAACACC
GCCCAAGAAA AGAACATCTT GCGCTTATCC AAGGAAGCGG GCGTGGCGTT GCTTCGGGTA
CTGGGAGTGA AAGGACAGCA ACTAGAGTTC GAGACTCTCG TAGAGGATGT GAAAGGTCAC
GCGCTCACTT TGAACCTGCT GGGCGCTTAT CTTCGTGACG CCCATGCCGG GGATATCCGC
AAGCGAGACC TCATTAATCT GCGAGAAGCC GATGAAGAAC AAGGCGGACA CGCTTTTCGC
GTCATGGATG CCTATGTGCG GTGGTTCGAG AGCGACAGGA GCGAAGGAAA GAAAAGTCAG
CGCGCCCTCG CTCTATTACG GTTGTTGGGG TTGTTCGATC GGCCAGCGGA TGCGGGTTGC
CTCATCGCAT TGTGGAAAGC TCCCCTCATC CCGAATCTTA CTGAGCCGCT CGTAGCGATA
AGCGAGGCGC ATCGCAATCA AGCCTTCACC CGGCTGCAGG ATGCAAAGCT GGTCACAATT
AAACGGGACG GGTCCGGTGT GTTGCTCACG TTGGATGCTC ATCCATTACT ACGTGCATAC
TTTGCCTTAC AACTTCGCAC GTACCACCCC GAGGCGTGGC GCGTCGCCCA TGAGCGGCTT
TATAAGTACC TTTGCACCAC AACGGAGGAC AAACCCGAGC CTACGCTCGA AGATCTGCAG
CCTCTCTATC AGGCCGTTTC TCACGGTTGC CAAGCCGAGA TGCAGCAGGA AGCATGTGAA
ATGTACCGTA ACCGGATCAT TCGCGAATCG GCATCCTACA GTACAAGAAG ACTTGGTGCT
TTAGGTTCTA ACTTGGGAGC TCTGGCCTGC TTCTTCGAGC AACAGTGGAG CCGCGTCTCG
CCCGTGCTCA CAGAAAACGC CAAAGGCTGG CTAATGTCAG AAGCCGCCCA CCATCTGGTA
TTTTTGGGGC GGTTAACCGA GGCCATGGAG CCGGTGCAGG TTGCGCTAGA GGTGTCCATC
GCGAGGCAGG CCTGGAAGAA CGCTGCTATC ACCGTTATCA ACCTCAATCA TCTGGAGCGG
GCACTGGGCA AGGTCGCCAA GGCGGTCAAG ACGGCCGAAC TGTCAGTGGA CTACGCTGAC
CGTAGCACTG ATGCCACCAC TCTAGCGATC GGGTTCTCAG CCTATGCCGA GGCTCTTCAC
CAAGCGGGAC GAGAGGCTGA GGCGGCGGCG CAGTTTCTAA AGGCTGAGGC GATATGGATT
AGGCTCCAAC CCGAGAACCC ACTGCTACAC TCGTCGGCAG GCTTCCACTA TTGTGACCTG
CTCCTTGCCG GTGCCGAACG GGCGGCTTGG CGAGGCATGT TTAATTCATC ATTTGTCCCA
CAACCGTCCT TGTCAGAATC CTGCAGAGAG GTCTCCAAGC ACGCAGCGCA GACGCTGCTG
TGGAGCGAAC AGAACGCCTT CTCTCTCCTT GATATCGCTC ATGACCGCCT CACACTTGGC
CGCACGGCGC TCTACATGAC AATTCTGGAT AGCGATATGC CCCAACAGTC CAATTTTTGC
CGCGACTCGC TCGATCAGGC CATTGAGGGA TTCCGGCGTG CCCAAGTGCA ATACGCCCTT
GTTGAAGGTC TCCTCTCCCG TGCGTGGCTA CGTTTCCTCG CAGGGGCACC GAGCGGTGCC
GAGAGTTCGC AAAGCGACCT CGACGAAGCC TGGGAAATTG TGGAACGCGG GCCGATGCCG
TTGTTTATGG CAGACATTCA TTTGCACCGA GCACGATTGT TTGGATTAAG CAAAGATCGA
CCAGCAACGT ATCCGTGGAC CTCTTTGCAA CACGACCTCA CCGAGGCGCG GCGGTTGATC
GAAAAGCATG GCTATGAGCG GCGCAAGGAA GAACTCGAAG ATGCAGAGGC CGCCGCCCGT
CGGTAA
 
Protein sequence
MNEDSKSSRR FAIALSFPGE HREYVERVAL ALLPELGGEQ GKARIFYDAW HESKVIGYNS 
NRKLQKVYSK DSDLIVPFYC KDYLEKKWCG VELRAIEELL FDQEYERVLP FRFDMVDIPG
SFKTDVFPII TERSPEDIAR LILERYMELH GDEVVQTTSP HATPAHAIPA DISRITEYAP
TQLIGREAET QLLHDAWAKA QNNETKRPHL LTFVALGGEG KTSLVAKWAA ELAHLDWPGC
DAVFAWSFYS QGSEGQVAPS SEFFLKEALT FFGDSVMADS AVGAFDKGRR LAELVGGRQA
LLILDGLEPL QYAPTSPTPG ELKDPGLAAL LKGLATKNYG LCVVTTRYSI PNLRAFWQNT
AQEKNILRLS KEAGVALLRV LGVKGQQLEF ETLVEDVKGH ALTLNLLGAY LRDAHAGDIR
KRDLINLREA DEEQGGHAFR VMDAYVRWFE SDRSEGKKSQ RALALLRLLG LFDRPADAGC
LIALWKAPLI PNLTEPLVAI SEAHRNQAFT RLQDAKLVTI KRDGSGVLLT LDAHPLLRAY
FALQLRTYHP EAWRVAHERL YKYLCTTTED KPEPTLEDLQ PLYQAVSHGC QAEMQQEACE
MYRNRIIRES ASYSTRRLGA LGSNLGALAC FFEQQWSRVS PVLTENAKGW LMSEAAHHLV
FLGRLTEAME PVQVALEVSI ARQAWKNAAI TVINLNHLER ALGKVAKAVK TAELSVDYAD
RSTDATTLAI GFSAYAEALH QAGREAEAAA QFLKAEAIWI RLQPENPLLH SSAGFHYCDL
LLAGAERAAW RGMFNSSFVP QPSLSESCRE VSKHAAQTLL WSEQNAFSLL DIAHDRLTLG
RTALYMTILD SDMPQQSNFC RDSLDQAIEG FRRAQVQYAL VEGLLSRAWL RFLAGAPSGA
ESSQSDLDEA WEIVERGPMP LFMADIHLHR ARLFGLSKDR PATYPWTSLQ HDLTEARRLI
EKHGYERRKE ELEDAEAAAR R