Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bind_3201 |
Symbol | |
ID | 6199170 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Beijerinckia indica subsp. indica ATCC 9039 |
Kingdom | Bacteria |
Replicon accession | NC_010581 |
Strand | + |
Start bp | 3641131 |
End bp | 3643998 |
Gene Length | 2868 bp |
Protein Length | 955 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641707149 |
Product | TPR repeat-containing protein |
Protein accession | YP_001834250 |
Protein GI | 182680104 |
COG category | [R] General function prediction only |
COG ID | [COG0457] FOG: TPR repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.0757905 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCGGC TCGCCCGCTC AACCATCCAT CGGCCAATCC CTTCCATGGG CCGCTTGCGC GGTGTCGAGC ACAAACTGAT GAATGGCAGC CTGTCCCAAG ACTTTAATCA CACCCCGCCC CAGAATCGCG AGGAGATTCG CGAGAAATCC CAGTTGCAGG CCGTGGTCGC CGAGGCCTTC CGCCTGCTGA AGGAAGGAGC ACCGGAACGC GCCATCGCTT ATATCGCGCC TTTCAGCCCT TTGGCCGCCC GCAGCGAGAT CGGCTGCTAC GTCTTCGGCT TGATCTGTTT CAATGCCGAC GATCCGCGCG ATGCGCTGAG CTGGTTCGAC CGGGCGCTCA ATCTGAAACC CGCTTATCCG GAGGCCCTCG GCGCCAAGGC GATCATTCTG CAAAGGCTCG GCCGACCCCA GGAGGCCCTC GAGGTTTTCG AGGCCGCCTG GATGCTGCGT CCCACGGATG TCGAAATCCT CTTCAGCATC GGTGTCGTCA GACAAAGCCT CGGCCAGATG AAGGAAGCGC TTGACGCTTA TGAACAGGCC TTGTGCCTGC GCCCCGATTA TTGCGAGGCT CTGACCAATC GCGGCGCCCT GCTCGAGCGA TTCGGCCGGT TCGCCGAAGC CCTCGAATGT TTCGAAGAGA TTGCCCGCCA GCGCAATGAT GACAGCGTCA ATCTCTTCAA TATGGGATCC GTCCTGCAAA AGCTTGGGCG CCTCGAGGAC GCGCTCGCCG CTTATGAAAA GGCTGCCCGC ATCGGCCCGC CTGACCCCGA GACGGAACTC AATCGCGGCA ATGTCCTGCA GAAACTCTCT CGTTTCGAAG AAGCGATCGC ATGTTACGAT CAAGCGCTTC TCTACCGTGC CCATTATCCG CAAGCTTTTT ATAATAAAGG CATAGCGCTT CAGGGCCTCG GCAAACCACA CGAAGCGCTC GCGGCCTATG ACGCCGCGCT TGGTCTCGAG CCTTCCTATT GCGAGGCCTG GTGCAATCGC GGCAATATCC TGCACGAGCT CAAACGCCTG CCCGATGCCC TTCTCTCCTA TCGCGAAGCC CTGAAAGTCC GGCCTCATTT CCTACCGGCT TTGACCAATC GCGCCAATGT TTTGTTGGAG TTGAACCGCT TCGAAGAAGC CCTGCATTCC TGCACCGAAG CCTTGAAACA TGATCCCAAT CATGCGCGCG CCTTAGGGAT TTCCGGCGCG ATCCTTCATA AATTATCGCG GTTCCATGAA GCCCTGGAGG CGCTCGACAA AGCCGCCGCG CTCAATCCAG CCTCACCCGA AGTCGCGCTC AACCGCGGCA ATGTCTTGCA GGAGCTCGGC CGTCTGCCCG AAGCGATTGC CGCTTATGAA AAAGCTCTTG CTTTAAAAAA CCCCTATCCC GAAGCTCTGT CGGGCCTGGG CGTCGCCTTG AAGGAACAGG GCCGTTTCAA CGAAGCGCTG GCTTGTTTCG ATCAGGCACT AGACCTCAAG CCGGATTTTG CCGATGCCCG CAACAACCGC GCCGGCCTCC TCTTGCTTTA CGGACGTTTT GAGCAAGGGT TTGCCGATTA CGAGAGTCGC TGGGACAGAT CGAACGCCCC CCGAAAGATT TTCGAATCGA AGCTGCCATA TTGGGAAGGC GCGCCGTTGC AGGGCCAGAA ACTCATCGTC TTCGATGAAC AAGGCCTGGG GGACCTCATT CAATTTGCCC GTTATCTGCC GTGCCTCGTC GATGCGGGGG CCGAGGTAAC TTTCCTTGCC CGCCGATCCA TGCATCGGCT GCTCTCTTCC CTCCAAGGAC CCATTCGACT GATCGCTTCC GTCGATCCCG AAGAAGACTT CACCTATCAA ATCCCTCTCA TGGGTCTTCC CCGTGCGTTC GGGACACGTT TGGAGACGAT CCCTGCAGCC GTGCCCTATC TCAAAGCTGA AGTCGATCGT ATCACCCAAT GGGCGGAAAG GATCGGAGGG CCATCATTCC GCATCGGCAT CTGCTGGAAG GGCAACCCGC ATATCAATCT GCGGCGCGGC ATGTCACCGG ATCATTTCGC TCCCCTCGCG GCCCTGCCGA ATGTGCGGCT GTTCAGCCTG ATGCGCGAAT CTTCTCTCAC TGAAGCAGAG GGATCTCGCA TCCCGGATTT TATCGAGACA CTCGGCCCGG ATTTCGATGC TGGCGATGAT GCCTTTCTCG ATTGCGCGGC CGTGATGGAC AATCTCGATC TCATCATCAC CTCGGATACA TCAATCGCGC ATCTCGCGGG CGCTCTCGCC CGACCGGTTT TTCTTGCCTT GAAACAGATT CCGGATTGGC GCTGGCTGAT GGAGCGTGAA GACTGCCCCT GGTATCCAAC CATGCGTCTT TTCCGGCAAA AGCAAAATGG CGAGTGGCGA GAGGTTTTCG ACGCCATGAC GCGCGCCGTC GCGGAAAAGC TCCGTCAGGA GCCGACGCCT GCCACGGGCC ATGATAGCCT TGGCCCAAGC TCATCCTCAT TGCGACATCC CCTCGCCATT CCCAGTGGCA TTGGAGAGCT GATCGACAAG ATCACCATTC TGGAAATCAA GGCAAGCCGG ATCGGCGATA CCGATAAACG CGCTCATGTC GAACATGAGC TTGCCCTGTT GCGGCAATTG CGAATGGAGA ATGGTTTCGA CACGGTGAGT CTCGCCCCTC TGGAAACAAA ACTCAAGGCC GCCAATCTCA TATTGTGGGA GGCCGAGGAC GCTTTACGTC AACATGAGGC GGAAGGGAAT TTTGGGGCGA ATTTCATTCA CCTGGCACGC CAAGTCTACA AAACCAATGA TCAGCGCGCT GCATTAAAAC GAGAGATCAA TATCATTTTC AACTCGCCGA TCATCGAGGA GAAATCCTAC AATGATCGGC GAGCGTGA
|
Protein sequence | MSRLARSTIH RPIPSMGRLR GVEHKLMNGS LSQDFNHTPP QNREEIREKS QLQAVVAEAF RLLKEGAPER AIAYIAPFSP LAARSEIGCY VFGLICFNAD DPRDALSWFD RALNLKPAYP EALGAKAIIL QRLGRPQEAL EVFEAAWMLR PTDVEILFSI GVVRQSLGQM KEALDAYEQA LCLRPDYCEA LTNRGALLER FGRFAEALEC FEEIARQRND DSVNLFNMGS VLQKLGRLED ALAAYEKAAR IGPPDPETEL NRGNVLQKLS RFEEAIACYD QALLYRAHYP QAFYNKGIAL QGLGKPHEAL AAYDAALGLE PSYCEAWCNR GNILHELKRL PDALLSYREA LKVRPHFLPA LTNRANVLLE LNRFEEALHS CTEALKHDPN HARALGISGA ILHKLSRFHE ALEALDKAAA LNPASPEVAL NRGNVLQELG RLPEAIAAYE KALALKNPYP EALSGLGVAL KEQGRFNEAL ACFDQALDLK PDFADARNNR AGLLLLYGRF EQGFADYESR WDRSNAPRKI FESKLPYWEG APLQGQKLIV FDEQGLGDLI QFARYLPCLV DAGAEVTFLA RRSMHRLLSS LQGPIRLIAS VDPEEDFTYQ IPLMGLPRAF GTRLETIPAA VPYLKAEVDR ITQWAERIGG PSFRIGICWK GNPHINLRRG MSPDHFAPLA ALPNVRLFSL MRESSLTEAE GSRIPDFIET LGPDFDAGDD AFLDCAAVMD NLDLIITSDT SIAHLAGALA RPVFLALKQI PDWRWLMERE DCPWYPTMRL FRQKQNGEWR EVFDAMTRAV AEKLRQEPTP ATGHDSLGPS SSSLRHPLAI PSGIGELIDK ITILEIKASR IGDTDKRAHV EHELALLRQL RMENGFDTVS LAPLETKLKA ANLILWEAED ALRQHEAEGN FGANFIHLAR QVYKTNDQRA ALKREINIIF NSPIIEEKSY NDRRA
|
| |