Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bind_3358 |
Symbol | |
ID | 6201627 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Beijerinckia indica subsp. indica ATCC 9039 |
Kingdom | Bacteria |
Replicon accession | NC_010581 |
Strand | + |
Start bp | 3810316 |
End bp | 3812226 |
Gene Length | 1911 bp |
Protein Length | 636 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641707304 |
Product | extracellular solute-binding protein |
Protein accession | YP_001834404 |
Protein GI | 182680258 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0726814 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.160875 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGACTGGAC GATTCCCGCT CTTCTCGACT CGCCGTCAAG CATTGGGGCT CGCCCTCGGC GGCCTTGGCG GTCTCGCCAT GCCGCCCGCT TTTTTCTTTG CAAAGCGCGC GAAGGCTCAG GAACAAAAGC CAACGGAAAA GCCCGTTGAA GGCGAAGCTT ATGAGGCGCA CGGCCTCTCG ATCTTCGGCG ATCTCGCCTT ACCGCCTGAT TTCAAGCATC TGCCTTACGT GAATCCGGAT GCGCCCAAGG GCGGCATCTT CATCGAACAG GCCGGCTTCA ATACGTTCAA CACGCTGAAT GTCTTCATCC TCAAGGGGGA TGGCGCGGCT GGCATGGGGC TGATCTTCGA CAGCCTGATG ACGGGAAGCA ATGATGAACC CGATGCGCTT TATGGCCTCG TCGCTCACAA GGTAGAAGTT TCAGCCGACC GCACCCTCTA TCGATTTTTT CTCCGCAAGG AAGCGCGTTT CCACGACGGC TCGAAGATCA AGGCGTCGGA CGTGGTGTTT TCCTTGAATC TGATCAAAAC CAAGGGCCAT CCAGTCTTGC GCCAGGGCCT GCGCGATCTC GAAAGCGCCG AAGCCGAGGC CGAGGATATC GTACGCGTCC GCATGCGCCC GGGCCATAGC CGCGAATCTC CGCTGACCGT CGCCAGTCAG CCGATTTTTT CGCAGGCCTA TTACACGACC CATAATTTTG AGGAGACCAC GCTTGAACCG CCCCTTGGTT CCGCGGCCTA CAAGGTCGGC CCGTTCGAGC AAGGGCGCTA TATCTCTTTC ACGCGGGTCG AGAATTATTG GGGCAAGGAC CTCCCGATCA ATCGCGGGCG CAGCAATTTC GATGTGGTGC GCTACGAATA TTTCAACGAT CGCAAAGTGG CTTTCGAGGC GTTCAAAGCC GGCGTCTTCA CCTATCGCGA GGAATATACC TCACTCATCT GGGCGACAGG CTATGATTTC GCCGCGCTCA AGGAAGGCAA GGTCAAACGC GAAACTGTTC CAGACGCCTA TCCGCGCGGT ACGCAAGGCT GGTTCCTGAA TACGCGGAGA ACCAAATTCA AGGATGCGCG CATCCGCCGG GCGCTCGGCT ATGCCTTCGA TTTCGAATGG ACCGACGCCA ATATCATGTA CAATCTCTAC AAGCGTACGG TTTCCTATTT CCAGAACTCG CCCATGGCCG CCGAAGGCCT GCCCTCGGCC GAGGAACGCG CTCTGCTCAC GCCTTTCGAA GATCAATTGC CTCCCGAGGT TTTTGGCGAG GCTGTCGTGC CCCCCGTCTC AGACGGCTCC GGCCAGGATC GCAAGCTCCT GCGCGAGGCC AGTGAGCAAT TGCGGCAGGC CGGCTGCACC CGCAAGGGCA ATACACTCCT GCTCCCCGAC GGCAAGCCCT TCGAAATCGA ATTTCTTGGC TTTGAAACCT CGTTCCAGCC CCATACGGCG GCTTTCATCA AAAATCTCAA ATTGCTCGGG ATCGATGCCG ATTATCGTGT CGTCGATGCG GCCCAATATA AACGCCGGGT CGATGAGTTC GACTATGATA TTGTCACGGA ACGATTTAGT TTCGGCCTGA CGCCAGGCGA GGATATGCGG CTCATTTTCG GTTCCGAAAC GGCGAATCTT TCCGGCTCCC GCAATGTGGC GGGCATTGCC TTGCCGAGCG TCGATGCCCT CATCGCAAAA GCTCTGGTCG TCGACACGCG CGAGAACCTG ACCACGATCT GCCGGGCGAT CGACCGCATC TTGCGCGCCC ATTATTTCTG GGTGCCGATG TGGAACAATC CTAACCATCT GCTGGCCTTC TGGGATCTGT TCGGCCGTCC GGAGCGCATG CCCCATTATG ATGTCGGCGT GCCCTCGACC TGGTGGTTCG ATCCTCAAAA GGCCCAGCGC ATCGGTTGGC GGAAACCCTA G
|
Protein sequence | MTGRFPLFST RRQALGLALG GLGGLAMPPA FFFAKRAKAQ EQKPTEKPVE GEAYEAHGLS IFGDLALPPD FKHLPYVNPD APKGGIFIEQ AGFNTFNTLN VFILKGDGAA GMGLIFDSLM TGSNDEPDAL YGLVAHKVEV SADRTLYRFF LRKEARFHDG SKIKASDVVF SLNLIKTKGH PVLRQGLRDL ESAEAEAEDI VRVRMRPGHS RESPLTVASQ PIFSQAYYTT HNFEETTLEP PLGSAAYKVG PFEQGRYISF TRVENYWGKD LPINRGRSNF DVVRYEYFND RKVAFEAFKA GVFTYREEYT SLIWATGYDF AALKEGKVKR ETVPDAYPRG TQGWFLNTRR TKFKDARIRR ALGYAFDFEW TDANIMYNLY KRTVSYFQNS PMAAEGLPSA EERALLTPFE DQLPPEVFGE AVVPPVSDGS GQDRKLLREA SEQLRQAGCT RKGNTLLLPD GKPFEIEFLG FETSFQPHTA AFIKNLKLLG IDADYRVVDA AQYKRRVDEF DYDIVTERFS FGLTPGEDMR LIFGSETANL SGSRNVAGIA LPSVDALIAK ALVVDTRENL TTICRAIDRI LRAHYFWVPM WNNPNHLLAF WDLFGRPERM PHYDVGVPST WWFDPQKAQR IGWRKP
|
| |