Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bind_2194 |
Symbol | |
ID | 6199066 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Beijerinckia indica subsp. indica ATCC 9039 |
Kingdom | Bacteria |
Replicon accession | NC_010581 |
Strand | + |
Start bp | 2508662 |
End bp | 2510482 |
Gene Length | 1821 bp |
Protein Length | 606 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 641706185 |
Product | extracellular solute-binding protein |
Protein accession | YP_001833303 |
Protein GI | 182679157 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.808425 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTTGCT CCTGTTTCAA ACCTGTTTTC CTCCTCCTCT TGCTTTGGCA GACATCCTTT GGATTCGCAT CGCTTGGAAG GGCCGCAGAA GCCGCGCATG CGATCGCCAT GCATGGCGAG CCGGCTCTCT CAGCGGATTT TCCCCATTTT CCCTACGCCA ATCCCAATGC ATCGCAGGGC GGCACTCTGC GTCTTGGTTT TCAAGGGACA TTCGACAGTC TCAATCCATT CAATCTCAAA GCCGGTTCGA CCGCCCAAGG TCTCAACGGC AATATTTTCG AACCCCTCAT GACGCGCTCG CTCGACGAGC CTTTCACGCT CTACGGATTG ATCGCCGAGT CGATCGAGAC CGATGACTCA CGCGAGTTCG TCATTTTTCA TCTCAACCCC AAGGCGCATT TTTCCGACGG CACACCCATC ACGGCAGACG ATGTCCTCTT CAGTTTCAAC CTGTTGAAAA CCCATGGGCG GCCACAGCAT CGCGTCTCCT ATGGCATGGT GAAATCAGCC ACTGCTCCCG ATCCCCTGAC TGTCCGCTAT GATCTCAATT CCGGGCAAGA CCGGGAAATG CCCCTGATGC TCGCCTTGAT GCCCGTTCTG CCCAAACATC TCGTCAATCC AGCGACCTTC GATGAAGCGA CGCTCAATCC GCCGACCGGA TCGGGCCCCT ATATCCTCAC GGAAGTCAAA CCGGGCGAGC GTCTCATCCT GCATCGAGAT CCCCATTATT GGGCGGCCGA TCTGTCGACA CGGCGCGGCC TGTTCAATTT CGAGACGATC ACCATTGACT ATTTTCGCGA CGCCAACAGC CTGTTCGAAG CTTTCCGGGC GGGCCTCATC GATTTCCGCG AGGAGACCAG CCCCGCACGC TGGATGAAGG CTTATGATTT TCCAGCTCTC ACCGAGGGCC GGATCTTCAA GGAAGCTTTG CCCATCGGCG GCCCCAAGGG CATGGAGGGA TTCGTCTTCA ATCTGCGGCG CCCCCTCTTC ACGGATATCA AGGTTCGCGA GGCGCTCGCC TCCCTCTTCG ATTTCGAATG GATCAACACC AATCTTTATG GTGGCCTGTA CCGCCGCACA CAGAGTTTCT TCGACGAATC GGAATTAGCC TCCACCGGCC GCCCGGCGAG CGAGGCCGAA CGGCGCCTGC TCGCGCCGTT TCCCGGCGCG GTGCGTGAGG ATATTTTGGA AGGGCGCTGG CATCCACCGC AGACCGACGG CTCAGGGCAG GATCGCACCC AACCTCGCCA TGCTCTCGGG CTCCTGCACG AGGCAGGTTA TGATTTGAAG GACGGCCTCC TCTCCAAGGA GGGTAAGCCG CTCTCCTTCG AGATCATGGT CACGGATCGT AATCGAGAGA GGCTGGCGCT TGATTATGCC CGTTCGCTGA CCCGGATCGG CGTCGATGCC CATGTCCGCC TGGTCGATGA AGTTCAGTAT CAGCGGCGGC GCCAAAAATT CGATTTCGAC ATGATGATCG GCAGTTGGAT AGCCTCCGCC TCGCCCGGCA ATGAGCAGCG GTCACGCTGG GGCTCGAAAA GCGCCGATCA GGAAGCCTCG TTCAATCTTG CCGGTGTCAA ATCACCCGCC GTGGATGCGA TGATCAATCA TCTCCTCGCC GCACGAACGC ATGACGATTT CGTCACAGCC GTACGGGCCT ATGATCGTGT TCTGCTTTCA GGCTTTTACG TCGTGCCGCT GTTTCATTCG CCGACACAAT GGATCGCCGG AACGACACGG CTCGGCCGGC CCGATGTCCT GCCCCGCTAT GGCGCGCCGA GCGGCAGCGC GACCTTGGAA ACCTGGTGGA TGCGGCCGTA A
|
Protein sequence | MPCSCFKPVF LLLLLWQTSF GFASLGRAAE AAHAIAMHGE PALSADFPHF PYANPNASQG GTLRLGFQGT FDSLNPFNLK AGSTAQGLNG NIFEPLMTRS LDEPFTLYGL IAESIETDDS REFVIFHLNP KAHFSDGTPI TADDVLFSFN LLKTHGRPQH RVSYGMVKSA TAPDPLTVRY DLNSGQDREM PLMLALMPVL PKHLVNPATF DEATLNPPTG SGPYILTEVK PGERLILHRD PHYWAADLST RRGLFNFETI TIDYFRDANS LFEAFRAGLI DFREETSPAR WMKAYDFPAL TEGRIFKEAL PIGGPKGMEG FVFNLRRPLF TDIKVREALA SLFDFEWINT NLYGGLYRRT QSFFDESELA STGRPASEAE RRLLAPFPGA VREDILEGRW HPPQTDGSGQ DRTQPRHALG LLHEAGYDLK DGLLSKEGKP LSFEIMVTDR NRERLALDYA RSLTRIGVDA HVRLVDEVQY QRRRQKFDFD MMIGSWIASA SPGNEQRSRW GSKSADQEAS FNLAGVKSPA VDAMINHLLA ARTHDDFVTA VRAYDRVLLS GFYVVPLFHS PTQWIAGTTR LGRPDVLPRY GAPSGSATLE TWWMRP
|
| |