Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bpro_3529 |
Symbol | |
ID | 4013792 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Polaromonas sp. JS666 |
Kingdom | Bacteria |
Replicon accession | NC_007948 |
Strand | - |
Start bp | 3731459 |
End bp | 3732835 |
Gene Length | 1377 bp |
Protein Length | 458 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637943191 |
Product | extracellular solute-binding protein |
Protein accession | YP_550335 |
Protein GI | 91789383 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00601615 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.181904 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTTCA AACTACTTGC CACAGCGGCG TTCATGACCA CGGCGCTTTT AGCCGGCGGC CATGCAGCAG CCCAGGAGAA ACTCACCGTC TGGTGGGTCA AGGGCTTCTA CAAGGCCGAA GACGATGCCC TGTTTGCGGC CATCAAGAAG TACGAAGACA AGCACAAGGG TGTCAAGATC GAGCTTTCGC AATACCCGAT TCAGGACATG ATTCCGAAGA CCGTGGCGGC GCTCGACTCG GGCAACCCGC CCGACGTGGC TTATGCGGAC GTCTATGACT TCCAGGTCAC GGGCAAGTGG GCTTTCGACG GCAAGCTCGA AGACATCAGC AGCGTGATCA ATCCCATGCG CGCCAAGTTC GCGCCCAACA CGGTCGAGAC CGCCTTCCTG TACAACGACC AGGCCAAGAC GCGGGCGTAC TACGCCTTCC CGATGAAGCA GCAGACCATG CACATCCAGT ACTGGGCTGA CATGCTGGCG GAGGCCGGGT TCAAGGAGTC GGACATTCCG ACGGCCTGGA AGGACTACTG GTCCTTCTGG TGCGACAAGG TGCAGCCGGC CTATCGTCAG AAAAGCGGCA ACCGCACTTT CGGCATCGGC CAGCCGCTGG GCGTGGATTC CAGCGACTCG TTCTACTCCT TCCTCACGTT CATGGACGCC TATAACGTGA GCCTGGTCAG CGACAGCGGC AAGCTGCTGG TGGATGATCC CAAGGTGCGC GCCGGCCTCA TCGGCGCCCT GACCGATTAC ACCCAGATTT ACGCCAGGGG CTGCACGCCA CCGTCGTCGA CCAGCTGGAA AGACCCGGAC AACAACGTCG CCTTCCACAA CAAAACGACC GTCCTCACCC ACAATGCGAC CATCTCGATC GCGGCCAAAT GGCTCGATGA CATGAACAAC GCCGCCTTGA AGCCCGAGGA TCGGGAGATC GCGAAGAAGA ACTACACGGA GCGCATCCGC ACCGCCGGTT TCCCGAGCAA GCCCGATGGC AGCAAGATGG TCTATCGTGC CGCGATCAAG ACCGGCGTGG TTTTCAGCCA GGCCAAGAAC AAGGCCCGCG CCAAGGAGTT TGTGGCCTTC CTGCTGCAGG ATGAAAACCT CACGCCGTAC GTCGAGGGCT CGCTGGGCCG CTGGTTCCCG GTGATGAAGG CGGCGCAGCA GCGCCCGTTC TGGAAAGCCG ATCCGCATCG CACAGCCGTG TACAACCAGT TCACGGCCGG CACGGTGAAT TTCGAGTTCA CCAAGAACTA CAAGTTCACC GTTCTCAACA ACGAGAACGT CTGGGCCAAG GCCGCGAACC GGGTTCTCAA CGAGAAAGTG CCGGTTGACA AGGCGGTCGA CGAAATGATC GCCCGCATCA AGACCGTCGC GAACTAA
|
Protein sequence | MKFKLLATAA FMTTALLAGG HAAAQEKLTV WWVKGFYKAE DDALFAAIKK YEDKHKGVKI ELSQYPIQDM IPKTVAALDS GNPPDVAYAD VYDFQVTGKW AFDGKLEDIS SVINPMRAKF APNTVETAFL YNDQAKTRAY YAFPMKQQTM HIQYWADMLA EAGFKESDIP TAWKDYWSFW CDKVQPAYRQ KSGNRTFGIG QPLGVDSSDS FYSFLTFMDA YNVSLVSDSG KLLVDDPKVR AGLIGALTDY TQIYARGCTP PSSTSWKDPD NNVAFHNKTT VLTHNATISI AAKWLDDMNN AALKPEDREI AKKNYTERIR TAGFPSKPDG SKMVYRAAIK TGVVFSQAKN KARAKEFVAF LLQDENLTPY VEGSLGRWFP VMKAAQQRPF WKADPHRTAV YNQFTAGTVN FEFTKNYKFT VLNNENVWAK AANRVLNEKV PVDKAVDEMI ARIKTVAN
|
| |