Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bpro_0292 |
Symbol | |
ID | 4011663 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Polaromonas sp. JS666 |
Kingdom | Bacteria |
Replicon accession | NC_007948 |
Strand | - |
Start bp | 306642 |
End bp | 307997 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637939977 |
Product | ABC transporter nitrate-binding protein |
Protein accession | YP_547155 |
Protein GI | 91786203 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCAATC GCTTCGATGC CTATGATGCC GACCGACCAC TGATGCTGCG CTGCGCCTGC GGACAGGACC ATGCACCGGG CGAACATGAC GCGGCAGCGG CCAGCGCGCA GGAGCTGTCG CACAGCTTCA TGGAGGCCAG CCTGGTCAAG GCGCTGTTCC CGGTCGACAG CGTACGTCGC AGCTTCCTGC GGGCGGTGGG CGCCAACACC GCGCGCGCGG CGATTGCCTC CGTCTTTCCC ATGGGTGCGC TGCAGGCGAT GGCGCAGGAC CGCGGCCCGC TGGAAAAGAA AGACCTCAAG ATCGGCTTCA TCGCCATCAC CTGCGCCAGC CCGCTGATCA TGGCCGACCC GCTGGGCTTC TATAAAAAAG AAGGCCTCCA CGTCCAGCTC AACAAGACCG CCGGCTGGGC GTTGATTCGC GACAAGATGA TCAACAAGGA GCATGACGCG TCGCACTTCC TGTCGCCCAT GCCGCTGGCC ATGACCATGG GCCTCGGGTC CAACCAGGTC AACATGAATG TAGCGAGCAT CCAGAACACC AACGGCCAGG CCATCACCCT GCACGTCAAG CACAAGAACA ACCGCGACCC GAAGAACTGG AAAGGCTTCA AGTTCGCGAT TCCGTTCGAG TACAGCATGC ACAACTTTTT GCTGCGCTAT TACCTGGCCG AGAACGGGCT GAACCCCGAC ACCGACGTGC AGCTGCGCGT GGTGCCGCCG CCGGAGATGG TGGCCAACCT GCGCGCCGGC AACATCGACG GCTTCCTCGG CCCCGACCCT TTCAACCAGC GCGCGGTCTA CGACGAAGTG GGCTTCATCC ACATCCTGTC CAAGGACATC TGGGACGGCC ACCCCTGCTG CGCCTTTGGC GTGAGCGACG AGTTCATCCA GAAAAATCCC AACACCTTTG CGGCCCTCTA CCGCGCGGTG CTGGGCGCCT CCTTCATGGC CAGCCAGCCC AAGGACCGCG ACCTGATTGC CAAGGTGATC GCGCCGACGC AATACCTCAA CCAGCCCGAG GCGGTGCTGC AGCAGGTGCT GACCGGCAAG TTTGCCGATG GCCTGGGCAA CATCAAAAAT GTGCCCGATC GTGCCAACTT CGACCCGGTG CCGTGGCAAA GCATGGCGGT CTGGATGCTG ACCCAGATGA AACGCTGGGG CTATGTGAAG GGCGACGTCA ACTACCGGCA GATTGCCGAA AAGGTGTTCC TGCTGACCGA AGCCAAGAAA CAGATGTCGC TCGCCGGCTT CAAGCCGCCG GAAGGTGCCT ACAAAAAGTT CAAGGTCATG GGCAAGGAGT TTGACCCCGC CAAGCCCGAG GACTATGTGA AGAGCTTTGC CATCAATAAG CTATGA
|
Protein sequence | MSNRFDAYDA DRPLMLRCAC GQDHAPGEHD AAAASAQELS HSFMEASLVK ALFPVDSVRR SFLRAVGANT ARAAIASVFP MGALQAMAQD RGPLEKKDLK IGFIAITCAS PLIMADPLGF YKKEGLHVQL NKTAGWALIR DKMINKEHDA SHFLSPMPLA MTMGLGSNQV NMNVASIQNT NGQAITLHVK HKNNRDPKNW KGFKFAIPFE YSMHNFLLRY YLAENGLNPD TDVQLRVVPP PEMVANLRAG NIDGFLGPDP FNQRAVYDEV GFIHILSKDI WDGHPCCAFG VSDEFIQKNP NTFAALYRAV LGASFMASQP KDRDLIAKVI APTQYLNQPE AVLQQVLTGK FADGLGNIKN VPDRANFDPV PWQSMAVWML TQMKRWGYVK GDVNYRQIAE KVFLLTEAKK QMSLAGFKPP EGAYKKFKVM GKEFDPAKPE DYVKSFAINK L
|
| |