Gene Information |
Locus tag | Nham_2050 |
Symbol | |
ID | 4031470 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrobacter hamburgensis X14 |
Kingdom | Bacteria |
Replicon accession | NC_007964 |
Strand | + |
Start bp | 2271562 |
End bp | 2273424 |
Gene Length | 1863 bp |
Protein Length | 620 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637970507 |
Product | extracellular solute-binding protein |
Protein accession | YP_577308 |
Protein GI | 92117579 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
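The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon. The record does not state either convention, but the reported numbers are consistent with both. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 2271562
END_BP = 2273424
GENE_LENGTH_BP = 1863
PROTEIN_LENGTH_AA = 620


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)        # True
print(coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

Both checks pass for this record: 2273424 - 2271562 + 1 = 1863 bp, and 1863 / 3 - 1 = 620 aa.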
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTCCAAGG GCGAATCAAA ACGGACCCCG AGTTTGCGAT CGATACCGCG CAACCGCAGC CGGATAGGAG CGTGTCTGGG CCTCGCGGTC GGGCTGCTCG CTGCGGGGTC CGAAGGGATT TCGGCGAGTC CTGACTATGC TATCGCGATG CACGGCACAC CAGCCTTGCC GGCCGGTTTC AGCCAGATGC CCTATGTCAA TCCGGACGCG CCCAAGGGTG GCCGGCTGGT TCAGAGCGTT CCGGGCAGCT TCGATAGCCT CAATCCCTTC ATCGTCAAAG GCGTTGCCCT CCAGCAAATA CGGGGGTTCG TGGTCGAGAG CCTGATGGCC CGAGGCAACG ACGAACCCTT CACGCTCTAT GGCCTCCTGG CGAACAGCGT TGAGACCGAC GACGCCCGAA CCCATGCCAC CTTCCACCTC AACCCGCTGG CGCGTTTTTC CGACGGGCAG CCCGTCCGTG CCGAAGACGT GCTGTTCTCC TGGCAACTCC TGCGAGACAA GGGCCGCCCC AACCATCGCC TGTATTATTC GAAGGTCGCA ACGGCAAAGG CGATCGATGA ACGCACAGTG CGTTTCGATT TCGGCGGAAC CAGAGATCGC GAACTGCCGC TGATCCTCGG GCTGATGCCG ATTTTGCCGA AACATGCGAT CGACGTCGCG ACTTTCGAGC AGACCTCGAT GACGGCGCCG TTGGGTTCCG GGCCGTATCG CGTCACTGCG GTAAAGCCCG GAGCCAGCGT CACGCTGACG CGCAATGCGG ACTATTGGGG GCGTGACCTG CCGGTCAATC GTGGCCTGTG GAATTTCGAC GAGATCAGGT TCGACTTCTA TCGCGAAGCC AACAGTCAGT TCGAAGCCTT CAAGCGCGGG CTGTACGATT TTCGCGTCGA GACCGAACCG CTGCGCTGGC ACGATGGGTA CAATTTTCCG GCCGCCCGCA ACGGTCAGCT CGTTCGCGAA ACCATCAAGA CGGGCCTGCC AGCGCCGTCG GAATTTCTGG TGTTCAATAC CCGACGTCAG ATGTTCTCCG ACGTCCGCGT CCGCGAAGCG CTGACGCTGT TGTTCGATTT CGAGTGGATC AACCGGAACT ATTTTTTCGG GCTCTACAGC CGCGCCGGGG GCTTCTTCGC GGGATCGGAA CTGTCTGCCT ACGCACGCCC CGCCGACGAA CGGGAACGGT CGTTGTTGAA GCCGTTTGCG TCGGCCGTGC GGCCCGATGT TCTCGACGGC AACTACCGTC TGCCGGTGAC AGACGGCTCA GGCCGCGACC GCAAGGCCTT GCGCGCCGCC CTCGCCCTGC TGTCGCAAGC CGGTTACGAG CTTGACGGGA CGGTCTTGCG TCACCGCTCA ACCAGGGCGC CCCTCGCCTT CGAAATCCTG GTGACGACGC GCGATCAGGA ACGAATCGCG CTCACCTATG CGCGCGATCT CAAGCGGGCC GGCATTGAGG TGTCCGTGCG CTCGGTCGAT GCCGTGCAGT TCGACCAGCG GCGGCTGAGC TTCGATTTCG ACATGATCCA GAACCGCTGG GATCAATCGC TGTCGCCGGG CAACGAGCAG TCATTTTACT GGGGCAGCGA GGCCGCCGAC ACCACCGGGA CTCGAAATTA CATGGGCGCG AAGAATCCGG CGATCGATGC CATGATCGCC GCCCTGCTCG AGGCCCGGGA GCGTCCGGCT TTCGTGGATG CGGTTCGCGC GCTCGACCGG GTCCTGATGT CCGGCTTCTA CGCAATTCCG GTCTACAACG TTCGAGAGCA ATGGATTGCT CGCTGGAATC GGATAGAACG ACCGGCAGCG 
ACCGCGTTGA CCGGATACCT CCCCGAAACA TGGTGGCAAA GGCCGCCGAA GTCGAGCCAG TGA
|
Protein sequence | MSKGESKRTP SLRSIPRNRS RIGACLGLAV GLLAAGSEGI SASPDYAIAM HGTPALPAGF SQMPYVNPDA PKGGRLVQSV PGSFDSLNPF IVKGVALQQI RGFVVESLMA RGNDEPFTLY GLLANSVETD DARTHATFHL NPLARFSDGQ PVRAEDVLFS WQLLRDKGRP NHRLYYSKVA TAKAIDERTV RFDFGGTRDR ELPLILGLMP ILPKHAIDVA TFEQTSMTAP LGSGPYRVTA VKPGASVTLT RNADYWGRDL PVNRGLWNFD EIRFDFYREA NSQFEAFKRG LYDFRVETEP LRWHDGYNFP AARNGQLVRE TIKTGLPAPS EFLVFNTRRQ MFSDVRVREA LTLLFDFEWI NRNYFFGLYS RAGGFFAGSE LSAYARPADE RERSLLKPFA SAVRPDVLDG NYRLPVTDGS GRDRKALRAA LALLSQAGYE LDGTVLRHRS TRAPLAFEIL VTTRDQERIA LTYARDLKRA GIEVSVRSVD AVQFDQRRLS FDFDMIQNRW DQSLSPGNEQ SFYWGSEAAD TTGTRNYMGA KNPAIDAMIA ALLEARERPA FVDAVRALDR VLMSGFYAIP VYNVREQWIA RWNRIERPAA TALTGYLPET WWQRPPKSSQ
|
| |
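The gene sequence begins with GTG while the protein sequence begins with M. Under translation table 11, GTG can act as an initiation codon and is then read as methionine. The sketch below illustrates this on the first 30 bases of the gene sequence. It assumes the standard codon assignments (which table 11 shares for elongation) and treats only ATG, GTG, and TTG as start codons, a simplification since table 11 permits others. `translate` is a hypothetical helper, not a function from any particular library.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a common subset of the initiation codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop the spacing used in the record
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("GTGTCCAAGG GCGAATCAAA ACGGACCCCG"))  # MSKGESKRTP
```

The output matches the first ten residues of the protein sequence in the record.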