Gene Nham_2050 details

Gene Information

Locus tag: Nham_2050
Symbol:
ID: 4031470
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrobacter hamburgensis X14
Kingdom: Bacteria
Replicon accession: NC_007964
Strand:
Start bp: 2271562
End bp: 2273424
Gene Length: 1863 bp
Protein Length: 620 aa
Translation table: 11
GC content: 62%
IMG OID: 637970507
Product: extracellular solute-binding protein
Protein accession: YP_577308
Protein GI: 92117579
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4166] ABC-type oligopeptide transport system, periplasmic component
TIGRFAM ID:
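
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that does not contribute a residue to the protein; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2271562
end_bp = 2273424

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop codon.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)      # 1863
print(protein_length_aa)   # 620
```

Under these assumptions the computed values agree with the listed Gene Length (1863 bp) and Protein Length (620 aa).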


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTCCAAGG GCGAATCAAA ACGGACCCCG AGTTTGCGAT CGATACCGCG CAACCGCAGC 
CGGATAGGAG CGTGTCTGGG CCTCGCGGTC GGGCTGCTCG CTGCGGGGTC CGAAGGGATT
TCGGCGAGTC CTGACTATGC TATCGCGATG CACGGCACAC CAGCCTTGCC GGCCGGTTTC
AGCCAGATGC CCTATGTCAA TCCGGACGCG CCCAAGGGTG GCCGGCTGGT TCAGAGCGTT
CCGGGCAGCT TCGATAGCCT CAATCCCTTC ATCGTCAAAG GCGTTGCCCT CCAGCAAATA
CGGGGGTTCG TGGTCGAGAG CCTGATGGCC CGAGGCAACG ACGAACCCTT CACGCTCTAT
GGCCTCCTGG CGAACAGCGT TGAGACCGAC GACGCCCGAA CCCATGCCAC CTTCCACCTC
AACCCGCTGG CGCGTTTTTC CGACGGGCAG CCCGTCCGTG CCGAAGACGT GCTGTTCTCC
TGGCAACTCC TGCGAGACAA GGGCCGCCCC AACCATCGCC TGTATTATTC GAAGGTCGCA
ACGGCAAAGG CGATCGATGA ACGCACAGTG CGTTTCGATT TCGGCGGAAC CAGAGATCGC
GAACTGCCGC TGATCCTCGG GCTGATGCCG ATTTTGCCGA AACATGCGAT CGACGTCGCG
ACTTTCGAGC AGACCTCGAT GACGGCGCCG TTGGGTTCCG GGCCGTATCG CGTCACTGCG
GTAAAGCCCG GAGCCAGCGT CACGCTGACG CGCAATGCGG ACTATTGGGG GCGTGACCTG
CCGGTCAATC GTGGCCTGTG GAATTTCGAC GAGATCAGGT TCGACTTCTA TCGCGAAGCC
AACAGTCAGT TCGAAGCCTT CAAGCGCGGG CTGTACGATT TTCGCGTCGA GACCGAACCG
CTGCGCTGGC ACGATGGGTA CAATTTTCCG GCCGCCCGCA ACGGTCAGCT CGTTCGCGAA
ACCATCAAGA CGGGCCTGCC AGCGCCGTCG GAATTTCTGG TGTTCAATAC CCGACGTCAG
ATGTTCTCCG ACGTCCGCGT CCGCGAAGCG CTGACGCTGT TGTTCGATTT CGAGTGGATC
AACCGGAACT ATTTTTTCGG GCTCTACAGC CGCGCCGGGG GCTTCTTCGC GGGATCGGAA
CTGTCTGCCT ACGCACGCCC CGCCGACGAA CGGGAACGGT CGTTGTTGAA GCCGTTTGCG
TCGGCCGTGC GGCCCGATGT TCTCGACGGC AACTACCGTC TGCCGGTGAC AGACGGCTCA
GGCCGCGACC GCAAGGCCTT GCGCGCCGCC CTCGCCCTGC TGTCGCAAGC CGGTTACGAG
CTTGACGGGA CGGTCTTGCG TCACCGCTCA ACCAGGGCGC CCCTCGCCTT CGAAATCCTG
GTGACGACGC GCGATCAGGA ACGAATCGCG CTCACCTATG CGCGCGATCT CAAGCGGGCC
GGCATTGAGG TGTCCGTGCG CTCGGTCGAT GCCGTGCAGT TCGACCAGCG GCGGCTGAGC
TTCGATTTCG ACATGATCCA GAACCGCTGG GATCAATCGC TGTCGCCGGG CAACGAGCAG
TCATTTTACT GGGGCAGCGA GGCCGCCGAC ACCACCGGGA CTCGAAATTA CATGGGCGCG
AAGAATCCGG CGATCGATGC CATGATCGCC GCCCTGCTCG AGGCCCGGGA GCGTCCGGCT
TTCGTGGATG CGGTTCGCGC GCTCGACCGG GTCCTGATGT CCGGCTTCTA CGCAATTCCG
GTCTACAACG TTCGAGAGCA ATGGATTGCT CGCTGGAATC GGATAGAACG ACCGGCAGCG
ACCGCGTTGA CCGGATACCT CCCCGAAACA TGGTGGCAAA GGCCGCCGAA GTCGAGCCAG
TGA
 
Protein sequence
MSKGESKRTP SLRSIPRNRS RIGACLGLAV GLLAAGSEGI SASPDYAIAM HGTPALPAGF 
SQMPYVNPDA PKGGRLVQSV PGSFDSLNPF IVKGVALQQI RGFVVESLMA RGNDEPFTLY
GLLANSVETD DARTHATFHL NPLARFSDGQ PVRAEDVLFS WQLLRDKGRP NHRLYYSKVA
TAKAIDERTV RFDFGGTRDR ELPLILGLMP ILPKHAIDVA TFEQTSMTAP LGSGPYRVTA
VKPGASVTLT RNADYWGRDL PVNRGLWNFD EIRFDFYREA NSQFEAFKRG LYDFRVETEP
LRWHDGYNFP AARNGQLVRE TIKTGLPAPS EFLVFNTRRQ MFSDVRVREA LTLLFDFEWI
NRNYFFGLYS RAGGFFAGSE LSAYARPADE RERSLLKPFA SAVRPDVLDG NYRLPVTDGS
GRDRKALRAA LALLSQAGYE LDGTVLRHRS TRAPLAFEIL VTTRDQERIA LTYARDLKRA
GIEVSVRSVD AVQFDQRRLS FDFDMIQNRW DQSLSPGNEQ SFYWGSEAAD TTGTRNYMGA
KNPAIDAMIA ALLEARERPA FVDAVRALDR VLMSGFYAIP VYNVREQWIA RWNRIERPAA
TALTGYLPET WWQRPPKSSQ
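
The gene sequence begins with GTG while the protein sequence begins with M. This is expected under translation table 11, where GTG can serve as an initiation codon and is then read as methionine. The following is a minimal sketch of that relationship, not the pipeline used to produce this record. The names `translate`, `CODON_TABLE`, and `START_CODONS` are illustrative, and the codon table is built from the standard amino-acid assignments, which table 11 shares with the standard code.

```python
from itertools import product

# Standard amino-acid assignments in NCBI base order T, C, A, G
# (table 11 uses the same assignments; it differs in its start codons).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Initiation codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(blocked_dna):
    """Translate a DNA string that may contain spaces or line breaks."""
    seq = "".join(blocked_dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            # An initiation codon is read as methionine.
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            # Stop codon: translation ends here.
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("GTGTCCAAGG GCGAATCAAA ACGGACCCCG"))  # MSKGESKRTP
```

The printed peptide matches the first ten residues of the protein sequence. Passing the full gene sequence, with its spaces and line breaks left in, should reproduce the whole 620-residue protein under the same assumptions.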