Gene Nham_0202 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNham_0202 
Symbol 
ID4030662 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter hamburgensis X14 
KingdomBacteria 
Replicon accessionNC_007964 
Strand
Start bp225903 
End bp226823 
Gene Length921 bp 
Protein Length306 aa 
Translation table11 
GC content64% 
IMG OID637968737 
Productextracellular solute-binding protein 
Protein accessionYP_575562 
Protein GI92115833 
COG category[E] Amino acid transport and metabolism
[T] Signal transduction mechanisms 
COG ID[COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.725948 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAACCAC AGATTACTGT TTCCGCCAAC GAAAGATTGA TCCGCCGCCT TGCGACCATG 
CTGGGTTGTA TGCTGGTCGC CGGCTGGATG CTGGTTTTGG GAGCAGCGGT CGACGATGCG
CGCGCGCAAG CCGCAGCGAA GACCGCCACC GTCGCGCCGC AGGCGGTGCC GGGCTTCTGG
GATCCGCGCC GCCGTCCGGA TCGCCCCGAT CTGTCACGCA TCACCGTGAT CCGCTTTCTG
ACCGAGACCG ACTATCCGCC CTTCAACTTC ACCGGTCCCG ACGGCAATCC GGCCGGCTTC
AATGTCGATC TGGCGCGCGC CCTGTGCGAG GAAATCAAGA TCACCTGCAC GATTCAGATG
CGGCGCTTCG AGACGCTGGT GGACGCGCTC ACCAGCAACC GCGGCGACGC CATCATCGCC
TCGCTCGCGG TAACGCCGGA GCTGCGCAAG CGGGTGGACT TCACCGACCC GTACTATCGA
ACGCCGGCGC GATTCGTGTC GCGGCGCGAC GCCGTGATGG CCGAGGTGCG CCCGGAATAT
CTCGAGGGCA AGAAGGTCGG CGTGATCGCA GGGTCGGCGC ACGAGGCCTA TCTCAAGGTC
TTCTTCACCG ATGCCGAACT CCACACCTAT CCGAACGACG AGGCGCTGCG GCAGGCGCTG
CGGCGGGGCG AAGTCGACTT CATTTTCGGC GACGCCATTT CACTGGCGTT CTGGATCAAC
GGCACCGATT CGGAAGGCTG CTGCGCCTTC AGCGGCGGCC CCTTTGTCGA GAGCCGCTAT
TTTGGCGAAG GCGTCGGCAT CGCGGTGAAA AAGGGCAATG ACGTGCTGCG TCAGGCGCTG
AACTGGGCGC TGTTCCGGGT CTGGGAAAAA GGCCGCTATA CCGACCTGTG GTTGCGGTAT
TTTTCCGTCA GTCCGTTTTA G
 
Protein sequence
MQPQITVSAN ERLIRRLATM LGCMLVAGWM LVLGAAVDDA RAQAAAKTAT VAPQAVPGFW 
DPRRRPDRPD LSRITVIRFL TETDYPPFNF TGPDGNPAGF NVDLARALCE EIKITCTIQM
RRFETLVDAL TSNRGDAIIA SLAVTPELRK RVDFTDPYYR TPARFVSRRD AVMAEVRPEY
LEGKKVGVIA GSAHEAYLKV FFTDAELHTY PNDEALRQAL RRGEVDFIFG DAISLAFWIN
GTDSEGCCAF SGGPFVESRY FGEGVGIAVK KGNDVLRQAL NWALFRVWEK GRYTDLWLRY
FSVSPF