Gene Nham_1441 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNham_1441 
Symbol 
ID4032842 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter hamburgensis X14 
KingdomBacteria 
Replicon accessionNC_007964 
Strand
Start bp1624754 
End bp1626325 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content65% 
IMG OID637969915 
Productpeptidase S1C, Do 
Protein accessionYP_576723 
Protein GI92116994 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGACA GGCCCACTGA TCTCTCGCCG CCTTCTCCGC CCAAGCCCGC CCGCCGTTCC 
GGGCTCACCG CGCGCAAGTT CGCGCTGATG GCGTCGGTCG CCGCCGGCCT CGGCCTGGCC
GCATACGGGT TCAGTCCCCC GACGGACAAA GCCGATCTCT TCACCACCCC GGCCCATGCG
CAGGTTAATA ACGACGTCAG CAAGGTCGCG CAGCCGATCG GCTTTGCCGA TATCGTCGAG
CGTGTGAAGC CGTCGGTGAT TTCGGTCAAG GTCCGTATGA AGGACAAGGT GGTCGCCGGC
GACGACGATT CGCCGTTCCA GCCGGGTTCG CCGATGGAAC GCTTCTTCCG TCGCTTTGGC
GGACCGGATG GTCTGCCGCC GGGAATGCAT GGCCGGCGTG GCCATGGCAT CGTGACGGGA
CAGGGCTCCG GCTTCTTCAT TTCGGCCGAT GGCTACGCCG TAACCAACAA CCACGTCGTC
GACGACGCCG ACAAGGTCGA AGTCACGACC GATGGCGGCA AGACCTACAC CGCCAAGGTG
ATCGGAACCG ATCCGCGCAC CGACATTGCC CTGATCAAGG TCGAGGGCGG CTCGAATTTC
CCGTTTGCGA AGCTATCGGA CGACAAGGCG CGGATCGGCG ACTGGGTGCT CGCGGTCGGC
AACCCGTTCG GCCTCGGCGG CACCGTGACC TCCGGCATCG TGTCGGCCAT CGGCCGCGAC
ATCGGCAACG GTCCCTATGA CGATTTCATC CAGATCGACG CGCCCGTCAA CAAGGGTAAC
TCCGGCGGTC CGGCCTTCAA CATGAAGGGC GACGTGATCG GCGTGAACAC CGCGATCTAT
TCGCCGTCCG GCGGCAGCGT CGGCATCGCA TTTTCGATTC CCGCCAACAC CGTGAAGAAC
GTCGTCGCGC AACTCAAGGA CAAGGGGTCG GTGAGCCGCG GATGGATCGG CGTCCAGATC
CAGCCGGTCA CCGCTGACAT CGCCGATAGC CTCGGCCTCA AGAAAACCGA AGGCGCGCTG
GTGGCCGAGC CGCAGGCCGA CAGCCCGGCG GCCAAGGCCG GCATCCAGTC CGGCGACGTG
ATCACGCAAG TCAACGGTGC TACCGTCAAG GATGCGCGCG AACTGGCGCG GACCATCGGT
GGCCTTGCGC CGGGGACCTC GATCAAGCTG AACGTCCTGC ACAAGGGGAC GGACAAGGTC
ATCAACCTGA CGCTCGGCAT CCTGCCGAAC ACCGCGGAAG CAAAGGCCGA TCTATACGGC
GATGATCACG GGCCGTCGCA TGGCGACGAC GTTCCCAGGC TTGGCATGAC GGTTGCTCCG
GCGGGGTCGG TTGCCGGCGC CGGCACGGAG GGCGTGGTCG TCGTCGATGT CGATCCGAAG
AGCGCGGCCG CGGATCGCGG CTTCAAGGAA GGCGACGTGA TCCTCGAGGT CGGAGGAAAG
GACGTCTCCA ATTCGGCCGA CGTGCGCGAG GCGATCAAAG CGGCGCGCGC GGACCACAAG
AACAGTATTT TGATGCGGGT TCGCACCGGC GGCACGGCGC GTTTCGTCGC GGTTCCGATC
GCGAAGGGCT AG
 
Protein sequence
MTDRPTDLSP PSPPKPARRS GLTARKFALM ASVAAGLGLA AYGFSPPTDK ADLFTTPAHA 
QVNNDVSKVA QPIGFADIVE RVKPSVISVK VRMKDKVVAG DDDSPFQPGS PMERFFRRFG
GPDGLPPGMH GRRGHGIVTG QGSGFFISAD GYAVTNNHVV DDADKVEVTT DGGKTYTAKV
IGTDPRTDIA LIKVEGGSNF PFAKLSDDKA RIGDWVLAVG NPFGLGGTVT SGIVSAIGRD
IGNGPYDDFI QIDAPVNKGN SGGPAFNMKG DVIGVNTAIY SPSGGSVGIA FSIPANTVKN
VVAQLKDKGS VSRGWIGVQI QPVTADIADS LGLKKTEGAL VAEPQADSPA AKAGIQSGDV
ITQVNGATVK DARELARTIG GLAPGTSIKL NVLHKGTDKV INLTLGILPN TAEAKADLYG
DDHGPSHGDD VPRLGMTVAP AGSVAGAGTE GVVVVDVDPK SAAADRGFKE GDVILEVGGK
DVSNSADVRE AIKAARADHK NSILMRVRTG GTARFVAVPI AKG