Gene Noc_2341 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2341 
SymbolhslU 
ID3704763 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2685411 
End bp2686754 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content49% 
IMG OID637738824 
ProductATP-dependent protease ATP-binding subunit HslU 
Protein accessionYP_344329 
Protein GI77165804 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1220] ATP-dependent protease HslVU (ClpYQ), ATPase subunit 
TIGRFAM ID[TIGR00390] ATP-dependent protease HslVU, ATPase subunit 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.320663 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCGAAC TAATCCCCCA GCAAGAACTA ACCCCTCGGC AAATTGTTCA GGAACTGGAT 
AAACATATTA TTGGACAAAC CGCTGCTAAG CGGGCTGTTG CTGTTGCCCT CCGTAACCGC
TGGCGTCGGC GCCAGGTGAG TGAAGAGCTG CGCAACGAAA TCACTCCCAA AAATATTCTT
ATGATTGGCC CCACCGGCGT GGGTAAAACC GAGATTGCCC GGCGCCTTGC AAAACTAGCG
AATGCTCCCT TTATGAAAGT CGAAGCAACT AAGTTTACCG AAGTAGGCTA TGTTGGCCGA
GACGTAGAAT CCATCATTCG GGATCTCGTG GACATTGCCA TCAAAATAAC TCGCGAGCAG
GAAACGGCTA AGGTCCGCAA CCAAGCGGAA GATAGAGCAG AGGAGCGAAT TCTTGATGCT
CTCTTACCCG CTGCACGGGC TGCGCCCCAT GATACGATGG ATGAAGAATC AAATACTCGG
CAAAAATTTC GCAAAATGCT GCGGGAAGGA AGATTAGATG ACCGAGAAAT CGAGATTGAG
TTAGCTGCCG TTTCCATGGG GGTTGAAATT ATGGCCCCCC CAGGGATGGA AGAAATGACT
AGCCAGCTCC AAAATATGTT CCAAAATTTG GGGGGCACTC GTACCCGAGC GCGCCGACTA
CGAGTCCGCG AAGCCTTCAA GCTACTGACT GAAGAGGAGG CTGGAAAGCT CATCAATGAT
GAAGATTTAA AAGCGCGATC CCTAGAAAAT GTGGAACAAA ATGGTATTGT ATTTCTTGAC
GAAATGGACA AAATCGCCAA GCGTTCGGAA TTCTCTGGTA CAGATGTCTC CCGTGAAGGC
GTGCAGCGCG ATTTACTCCC TTTGGTAGAA GGCAGCGCGG TCTCCACAAA GTATGGTATG
GTGCGCACCG ACCATATCCT ATTTATTGCA TCTGGGGCTT TTCATCTAGC TAAGCCCTCA
GATCTCATCC CGGAAATGCA AGGACGGTTA CCCATTCGAG TTGAGCTAGG CGCCCTTAGC
GTTGACGACT TCGTACGTAT CTTAACTGAG CCGAATGCCT CTCTCACTGA GCAATATACT
GCATTATTAA AAACTGAAGG AATCTCCTTA CATTTCACCG AGGAAGGTAT TGCACATATC
GCCCAAATCG CTTGGCAGGT TAATGAACGT ACCGAGAATA TTGGTGCCCG CCGATTGCAT
ACTGTCATGG AGCGCCTCCT AGAAGGGCTC TCCTTTGAAG CGGAAAACCA TACTAGTAAA
AAAGTTATTA TTGACGCCGC CTATGTAGAT GCTCAGCTAG CCGACTTGGC CCGGGATGAA
GACTTGTCAC GCTATATCCT CTAG
 
Protein sequence
MPELIPQQEL TPRQIVQELD KHIIGQTAAK RAVAVALRNR WRRRQVSEEL RNEITPKNIL 
MIGPTGVGKT EIARRLAKLA NAPFMKVEAT KFTEVGYVGR DVESIIRDLV DIAIKITREQ
ETAKVRNQAE DRAEERILDA LLPAARAAPH DTMDEESNTR QKFRKMLREG RLDDREIEIE
LAAVSMGVEI MAPPGMEEMT SQLQNMFQNL GGTRTRARRL RVREAFKLLT EEEAGKLIND
EDLKARSLEN VEQNGIVFLD EMDKIAKRSE FSGTDVSREG VQRDLLPLVE GSAVSTKYGM
VRTDHILFIA SGAFHLAKPS DLIPEMQGRL PIRVELGALS VDDFVRILTE PNASLTEQYT
ALLKTEGISL HFTEEGIAHI AQIAWQVNER TENIGARRLH TVMERLLEGL SFEAENHTSK
KVIIDAAYVD AQLADLARDE DLSRYIL