Gene Noc_2569 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2569 
Symbol 
ID3704573 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2920559 
End bp2922484 
Gene Length1926 bp 
Protein Length641 aa 
Translation table11 
GC content51% 
IMG OID637739049 
Productpeptidase M41, FtsH 
Protein accessionYP_344552 
Protein GI77166027 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0465] ATP-dependent Zn proteases 
TIGRFAM ID[TIGR01241] ATP-dependent metalloprotease FtsH 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000845189 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAGCGACA TGGCAAAAAA TATTATTTTG TGGGTAGTGA TTGCCCTAGT ACTGATGTCG 
GTTTTTAACA GCTTCGACAC GCGTCAGGTA AGCGGCCATC ATATCGATTA CTCTCGATTC
ATCGCCGATG TTAAAAGTGG GCAAGTGAAT AAAGTGGTGA TCGACGGGCG TCACATCAGT
GGTGAAACAA GCGAAGGGAA ACACTTTACA ACCTATAGTC CAGGTAATGA TCCAGGTTTG
ATCGGCGATC TCTTGGGTAA TGGTGTTGTT ATTGAAGCCA AGCCAGAAGA GGGAACAGGG
CTTTTGATGC AGGTTTTTAT CTCTTGGTTC CCTATGTTAT TGCTCATTGC AGTATGGATT
TTTTTCATGC GGCAGATGCA GGGCGGAGCT GGAGGCCGTG GAGCAATGTC GTTTGGTAAA
AGCCGTGCTC GCATGCTGAG CGAAGAGCAG GTGAAAGTTA CCTTTAGTGA TGTTGCTGGA
TGTGACGAAG CAAAGGAAGA AGTTCAGGAA TTGGTAGAGT TCTTGCGCGA GCCCGGTCGT
TTTCAAAAAT TAGGAGGTAA AATTCCTCGG GGCGTGCTTA TGGTAGGACC GCCAGGAACA
GGAAAAACCT TATTGGCAAG GGCGATTGCT GGCGAAGCCA AAGTCCCCTT CTTCACCATT
TCCGGCTCTG ATTTTGTGGA AATGTTTGTG GGCGTGGGCG CCTCGCGAGT GCGAGATATG
TTTGAGAACG CCAAAAAGCA CGCGCCTTGC ATTATTTTCA TCGACGAAAT TGATGCCGTT
GGGCGTCAGC GCGGTGCCGG CCTTGGAGGC GGCCATGATG AGCGGGAGCA AACCCTCAAT
CAAATGCTGG TGGAGATGGA TGGGTTTGAA GGCAACGAAG GCGTTATTGT TATTGCTGCG
ACCAACCGCC CTGATGTGCT GGATCCGGCT CTGTTGCGGC CAGGACGCTT TGATCGGCAG
GTTGTGGTTT CTCTCCCAGA TATCCGAGGG CGAGCGCAAA TCCTTAAAGT TCACCTTCGT
AAGGTCCCAG TCGCGGAAGA TGTGGAGCCG GCCCTCATTG CACGAGGTAC CCCGGGTTTC
TCTGGCGCCG ATCTTGCTAA CTTGGTCAAT GAGGCCGCTC TGTTTGCCGC TCGTGGCAGC
AAGCGCTTGG TCGATATGCA AGACTTGGAG CAAGCCAAGG ATAAAATCCT GATGGGTGTC
GAACGGCGTT CAGCAGTGAT GAGTGAGGAC GACAAGAGGC TCACTGCTTA TCATGAAGCT
GGGCACGCTA TCATTGGCCG TTTGGTACCC TCGCACGATC CAGTTTACAA GGTAAGCATT
ATCCCTCGGG GCCGAGCGCT AGGCGTTACT ATGTTCTTGC CGGAAGAAGA TCGCTACAGC
CTGAGCAAGC TACAGATAGA GAGTCAGATT TCCAGCCTCT TTGGCGGGCG CTTAGCTGAA
GAATTGATCT TTGGCGTGGA GTACGTAACT ACGGGAGCTT CTAATGATAT CCAGCGTGCT
ACCGAGTTAG CCCGTAATAT GGTGACTAAA TGGGGACTTT CGGAAAAGCT GGGCCCACTG
GCCTATGGCG AAGAGGAAGG CGAAGTGTTT CTGGGACATT CTGTGACCCA GCATAAGGGT
ATTGCGGATA CGACGGCTTC AGAAATTGAT ACCGAAATAC GGGCTATTAT TGACCGCAAT
TACCTGCGGG CAAAACAGCT CTTAGAGGAG AATATGGACA AATTGCACGT TATGTCCGAT
GCTCTAATGA AATATGAAAC CATTGATAAG GAACAAATTG ATGACATTAT GGCTGGTAAA
GAACCACGAC CACCTAAAGT AAGCGGGTCT GATGTGGAAC CGCCCAGTGG GAGTGATGCA
GTGCCACCTA AAGGGAAGGA AGAACAGCCT GTAGGGGGGG GATCTATCCC TGCTAGCCAG
CATTAA
 
Protein sequence
MSDMAKNIIL WVVIALVLMS VFNSFDTRQV SGHHIDYSRF IADVKSGQVN KVVIDGRHIS 
GETSEGKHFT TYSPGNDPGL IGDLLGNGVV IEAKPEEGTG LLMQVFISWF PMLLLIAVWI
FFMRQMQGGA GGRGAMSFGK SRARMLSEEQ VKVTFSDVAG CDEAKEEVQE LVEFLREPGR
FQKLGGKIPR GVLMVGPPGT GKTLLARAIA GEAKVPFFTI SGSDFVEMFV GVGASRVRDM
FENAKKHAPC IIFIDEIDAV GRQRGAGLGG GHDEREQTLN QMLVEMDGFE GNEGVIVIAA
TNRPDVLDPA LLRPGRFDRQ VVVSLPDIRG RAQILKVHLR KVPVAEDVEP ALIARGTPGF
SGADLANLVN EAALFAARGS KRLVDMQDLE QAKDKILMGV ERRSAVMSED DKRLTAYHEA
GHAIIGRLVP SHDPVYKVSI IPRGRALGVT MFLPEEDRYS LSKLQIESQI SSLFGGRLAE
ELIFGVEYVT TGASNDIQRA TELARNMVTK WGLSEKLGPL AYGEEEGEVF LGHSVTQHKG
IADTTASEID TEIRAIIDRN YLRAKQLLEE NMDKLHVMSD ALMKYETIDK EQIDDIMAGK
EPRPPKVSGS DVEPPSGSDA VPPKGKEEQP VGGGSIPASQ H