Gene Noc_0024 details


Gene Information

Locus tag: Noc_0024
Symbol:
ID: 3705957
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 20172
End bp: 21842
Gene length: 1671 bp
Protein length: 556 aa
Translation table: 11
GC content: 46%
IMG OID: 637736548
Product: DNA/RNA non-specific endonuclease
Protein accession: YP_342096
Protein GI: 77163571
COG category: [C] Energy production and conversion
COG ID: [COG1229] Formylmethanofuran dehydrogenase subunit A
TIGRFAM ID: [TIGR03121] formylmethanofuran dehydrogenase subunit A
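
The start, end, gene length and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(20172, 21842)
print(length, protein_length(length))  # 1671 556
```
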


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTCATTA AACTCACCGG CGGTACCGTT TATGATCCCA TGCATGGGAT TAATGGCGAA 
GTTCGTGATA TTTATATCCG AGATGGCCGC ATTATCAATC CCCCCCCCGG CGATATGTCT
ATCGATCAAG AGTATCCCTT AAATAATAAA ATTATCATGG CTGGAGCGAT CGATATCCAT
AGTCATATCG GTGGAGGTAA CGTTAATATC GCCCGAACAT TGCTGCCGGA GGGTCATCAC
ACTAATTTAT TACCCCGTAC TGAACTATTA CGCGCAGGAT CTGGCCGCGC GATTCCCAGC
ACCTTCGCTA CCGGTTATCG TTATGCGGAA ATGGGTTATA CGGCGGTCTT CGAACCTGCC
GTATTACCTA TGAATGCCCG CCAAGCCCAC ATGGAAATGG GGGATACTCC CTTGGTAGAC
AAAGGTGGCT ATGCCTTATT GGGTAATGAT GATTATGGGT TACGGATGTT GGCAGCTAAT
AAGGATCAAA AAACCTTTAA CAATTACGTG GCCTGGATTC TCAAAGCCAG CCAGTGTCTA
GGTATCAAAG TAGTCAATCC AGGTGGAATT AATGCCTTCA AATTCAATCA ACGCCGCCTC
GATCTGGATG AACCAGGTCC CTTTTATGGG GTTACACCAC GACAAATATT GCTTCGCCTC
GCACGTGCAG TACATGAATT GGATATTCCT CACCCCATTC ATGTCCATGG CTGTAACTTG
GGAGTACCAG GCAATCTAAA AACCACTTTA AGCACTATTG AGGGGATCGC GGGCCTCCCT
ATGCATCTTG CCCATATTCA ATTTCATAGT TACGGCGCTG AAGGTGATCG GAAATTCTCA
TCAGGGGCAG CTCAAATTGC AGAAGCGGTA AATAGACACC CGAATATTAC CGTTGATGTC
GGTCAAATTC TGTTTGGGCA AACGGTAACC GTATCCAGTG ACACCATGCA GCAATATGCC
AGCCATCCCC ATGCTTATCC TAAGAAATGG GCCTTCATGG ATATTGAATG TGACGCTGGC
TGCGGTATTG TCCCTTTTAA GTACCAAGAC AAGCATTTTG TTAATGCTCT CCAGTGGGCT
ATTGGCTTAG AGATCTTTCT TTTAGTGGAT GATCCCTGGC GGGTTTTTCT TACGACTGAT
CATCCTAATG GTGCCCCTTT TGTTTCTTAC CCTCATCTTA TCCGGCTATT AATGGACCGA
AGTTTCCGTA ATGATATGTT AGCCACTATC CATCCGGAGG CTGCCCAGGC CAGTACTTTA
GGCACTATTA CCCGGGAATA TTCTCTTTAT GAGATTGCTA TCATGACCCG GGCTGGAGCG
GCTAAACTGC TTGGTCTTTC AGACCGAGGG CATCTAGGTA TTGGGGCAGC CGCTGATATT
ACCGTCTACA CGGAGCAAAA AGACAAGGAA AAAATGTTTT CTAAACCCGA CTATGTCTTT
AAGGATGGGG AACTCGTCGT TAGAAACGGG GAAATCGTCA AGGTCACTTG GGGCGCAACC
CATGTAGTCC GGCCGGAGTT CGATAATAGT ATAGAAAAAG AGCTTTCCAG CTACTTTGAT
CGTTATCTTC CCATGAAGAT TAGCAATTTT AAAATAAACG ATGAGGAAAT GACTTATTTC
GGGCGAGGTC ATATCCAAGT TCATCCCTGC CGGGAGAGGA ATAGCTTCTG A
 
Protein sequence
MLIKLTGGTV YDPMHGINGE VRDIYIRDGR IINPPPGDMS IDQEYPLNNK IIMAGAIDIH 
SHIGGGNVNI ARTLLPEGHH TNLLPRTELL RAGSGRAIPS TFATGYRYAE MGYTAVFEPA
VLPMNARQAH MEMGDTPLVD KGGYALLGND DYGLRMLAAN KDQKTFNNYV AWILKASQCL
GIKVVNPGGI NAFKFNQRRL DLDEPGPFYG VTPRQILLRL ARAVHELDIP HPIHVHGCNL
GVPGNLKTTL STIEGIAGLP MHLAHIQFHS YGAEGDRKFS SGAAQIAEAV NRHPNITVDV
GQILFGQTVT VSSDTMQQYA SHPHAYPKKW AFMDIECDAG CGIVPFKYQD KHFVNALQWA
IGLEIFLLVD DPWRVFLTTD HPNGAPFVSY PHLIRLLMDR SFRNDMLATI HPEAAQASTL
GTITREYSLY EIAIMTRAGA AKLLGLSDRG HLGIGAAADI TVYTEQKDKE KMFSKPDYVF
KDGELVVRNG EIVKVTWGAT HVVRPEFDNS IEKELSSYFD RYLPMKISNF KINDEEMTYF
GRGHIQVHPC RERNSF
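
The protein sequence corresponds to the gene sequence read codon by codon. The following is a minimal sketch of that translation, assuming the standard codon-to-amino-acid assignments (which translation table 11 shares with the standard code; the two differ only in which codons may act as starts). The names `CODON_TABLE` and `translate` are illustrative only. The sketch strips the spaces and line breaks used to lay the sequence out in blocks of ten, and stops at the first stop codon.

```python
BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove block spacing and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First row of the gene sequence from this record.
first_row = "ATGCTCATTA AACTCACCGG CGGTACCGTT TATGATCCCA TGCATGGGAT TAATGGCGAA"
print(translate(first_row))  # MLIKLTGGTVYDPMHGINGE
```

The printed residues match the first two blocks of the protein sequence listed above.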