Gene Noc_1238 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1238 
Symbol 
ID3706389 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1356542 
End bp1359523 
Gene Length2982 bp 
Protein Length993 aa 
Translation table11 
GC content53% 
IMG OID637737740 
Productmolybdopterin oxidoreductase, iron-sulfur binding subunit 
Protein accessionYP_343269 
Protein GI77164744 
COG category[C] Energy production and conversion 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing
[COG0437] Fe-S-cluster-containing hydrogenase components 1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.23327 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGGAT CGAGTATAAA ACCTTTGGAT TTAGCGCCCA TTCGGGCCCG CCTTGCCGAG 
GCGCAAGGGC GTGCTTTCTG GAAAAGCTTG GAGGAGCTGG CGGGCAGTGA AGAATTCGAA
CGTTTTCTAT ACCAGGAATT CCCCTTTTTT CGCGAGCTGA GTCAGGCTTC GCTCAGCCGG
CGGGACTTTA TGCGGCTAAT GGGAGCTTCT TTAGCTCTCG CCGGCCTAAG TGCATGCAGC
ACCCCGCCAC CGGAGGAAAT CCTTCCTTAC ATACGGGCGC CGGAAGGTTT AGTGCCAGGT
GAGTCGCTGT TTTTTGCCAC GGCCATGCCT CTGGATGGCT TTGCAACCGG CGTGTTGGTG
GAAAGCCGGA TGGGTCGCCC CACCAAGGTG GAAGGTAATC CCCTGCATCC AGCGAGTTTG
GGGGGGACGG ATATTTTTGC CCAGGCTTCC GTGCTGCAAT TATGGGATCC CGACCGAGCT
CAGGTGATAA GTCACCGGGG AGAAATTAGC ACTTGGCAGA CTTTTTTAGC TGCAATGGGT
GAGAAGATGA GGACCTTTGA GGGCAACCAA GGCAAGGGGC TCTACCTATT AACGCCAACC
GTAAGCTCAC CAACGTTAAT TTCCCAGTTA CGGACGTTAG GCAAACGCTT TCCCCATGCC
CATTGGCATC AATATCAACC GATTAACCAA GATAATAGCT ATGAAGGGGC TCGTTTGGCC
TTTGGTGAGT CCCTTGAGAC CCGTTATCAC CTGGAGCGGG CTGAGGTAAT TTTGTCTCTT
GATGGGGACT TCCTTGGCTC GCTGCCAGGA CATTTGCGCT ATGCCCGCGA TTTTGCCAAG
AAGCGGCGGG TGGATTCGGC ACAAAGCACC ATGAACCGCT TGTATGTAGC TGAGAGTTCA
CCCACTATAA CCGGGACGAT GGCGGATCAT GGGGTTTCCC TGCGCGCTAG CCAAATAGAA
GTATTAGCCT TGCAGTTGGC CCGCGCACTA GGCATTGGCG TGCCCAGGAG GGAAGAGACG
GCTTCCGATT TGCCTGAACA ATGGGTGCGG GCCGTGGCCG AGGATTTACG CCAGCACCGC
GGGACTTCCT TGGTGATAAC GGGAGAAAAA CAGCCTCCCT TTGTTCATGG GTTGGTCCAT
GCCGTGAATC AAGCCTTGGG GAATGTGGGA ACGACGCTTA CTTACAGTGC GCCAAGAGCC
TTCAATCCAC GAAACCAAAA TGAATCCCTG AACCATCTGG TGGCTCAGAT GGACGCAGGC
AAAGTGGATA CGCTCATCAT GCTAGGAGGC AACCCCGCCT ATAACGCGCC GGCCGATCTG
GCCTTCTCCA AGCAGCTCGC TAAGGTGAAA TCATCGATTT ATCTCGGGCT ATATGAGGAT
GAAACCGCCG CTCATAGTCA CTGGCATATC CCCGAAACTC ATTACCTGGA GAGGTGGGGG
GATGCCCGTG CCTACGAGGG CACGGTAAGT TTATTGCAGC CCTTGATTGC ACCCCTTTAT
CAGGGCAAAT CAGATTATGA ATTGTTGGCG GTGTTGCTTG GCCAAACCGA CCGGAGCGAT
TATGACTGGG TGCGTGGATA CTGGCAAAAG CAGTGGCCGA AGTCGGATTT CAAAAGCATC
TGGAACCAGG CATTGCAGGC AGGTTTTATC GAAGGGACAG CATTAAGATC GAAATCCGTA
AAACTGCGTG ATGATTGGGT TGCCCATTTA TCGAGGGGAC AGTCTAAGAG CAAGGAAACT
TCAGGCATGG AAATTATCTT CATGCCTGAT CCAACGATTT GGGATGGGCA ATTTACCAAC
AATGGTTGGC TGCAAGAATT GCCTAAGCCG CTCACCAAGC TTACTTGGGA TAACGCCGCC
TTAATTAGCC CCCGGACGGC TGAGAATTTG GGACTTGCCA ATGAAGAAGT CGTGGCGCTT
CGCTACCAGG AACGCCAGGT TCAAGCGCCT ATCTGGATTA TGCCCGGGCA CCCAGAGGGT
GCGGTTACGG TGACGTTAGG CTATGGCCGC GCTAAAACCG GACAGGTGGG AGCGGGAACA
GGTTTTAATG CCTACGCGCT GCGCTCCTCC AGGGCGCCTT GGTTTGGGTG GGGATTGGAA
ATTGTCAAAA CTGGCAAGCA CCATTCTTTG GCCACGACCC AGCACCATCA TAGCATGGAG
GGGCGGGATA TCGTCCGAAC GGCGACCCTC TCTGAGTTTC GAGAGAACCC CCATTTCGCC
CAGCAGGAAT TACCTTCTAA AAGTCTTTAT CCTCAGTTCA ACTATTCAGG TTATGCCTGG
GGCATGACGA TTAATCAAAG TACCTGCATC GGCTGCAGCG CCTGTGTTGT GGCGTGCCAA
GCAGAAAACA ATATCCCCGT GGTGGGGAAG GAGCAGGTCA GTTTAGGGCG GGAGATGCAT
TGGCTGCGCA TTGATCGCTA CTACAGTGGA GGCTTGGATG ATCCGCGGAC TTATTTTCAG
CCAGTGCTCT GTATGCATTG TGAAAACGCG CCTTGTGAGC TGGTTTGTCC CACGGCAGCG
ACGGTGCATG ATTCAGAAGG ACTTAATCTT CAGGTTTACA ATCGTTGTAT TGGTACCCGT
TTTTGCTCTA ACAATTGCCC TTATAAGGTT AGGCGATTTA ATTTTTTGGA ATATGCTAAG
GAAACACCGT CTCTAGTAGC CCAAAAAAAT CCTGAAGTCA CCGTGCGAAT GCGGGGAGTG
ATGGAAAAAT GCAGCTATTG CATACAGCGC ATTAGCAATG CTCGTATTCA AGCGGAGCTA
GAAGAACGGC GTATTCAAGA TGGGGAAGTA TTAACTGCCT GCCAGGCAGC ATGCCCCACC
GAGGCCATTG TCTTTGGTGA TTTAAACGAT CCGGAAAGCC AGGTTGGCCA AGTAAAGGCA
TCACCGCTTA ATTATGCGCT CCTGGGCGAA CTTAACACCC GCCCCCGGAC TACCTATCTT
GCAAAATTAA CTAATCCGAA TCCTAAGCTC AAAGAAGAAT AG
 
Protein sequence
MAGSSIKPLD LAPIRARLAE AQGRAFWKSL EELAGSEEFE RFLYQEFPFF RELSQASLSR 
RDFMRLMGAS LALAGLSACS TPPPEEILPY IRAPEGLVPG ESLFFATAMP LDGFATGVLV
ESRMGRPTKV EGNPLHPASL GGTDIFAQAS VLQLWDPDRA QVISHRGEIS TWQTFLAAMG
EKMRTFEGNQ GKGLYLLTPT VSSPTLISQL RTLGKRFPHA HWHQYQPINQ DNSYEGARLA
FGESLETRYH LERAEVILSL DGDFLGSLPG HLRYARDFAK KRRVDSAQST MNRLYVAESS
PTITGTMADH GVSLRASQIE VLALQLARAL GIGVPRREET ASDLPEQWVR AVAEDLRQHR
GTSLVITGEK QPPFVHGLVH AVNQALGNVG TTLTYSAPRA FNPRNQNESL NHLVAQMDAG
KVDTLIMLGG NPAYNAPADL AFSKQLAKVK SSIYLGLYED ETAAHSHWHI PETHYLERWG
DARAYEGTVS LLQPLIAPLY QGKSDYELLA VLLGQTDRSD YDWVRGYWQK QWPKSDFKSI
WNQALQAGFI EGTALRSKSV KLRDDWVAHL SRGQSKSKET SGMEIIFMPD PTIWDGQFTN
NGWLQELPKP LTKLTWDNAA LISPRTAENL GLANEEVVAL RYQERQVQAP IWIMPGHPEG
AVTVTLGYGR AKTGQVGAGT GFNAYALRSS RAPWFGWGLE IVKTGKHHSL ATTQHHHSME
GRDIVRTATL SEFRENPHFA QQELPSKSLY PQFNYSGYAW GMTINQSTCI GCSACVVACQ
AENNIPVVGK EQVSLGREMH WLRIDRYYSG GLDDPRTYFQ PVLCMHCENA PCELVCPTAA
TVHDSEGLNL QVYNRCIGTR FCSNNCPYKV RRFNFLEYAK ETPSLVAQKN PEVTVRMRGV
MEKCSYCIQR ISNARIQAEL EERRIQDGEV LTACQAACPT EAIVFGDLND PESQVGQVKA
SPLNYALLGE LNTRPRTTYL AKLTNPNPKL KEE