Gene Noc_2006 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2006 
Symbol 
ID3705196 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2313877 
End bp2315133 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content51% 
IMG OID637738483 
Productputative oxidoreductase 
Protein accessionYP_343998 
Protein GI77165473 
COG category[R] General function prediction only 
COG ID[COG0446] Uncharacterized NAD(FAD)-dependent dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.113232 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCCAGA CTCATCACAG GCTGCTCATT GTGGGTGGCG GCGCCGCTGG TATTTCAGTG 
GCGGCCAATA TGCGCCGTAA GGATAAAGCC ATGGACATTG CTATTATCGA ACCTAGTGAG
GTTCATTACT ATCAACCCGC ATTTACGTTA GTGGGGGGCG GCGTCTATGA TTTTGACAAG
ACCAAGCGCC AGGAACAGGA TCTGATTCCT AAGGAGGTGG AATGGATTCG CGACTATGCC
GAATCCTTTC AGCCAGAGAG TAATTCGGTT ACATTGCGCT CGGGAAGTTC GGTGAGTTAC
GATTATTTGG TCGTTTGCCC TGGTATTCAG CTTGATTGGC AGAAAATCGA GGGACTTAAA
GAGACTCTTG GCAAAAACGG CGTCAGCAGC AATTATTCTC CGCACACGGC GAGCTATACT
TGGGAATGCC TCCGGGATTT CCAGGGAGGC ACAGCCTTGT TTACCCAGCC GCCGATGCCT
ATCAAGTGCG CTGGTGCGCC ACAAAAAGTG ATGTATCTAG CCGCAGAGCG GTTTCGCCAG
CGTAAGGTAC TCGATAAAGC TAACCTAGAA TTCTGCAATG CCGGTCCCAC GATGTTCGGT
GTTCCGTTTT TCGCCGAAGC TCTAGATAAA GTGGTAGCCG GTTACGGCAT TAAAGCGAAT
TTCGGTTGTA ACTTAGTTGC CATAGACGGA CCCGGCCATA CCGCAACTTT TGAGACGACC
GGGGCCGACG GTAGCAAAGA AAGAATCAAT AAATCCTTCG ATTTTATCCA TGTAACGCCG
CCTCAAAGTG CCCCTGACTT TATTAAAAAC AGCCCTCTGG CTAATGCCGC AGGATGGGTG
GAATTAGATG AAAATACTTT GCAGCACCCC CGCTTCAGTA ATATTTTTGG TCTTGGCGAT
GCGGGCTCGA CGAGTAACGC CAAAACCGCT GCAGCAGTGC GCAAGCAGGT TCCGGTCGTG
GTGCAGAATA TCCTGGCGCT GATAAACGAC AAAGCACTCG AGCCAAAATA CGATGGCTAT
GGCTCGTGCC CACTGACCAC TTCATTGCAC CGTGTGATGC TGGCCGAATT CTCCTACGGC
GGCAAGGTAA CCCCATCGTT TCCTATACTT GACCCACGCA GTAACCGCCT AATCTGGTGG
TGGCTGAAAA AATACGGCCT CCCGCCCCTG TACTGGGACT ACATGCTGAA AGGCTATGAC
TGGGATATTC CGCACAAGGC ATCTTATGCA GAGAAACTGG TTGCGGCAAC CGCATAA
 
Protein sequence
MAQTHHRLLI VGGGAAGISV AANMRRKDKA MDIAIIEPSE VHYYQPAFTL VGGGVYDFDK 
TKRQEQDLIP KEVEWIRDYA ESFQPESNSV TLRSGSSVSY DYLVVCPGIQ LDWQKIEGLK
ETLGKNGVSS NYSPHTASYT WECLRDFQGG TALFTQPPMP IKCAGAPQKV MYLAAERFRQ
RKVLDKANLE FCNAGPTMFG VPFFAEALDK VVAGYGIKAN FGCNLVAIDG PGHTATFETT
GADGSKERIN KSFDFIHVTP PQSAPDFIKN SPLANAAGWV ELDENTLQHP RFSNIFGLGD
AGSTSNAKTA AAVRKQVPVV VQNILALIND KALEPKYDGY GSCPLTTSLH RVMLAEFSYG
GKVTPSFPIL DPRSNRLIWW WLKKYGLPPL YWDYMLKGYD WDIPHKASYA EKLVAATA