Gene Noc_3008 details


Gene Information

Locus tag: Noc_3008
Symbol:
ID: 3705716
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 3400559
End bp: 3401632
Gene Length: 1074 bp
Protein Length: 357 aa
Translation table: 11
GC content: 51%
IMG OID: 637739482
Product: uroporphyrinogen decarboxylase HemE
Protein accession: YP_344980
Protein GI: 77166455
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0407] Uroporphyrinogen-III decarboxylase
TIGRFAM ID: [TIGR01464] uroporphyrinogen decarboxylase
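
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The Python sketch below checks that relation. It assumes the coordinates are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the record states neither convention explicitly. The function names are illustrative and not part of any database API.

```python
# Values copied from the Start bp and End bp fields of the record.
START_BP = 3400559
END_BP = 3401632


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the final codon is a stop that is not counted."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1074 357
```

Under these assumptions the computed values agree with the Gene Length (1074 bp) and Protein Length (357 aa) fields.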


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTGAGT TAAAAAATGA TCGTTTTCTC CGGGCACTGT TGCGGCAGCC GGTGGATCGA 
ACCCCGATTT GGATTATGCG CCAGGCGGGG CGTTATCTCC CGGAATATCG GGAAGTGCGT
GCTAAAGCTG GGGATTTTTT GACCTTATGT ACAACGCCAG AGCTAGCTTG CGAAGTGACC
CTGCAACCGC TGCGTCGCTT TGACCTGGAT GCTGCTATTA TTTTTTCCGA TATCCTCACC
ATTCCGCATG CCATGGGGTT GGGATTGTAT TTTTCCAAAG GTGAAGGGCC TCGTTTTGAA
CGGCCGGTAA GAACTAAAAA CCAAGTCTCT GCCCTTGGAG TTCCAGATCC AGAATCGGAT
TTAAGTTATG TCATGGAAGC TCTGCGGTTA ACCCGAAGAG AGCTGGATGG GCGCGTGCCT
CTCATCGGTT TTTCTGGCAG TCCCTGGACC TTAGCGTGCT ATATGGTGGA AGGGGGATCA
AGCAAAGATT TTGCCCTAAT CAAGGGACTA ATGTTTGAGC ATCCTCAAGT AATGCATCAC
CTACTAGAAA TTCTTGCCCA GGCTGTTACA GTCTATCTTA ATGCCCAGAT AGCAGCGGGT
GCCCAGGCCG TGATGCTCTT CGATACTTGG GGTGGAGCGT TAAGTCACCG TGATTATCGG
GACTTCTCTC TCTCTTATAT GGCCAGGATT GTGGAAGGCG TAGTCCGGGA GAATGAAGGC
CGTCAAGTAC CGGTAATTTT ATTCACCAAA GGGGGAGGGC TTTGGTTGGA AACAATGGCT
GGGACGGGCT GTGATGCCTT AGGGGTTGAT TGGACTGTGG ACCTTGCTAA AGCGCGAATG
CAGGTAGGAA AACAAGTCGC CTTGCAAGGG AATATGGACC CTTGCGTGCT TTATGCCTCC
AGCGAACGGG TACGCCAGGA GGCAAGCGAA ATCATTAAAG CTTATGGTGC TGGTAGCGGC
CACGTTTTTA ACCTTGGGCA TGGCATACAT CCTACGGTGA TGCCTGAGAA AGTAGCGGCC
CTTGTGGATG CGGTCCATGA ACTGAGTGTG CCGTATCATA CCTATGAGGA GTGA
 
Protein sequence
MAELKNDRFL RALLRQPVDR TPIWIMRQAG RYLPEYREVR AKAGDFLTLC TTPELACEVT 
LQPLRRFDLD AAIIFSDILT IPHAMGLGLY FSKGEGPRFE RPVRTKNQVS ALGVPDPESD
LSYVMEALRL TRRELDGRVP LIGFSGSPWT LACYMVEGGS SKDFALIKGL MFEHPQVMHH
LLEILAQAVT VYLNAQIAAG AQAVMLFDTW GGALSHRDYR DFSLSYMARI VEGVVRENEG
RQVPVILFTK GGGLWLETMA GTGCDALGVD WTVDLAKARM QVGKQVALQG NMDPCVLYAS
SERVRQEASE IIKAYGAGSG HVFNLGHGIH PTVMPEKVAA LVDAVHELSV PYHTYEE
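
The sequence blocks above are printed in space-separated groups of ten characters, so they need light cleanup before use. The Python sketch below strips the whitespace, computes GC content, and translates DNA to protein. The amino-acid assignments of translation table 11 are the same as in the standard genetic code (the tables differ in which start codons are permitted), so the standard assignments are used here. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and the code handles only unambiguous A/C/G/T bases.

```python
# Amino-acid assignments for codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]  # KeyError on non-ACGT input
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases and last 24 bases of the gene sequence listed above.
print(translate("ATGGCTGAGT TAAAAAATGA TCGTTTTCTC"))  # MAELKNDRFL
print(translate("CCGTATCATA CCTATGAGGA GTGA"))        # PYHTYEE
```

The two fragments translate to the first ten and last seven residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_percent` is one way to compare it against the Protein sequence and GC content fields.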