Gene Noc_2574 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2574 
Symbol 
ID3704578 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2928330 
End bp2929388 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content51% 
IMG OID637739054 
Productdelta-aminolevulinic acid dehydratase 
Protein accessionYP_344557 
Protein GI77166032 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0113] Delta-aminolevulinic acid dehydratase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.162764 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTTATT TTTCATGTAG CTCTAATAGA GGAATTGTAG ACTTGTCGAC ACACCTTTGG 
AATAGCAGAT CTTTTCCCCG CACCCGTCCC CGCCGTATGC GGTGTGACGA CTTCTCCCGG
CGGCTAATGC AGGAACATCG TATAAGCTGC AACGATCTTA TTTATCCAGT ATTTATTCTC
AATGGCCAAG GCCGTCGCGA GACGGTCTCC TCCATGCCGG GCATTGAGCG GCTCACCATC
GATAATTTAC TGGATGAAGC TAAAGAGCTT ATTGCGCTTG GCATCCCTGC CATTGCCCTA
TTTCCAGTGA CACCGCCTGC CCAAAAATCG GACAACGCCC ATGAAGCCTA TAATCCAGAT
GGCCTCGCGC AACAGGCAGT ACGGACTTTA AAACAACATT TCCCTGAATT AGGTGTGATC
ACTGATGTTG CCCTAGACCC CTTTACCAGC CATGGTCAGG ACGGCCTAAT AGATGCCAAT
GGTTATGTAA AGAACGATGA AACCGTTGAA GTGTTAGTAA AGCAGGCCCT TTCCCACGCA
GAAGCTGGCG CTGATATTGT TGCTCCTTCC GATATGATGG ACGGCCGTAT TGGTGCTATT
CGCCAGGCCC TAGAAAGTGC CGGACACACT AATACCCGGA TTCTTGCCTA TTCAGCAAAA
TATGCTTCTA GTTTTTACGG ACCTTTCCGG GATGCAGTCG GGTCAGCGGA TAACCTTGGC
GGCGGCAACA AATACAGCTA CCAAATGGAC CCAGCTAATG GCGATGAAGC TCTGCAAGAA
GTGGCTTTAG ATCTAGAAGA GGGCGCGGAT ATGGTCATGG TCAAGCCAGG ATTGCCCTAT
CTGGATATTG TCCAGCGGGT CAAGACAACC TTTGGGGTTC CTACCTTTGT GTATCAGGTC
AGCGGCGAAT ATGCCATGCT GACTGCCGCT GCCCAGAACG GCTGGCTGGA TCGGCAAACT
GTTACGATGG AATCTCTGCT TGCCATGAAA CGGGCCGGGG CCGATGCCAT CTTGACCTAC
TTTGCCAAAG ACGCGGCCCG CTGGCTAAAC GAGCAGTAG
 
Protein sequence
MIYFSCSSNR GIVDLSTHLW NSRSFPRTRP RRMRCDDFSR RLMQEHRISC NDLIYPVFIL 
NGQGRRETVS SMPGIERLTI DNLLDEAKEL IALGIPAIAL FPVTPPAQKS DNAHEAYNPD
GLAQQAVRTL KQHFPELGVI TDVALDPFTS HGQDGLIDAN GYVKNDETVE VLVKQALSHA
EAGADIVAPS DMMDGRIGAI RQALESAGHT NTRILAYSAK YASSFYGPFR DAVGSADNLG
GGNKYSYQMD PANGDEALQE VALDLEEGAD MVMVKPGLPY LDIVQRVKTT FGVPTFVYQV
SGEYAMLTAA AQNGWLDRQT VTMESLLAMK RAGADAILTY FAKDAARWLN EQ