Gene Noc_2973 details

Gene Information

Locus tag: Noc_2973
Symbol:
ID: 3707355
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 3358923
End bp: 3360395
Gene Length: 1473 bp
Protein Length: 490 aa
Translation table: 11
GC content: 50%
IMG OID: 637739447
Product: hypothetical protein
Protein accession: YP_344945
Protein GI: 77166420
COG category:
COG ID:
TIGRFAM ID:
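
The coordinates, gene length, and protein length listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information block.
start_bp = 3358923
end_bp = 3360395

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop codon.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, codon_count, protein_length_aa)
```

Under these assumptions the computed values agree with the listed Gene Length (1473 bp) and Protein Length (490 aa).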


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000455039
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGGAAGA AGGAAAGCTT AGTGGGAGTT GGGTTAGCCG CGCTCACACT GGCTGTAAGC 
TTACCCTTGG GAGCGACGGA AATGCCCCAA ACCATGGAAG AAATGTGGCG GATCATTCAG
CAGCAACAAC AAGAAATTGA AGCGCTTAAA GCGAAATCCC AACCCCTAGA AACTGAGAAA
GCTCATCCGG AAATTTCTAA AGAAGTACCC GAGAAAACTA AAGAAGCCAC AAAAACAACT
ACCTCTACTT CTGAGGAGAA TGCAGAGACA AAAGCTCAGG TTAAGGAACT GGAACACAAA
ACGGGTGTGC TCGCTGAAGC GGTGGAAAGT CTGCGGACCG CCATGCATAT TCCAGAAGAA
TTCGAATATA AAAGTATGTA TGGTCTAGGG CCGGCGGCTT CCAAGGTTTA TCAAGTCGGT
AAAGGACTAT CTATTGGTGG TTATGGTGAA GGTCGCTATC AAACTTTTGT GAATGGAGAT
GGGGACGATA ATGCCGATTT TGCCCGGCTA GTACTTTATA CTGGGTATAA GTTCACCGAC
CGGATCATCT TTAACAGTGA GATTGAGTTT GAGCATGGGA CTACCGGCGA AGGGGCTGAG
GAGAAGGGCG AAGTTTCCGT CGAGTTTGCA GCGCTTGATT TCTTTCTTGA TCCGAGAGTT
AATATTCGTG CCGGTTTGGT GTTGATGCCC ATGGGGTTTA TCAACCTCAT CCATGAACCG
CCTTTCTTTT TCGGAAATAA CCGTCCTGAG GTTGAGCGGC GAATTATTCC CAGCACCTGG
CGCGAGATTG GCGTGGGCCT TTTTGGCGAG CTGCTGCCAG GGTTAACCTA TACCATGTAC
GGAGTGAATG GACTGAACGC TGAAGAATTC AGCTCCAGGG GTATTCGCGA TGGTCGCCAA
AGTGGCAGTA AAGCTTTAGC GGAAGATTTA GCTTTTGTGG GCCGCATGGA TTATGCGCCT
CCCGGAATGC CTGGACTTTC CTTTGGGGGC TCCGCCTATG CGGGTAACTC TGGCCAAGAT
CAAAGCTATG GGGGGCAAGA TCTGGATGTC TTTACTCAGC TCTATGAGGG CCACCTCCAG
TGGCAATACC GAGGCTGGTG GTTACGGGCT CTGGGGGCCT GGGGGCATAT CGGTGATGCC
GAAGCGCTTA GTGCCGCCAA GGGGGAAACC ATCGGCGAGA GCAATTTTGG TTGGTACACG
GAGCTGGCTT ATAACTTGTT ACCGTTAGTG TGGCCGGAAA CCATCCAGTA TCTGGCCCCT
TTCTTCCGTT TTGAGCAACT GAATACTATT GCCAGCGCTC CGGCGGGATT TTCGGATAAA
GGCGGTATCA ATCAGGATAT CTACCAGGTA GGTATCAACT ATAAACCTAT TCCCAATGTG
GTTATTAAGG CGGATTATCG TAACTTCGTA GGTAGAGATG GCAACCCTTC TGCTGCCGAT
GAGTTTAATC TGGGGCTTGG GTTTATCTTT TAA
 
Protein sequence
MRKKESLVGV GLAALTLAVS LPLGATEMPQ TMEEMWRIIQ QQQQEIEALK AKSQPLETEK 
AHPEISKEVP EKTKEATKTT TSTSEENAET KAQVKELEHK TGVLAEAVES LRTAMHIPEE
FEYKSMYGLG PAASKVYQVG KGLSIGGYGE GRYQTFVNGD GDDNADFARL VLYTGYKFTD
RIIFNSEIEF EHGTTGEGAE EKGEVSVEFA ALDFFLDPRV NIRAGLVLMP MGFINLIHEP
PFFFGNNRPE VERRIIPSTW REIGVGLFGE LLPGLTYTMY GVNGLNAEEF SSRGIRDGRQ
SGSKALAEDL AFVGRMDYAP PGMPGLSFGG SAYAGNSGQD QSYGGQDLDV FTQLYEGHLQ
WQYRGWWLRA LGAWGHIGDA EALSAAKGET IGESNFGWYT ELAYNLLPLV WPETIQYLAP
FFRFEQLNTI ASAPAGFSDK GGINQDIYQV GINYKPIPNV VIKADYRNFV GRDGNPSAAD
EFNLGLGFIF
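
The gene sequence begins with GTG rather than ATG, yet the protein sequence begins with M. This is consistent with translation table 11, in which GTG is one of the alternative initiation codons and an initiation codon is read as methionine. The following is a minimal Python sketch of that translation, applied to the first and last lines of the gene sequence. It assumes that table 11 uses the standard codon assignments for elongation; the function name `translate_cds` and its `has_start` parameter are hypothetical helpers written for this illustration.

```python
from itertools import product

# Standard codon assignments in TCAG order (assumed to apply to table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna, has_start=True):
    """Translate a DNA fragment in frame 1, stopping at the first stop codon.

    If has_start is True and the first codon is an initiation codon,
    it is read as M regardless of its usual amino acid.
    """
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        if i == 0 and has_start and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 33 bases of the gene sequence, copied from the record.
head = "GTGAGGAAGA AGGAAAGCTT AGTGGGAGTT"
tail = "GAGTTTAATC TGGGGCTTGG GTTTATCTTT TAA"

print(translate_cds(head))                   # MRKKESLVGV
print(translate_cds(tail, has_start=False))  # EFNLGLGFIF
```

The two outputs match the first ten and last ten residues of the protein sequence listed above.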