Gene Noc_0232 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_0232 
Symbol 
ID3706287 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp256439 
End bp257734 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content50% 
IMG OID637736748 
Producthypothetical protein 
Protein accessionYP_342292 
Protein GI77163767 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAAAATT TGGCCCGTCA CTGTTTTATT TTTTTAGGTT TGCTACTTGT GCCAACCCTC 
GGGGCGGCTG GTTTGAATAT CTTCCCCGAG TTGGATTCAA ATGATAACAA TAATACCAGC
AGCAATAATA CCAGCCAAGA AAGTACTCAG AACTTTAGTA ATACCGCAGA CCCAGAGCCG
GAGGCTGATA CCAACGCCTC CCCCCCTGCA AACAAGGATG AAATTATTGC TGAAACCAAA
GCAGGATGCG CCACCGATCC GGCAAGCTGC GGGGTGACTT TATCCAGTTT TCTAGACAGC
ACCGGCTTTG GCGAAACCGA ACCCAATAAC CATGCCATGG GCGCTGACGC CATGGAGTTC
GGCGTTGAAT ACGCGGGGCA GCTTTATAGC CCGGAAGATG TGGACTGGTT CCGTATTACC
ACCACGGAAT CCAATCAAAT GCTCACCGTT AATTTTAACG TTCCCGGATT AAACGATATT
ACCGGATGGA ATCTCTCCAT TCGGGATAGC GGGGGTAATA TTTTCTCGGA AGTCTACACT
GGGTTTGACT TCGGACCAGA AAGCCCATTA CAGACAATCC TATCCCGTGC GGGTACTTAT
TATGTAGTGG TAAAATCCTT GAAGCAGGCC CAGGAAGAGA CAAGATCTTC AGACCAAAGC
GGCGAAGCTG ACCTGATCTA TGAACATCTG CCCCACGAGT ACCGCCTGGC CGCTTTTCTG
GGGGATTCTC AGGTCACCAC GGAACCCCTT GACGTCAATT TTTTTGACGC CGAAGTGGAA
CCCAATGACT CCCGCGATGA AGCCAACCCA TTAACCTTTG CCACCCCCCT CGCATCCAAT
GTCACCATGG AGGGCTTAAT ATCCGGACCT CTCATCTTTG GATCGGTGGG ATTTGCCTTC
GAAGAGGACT GGTTTGTCTA TGACACGGCA GGGAATGAAA TTCTCAGTAT AGAATTTTGC
GCTAGCCAAG ACTGCGAAGA CAGCACTTGG CAAGTCACCG TGTACGATGA AAATGAACGA
ATGCTGCTCA CCGGGCGGAC CGACATGGAA CAAAATTATT ATCTGGGTAT CCGCAATCCA
GGCAAGTATT TTATACGAAT TGGTGTGGCC CCGGCACTAG ATGAGGAAAG CGGCGGCGCA
CAATATGTCT GCTCTATTGA TCCTACCATG CCCCTCAAGG ATTGTCCTAG CCCCAGCGAG
AGAACATTAC TGGTTGAGTC ACCGTGGCAT CAATACAACT TCACTGTCAC CAGCACCAAG
CTTCCACCCT TGATGAGCGA GGTAGATAAT CCCTAA
 
Protein sequence
MKNLARHCFI FLGLLLVPTL GAAGLNIFPE LDSNDNNNTS SNNTSQESTQ NFSNTADPEP 
EADTNASPPA NKDEIIAETK AGCATDPASC GVTLSSFLDS TGFGETEPNN HAMGADAMEF
GVEYAGQLYS PEDVDWFRIT TTESNQMLTV NFNVPGLNDI TGWNLSIRDS GGNIFSEVYT
GFDFGPESPL QTILSRAGTY YVVVKSLKQA QEETRSSDQS GEADLIYEHL PHEYRLAAFL
GDSQVTTEPL DVNFFDAEVE PNDSRDEANP LTFATPLASN VTMEGLISGP LIFGSVGFAF
EEDWFVYDTA GNEILSIEFC ASQDCEDSTW QVTVYDENER MLLTGRTDME QNYYLGIRNP
GKYFIRIGVA PALDEESGGA QYVCSIDPTM PLKDCPSPSE RTLLVESPWH QYNFTVTSTK
LPPLMSEVDN P