Gene Noc_0973 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_0973 
Symbol 
ID3707404 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1075297 
End bp1076445 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content59% 
IMG OID637737480 
Producthypothetical protein 
Protein accessionYP_343013 
Protein GI77164488 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACACA CCACAGTAAT GATGAGCGCG GTTGTGGCCG CCTCCCTGTT AAGCACCTCC 
GCCCTGGCCC TGGACCGGGC CAACACCTCC GAGAAAGGCA GCTTATTGGT TTTCCCGAAT
GTGGATGTGA GCGGGGATAG GAATACCATT ATTCGCCTCC AGAATGATTA CTCCGACTCG
GTGAGCCTGA AGTGCTACTG GAAGAACGGG ACCAAGTTTT TCACCGACTT TCAAATTGAG
CTGACCAAGT TCCAACCCAT CTGGATAAGC GCCCGGGATG GTGAGGGGAC CTACAGCGTC
CCCCCCTTCC CCACCTCCGC CAACCAGGAC TACCTGGATA AGATAGGCCA CAGTGGCGGT
CCATATGCCA CCCCCAGCCG GGTGACGTTG CCCGAGCATG CCCGGACCGC AGGTGAGCTG
CAATGCTGGG CGGTGGATGG CGGGGGTGCC AGTGAAATTC GCTGGAACCA TCTGGCCGGC
TCTGCCACGG TGGTGGACGC CTCCCAAGGG ACGGCCTATC AATACAACGC CTGGGGTTTT
CGCTGCCTGG TCGGGGGCAA TGGGGATGCG TGCGTGGTGG CCGATGCGGG TCAACTGGAC
CTCAATGGCA ACGAATACGA AGCCTGCCCG AAGAAGCTCA TCGGCCATTT CAGCCCCGCC
GAGACCGCTC TTGGGGGCAT GCAGGTGCAT CGCAATGAGC TGACCCTGGC CTCCTGTAAT
CAGGACCTGA CCCAGGACCA GCAGTTCCAC TTCACCAAGC TGAAGTTTAA TGTCTGGAAC
GAGCAGGAAG CCAAATACAC GGGGGCCTAT CAGTGCATGG ACAGCTGGCA TCAGGGGCTG
CTCGATGGGG TGCAGAACAA TGGCCGTAAC TTCACCGCCT CGAGCCTCAA GACCGATGTG
GCCCGCTTTA AGGTAAGGGG GATGAAGAGC AGCGTCTGTG AGCGTGCTGA TAACCGCAAA
ACCTTCGCCA TTGATGAGAG CATTGTGACC GAAGAGGCCG GGCTGCTCGG GGTCATGGCC
ACCACCTACG GGTTGGGGGA AAGTGATGGC CTCGCCGAGG CGGGCACTAC CCTGCACATG
CTCGGTCAGC GTGATGGCTT TATCGCCTAC GACCCTCAAG AGGTCATTGA GGAGCGTCCC
GCTCGCTAG
 
Protein sequence
MKHTTVMMSA VVAASLLSTS ALALDRANTS EKGSLLVFPN VDVSGDRNTI IRLQNDYSDS 
VSLKCYWKNG TKFFTDFQIE LTKFQPIWIS ARDGEGTYSV PPFPTSANQD YLDKIGHSGG
PYATPSRVTL PEHARTAGEL QCWAVDGGGA SEIRWNHLAG SATVVDASQG TAYQYNAWGF
RCLVGGNGDA CVVADAGQLD LNGNEYEACP KKLIGHFSPA ETALGGMQVH RNELTLASCN
QDLTQDQQFH FTKLKFNVWN EQEAKYTGAY QCMDSWHQGL LDGVQNNGRN FTASSLKTDV
ARFKVRGMKS SVCERADNRK TFAIDESIVT EEAGLLGVMA TTYGLGESDG LAEAGTTLHM
LGQRDGFIAY DPQEVIEERP AR