Gene Noc_0076 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_0076 
Symbol 
ID3705914 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp75986 
End bp77884 
Gene Length1899 bp 
Protein Length632 aa 
Translation table11 
GC content60% 
IMG OID637736596 
ProductDNA helicase II 
Protein accessionYP_342143 
Protein GI77163618 
COG category[R] General function prediction only 
COG ID[COG3972] Superfamily I DNA and RNA helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAATTTTA ACATCGGGTG GATCGCCCGC TACCGGAATT TCGACACCAT GGCCACGTTC 
ATTCCAGCAC TCACCACAAT CCCCCGGATG ACTCGCGGCG AGCGCCGTTT CGGTCGCCGG
CTTGACAGCC TGCTGGAAGA AGACTATCTC GTCTGGTACG ACATTCCATT GGGACGCCGG
CGTCGCTATC CCGACTTCAT CATTCTGCAT CCCGCTCGCG GCCTGCTGTT TCTCGAGGTC
AAGGACTGGA AAATCGAAAC CATTCGCTCG ATCACCCCTG ACAGCGTCGT GATCGATACC
CAAGAGGGCC GCAAGACGGT CAGCAACCCG TTGGCTCAAG CTCGCCAATG TGCCTTTGCC
GCCATCGACC AGCTGAAGCG CGACCCTCAG CTGACTCAGA GCGATAAGTG CTACCGCGGC
AAGCTATGCT TTCCCTATGG GCACGGTGTG GTCTTTCCCA ATATCACGCG TCGGCAGTGG
AACCAAGCCA TTCCCGAGGC AGAGCAAGAA ATCCTGCTGC CGGCGCATCG CGTGATCTGC
AAGGACGAGA TGCTCACCAC CGCCGACCCG GAGGCATTCC AGCAGCGGCT CTGGAACATG
TTCGACTACC GGTTCGGCGA GCAGTTCAGC GTCCCGCAGC TCGACCGGAT CCGCTGGCAG
CTCTTCCCGG AAGTGCGCAT CGATGCCCCC ACAACGGATC TCTTCGGTAA CGATGAAGCC
GCGGAGGATG AGCCCGCGAG CAACCTCGTC CCCAATATCG TTCGCGTCAT GGATCTGCAT
CAGGAGCAGC TGGCCAGAAG CATGGGCGAC GGCCACCGGG TCATTCATGG CGTTGCCGGC
TCCGGCAAGA CCTTGATACT CGGCTACCGC TGCCTGCATT TGGCCCAGGC CATCAGCAAG
CCGATCCTGG TGCTGTGCTT CAACATCACC CTGGCAGCGC GCCTGCGCTG CTTCATCGCC
GAAAAGGGAA TCAGCGAGAA GGTTAAGGTG CACCATTTCC ATGAGTGGTG CAGCCTGCAG
TTGAAGACCT ATCAGGCCGA CCTGGCGCCG GGAAAAGGCC CTATCTGGGA GCGTCAGGTG
GAAAGCGTCA TTCGGGCAGT CGATCAGTCA CGCATTCCCC GGGCACAATA CGGCGCGGTG
ATGATCGACG AAGGCCACGA CTTCGAGCAG GCCTGGCTCA AGCTGGTGGT ACAGATGGTC
GATCCCGACA CCAACTCGCT GCTGCTGCTT TATGACGATG CCCAGTCCAT TTATCAGAAG
AGCTCGCTGA AATTCCCGCT TTCCTCGGCT GGCGTCCAGG CCCGCGGGCG CACCACTATT
CTCAAGCTGA ACTATCGAAA CACCCGGGAA ATCCTGACGT TCGCCTATGA TTTCGCCCAG
GATTTTCTGA AAGCTCACGA TGCCGATGAT GACCATATTC CTTTGATCGC CCCCGAGGTG
GCCGGGGTGA GCGGGCCCAG GCCGGCGTTT CGTCGCCTCA GCAGCCCCCG CGATGAAGCG
CGCTATCTGG TGCGCTGCAT CCAGACATGG CGTAGCCAGG GTAGCGGCTT GAACAGTATC
GCGGTGGTCT ATACTGGCAA CTCGCAGGGG CGTCTCTTCT ATGACGCCCT GCGCGAAGCC
AGCATCCCAA GCCGCTGTCT GCAACAGTCT GCCGACAAGC GCAGCTACGA CCCGCAGGCC
GATGAAGTGG TGCTGCTCAG TCGACAGAGC AGCAAGGGGC TGGAGTTCGA TACCGTGCTG
CTGTGTGGTC TCGGGGCATT GAGCAACGAC GAGGAACGGC TGGCTCAGGA AGCGCGACTG
CTTTATGTCG GCATGACCCG CGCTCGCCGC CGGCTGCTGG TAACCAGCTG CAAGCCAAAC
TGGTACACCC AGCGGCTAAC AGAGCTCGCC TCGGCCTGA
 
Protein sequence
MNFNIGWIAR YRNFDTMATF IPALTTIPRM TRGERRFGRR LDSLLEEDYL VWYDIPLGRR 
RRYPDFIILH PARGLLFLEV KDWKIETIRS ITPDSVVIDT QEGRKTVSNP LAQARQCAFA
AIDQLKRDPQ LTQSDKCYRG KLCFPYGHGV VFPNITRRQW NQAIPEAEQE ILLPAHRVIC
KDEMLTTADP EAFQQRLWNM FDYRFGEQFS VPQLDRIRWQ LFPEVRIDAP TTDLFGNDEA
AEDEPASNLV PNIVRVMDLH QEQLARSMGD GHRVIHGVAG SGKTLILGYR CLHLAQAISK
PILVLCFNIT LAARLRCFIA EKGISEKVKV HHFHEWCSLQ LKTYQADLAP GKGPIWERQV
ESVIRAVDQS RIPRAQYGAV MIDEGHDFEQ AWLKLVVQMV DPDTNSLLLL YDDAQSIYQK
SSLKFPLSSA GVQARGRTTI LKLNYRNTRE ILTFAYDFAQ DFLKAHDADD DHIPLIAPEV
AGVSGPRPAF RRLSSPRDEA RYLVRCIQTW RSQGSGLNSI AVVYTGNSQG RLFYDALREA
SIPSRCLQQS ADKRSYDPQA DEVVLLSRQS SKGLEFDTVL LCGLGALSND EERLAQEARL
LYVGMTRARR RLLVTSCKPN WYTQRLTELA SA