Gene Noc_1150 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1150 
Symbol 
ID3706915 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1257160 
End bp1258368 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content46% 
IMG OID637737654 
Producthypothetical protein 
Protein accessionYP_343184 
Protein GI77164659 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGATTAAAG GAGCTAGAAA ACCATGGTAT AAGAAGAGAG CATTTAGTGC CATGCTGCTA 
TTGCTGGTAA ATCCGTTACA GGCTGAATGG AAACTTCCCG ACTCTTTTCA GATTCATGGT
TTTGCCTCGC AAGCATTCAT ATTGAGTACA GACAATAATT TTTTTGGCGG CAGTAAGGAT
AACGGGACAT TTGATTTCAG AGAATTGGGG ATTAATGCTT CTTGGAGGAT TTTGCCCAGG
CTTCAGGTTG CGGCTCAGGG GGTTGCACGC TGGGCTGGCG AGAATGACGA AGGCAGCCCA
AGGTTAGACT ACGGCTTGGT TGACTATGCT TTCGTGAGTA ATGCTAAGAA CACTTGGGGC
CTGCGGCTTG GCCGTGTAAT TAATCCTATT GGTTTTTATA ATGATACCCG CGACGTGGCT
TTTACTCGGC CAAGTATTTT TCTACCCCAA TCCATTTACT TTGATCGTAC CAGAGATGTA
GCACTGTCGG CGGATGGCGG CCAAGTTTAT GGTGAACAGC GTACCAGCAT AGGAGATTTT
ATCCTCCAGT TCAATATCGC AAAACCAAGG GTTGGTACAC GGGAAGAGCG GGCTTTACTC
AGCCATGACT TCCCTGGTCA ACTTAAAGGA GATACATCTG TATTTGGGCG TCTCTTGTAT
GAGAAGGATG GGGGAGGTAT TCGGTTGGGT TTTTCCTCCT TCTGGGCTAA TTTTGATTTT
GACTCTGCGT TGTCCACAGA CTCGATTTCT TCGGGATCAA TCGAGTTTTC TCCTTTGATA
TTTTCAGGCC AATATAATGG GGAGCGGTTG AGTCTGACAA GCGAGGTTGC TCTAAGACAT
TTTAGCTATT CTGATTTTGG ACCTCGTATT CCAGATACCG ATTTCACTGG CCTGAGCTGG
TATTTTCAAG CCACCTATCG TTTTACACCC CGCTGGTCAG CTGTCGCCCG TTTTGATAGT
CTTTGTACCG AGCTTGGTGA TTGTAACGGA GAAGATTTTG CTGCTAAAAC CGGACAACCT
GCACATCGCC GGTTTGCAGA TGATTGGATG GTAGGCCTAC GTTGGGATGT GACCTCTTCC
ATCATGGTGA GAACAGAGTT CCACCATATT AGGGGTACTG CTTGGATAAC CACTGAAGAT
AATCCAAATA TTAATGATCT CCACGAGGAC TGGAATATGT TCTCCCTTCT GGTTTCATTT
CGTTTCTAG
 
Protein sequence
MIKGARKPWY KKRAFSAMLL LLVNPLQAEW KLPDSFQIHG FASQAFILST DNNFFGGSKD 
NGTFDFRELG INASWRILPR LQVAAQGVAR WAGENDEGSP RLDYGLVDYA FVSNAKNTWG
LRLGRVINPI GFYNDTRDVA FTRPSIFLPQ SIYFDRTRDV ALSADGGQVY GEQRTSIGDF
ILQFNIAKPR VGTREERALL SHDFPGQLKG DTSVFGRLLY EKDGGGIRLG FSSFWANFDF
DSALSTDSIS SGSIEFSPLI FSGQYNGERL SLTSEVALRH FSYSDFGPRI PDTDFTGLSW
YFQATYRFTP RWSAVARFDS LCTELGDCNG EDFAAKTGQP AHRRFADDWM VGLRWDVTSS
IMVRTEFHHI RGTAWITTED NPNINDLHED WNMFSLLVSF RF