Gene Noc_1969 details


Gene Information

Locus tag: Noc_1969
Symbol:
ID: 3705428
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 2257132
End bp: 2259414
Gene Length: 2283 bp
Protein Length: 760 aa
Translation table: 11
GC content: 54%
IMG OID: 637738445
Product: hypothetical protein
Protein accession: YP_343961
Protein GI: 77165436
COG category:
COG ID:
TIGRFAM ID:
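
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state the convention) and that the CDS ends in a single untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 2257132
end_bp = 2259414
gene_length_bp = 2283
protein_length_aa = 760

# Assumption: coordinates are 1-based and inclusive.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = span // 3
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("span is a whole number of codons:", span % 3 == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Under these assumptions all three checks agree with the record.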


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATCAG TTTCGTCACG TTTTCGCAGT TGGTTTGGGC GAGAATCTGC CGTTCCTTAT 
TGCCCAACCT TCAGCGCTTC TAGGGTCATT GAGAGGAAGG TAGGGATAGA CGCTGAGCCC
GCGTTTTTTG CAGATCCAGT TTCACTCAGG GCGGCTGCGG AGCAAGTGGC GGCTGAGCGG
CCCCAGTGGC GGGGACGGCT TCTTGAACGT GCCTATTCGG ATTGCAGCAC TGGTCTTATA
ACGTATGGCC GACGCGGGGA TCTTCTTGGA AAAGCATTTC CATGGAACTC GCTTACCAAT
GGGCCAGGGG AGGATGCGCT TTATGCCGCG CGGCCCCATC GTTTTGCCTT TGCGCCACGC
TTGGCTTTAG CCATTCTTTA TGGTGCGCCG CTAGAATTTA TACTGGCTGC AACCCTTGAG
GGCTGGATGC AGGTAGCTAG CGCTGGATGC TCAAAACTCC CCTATCTCAG TAACCTTGTC
GTTATTCAGC GCTTCATTGC TCTTTCCTGG GCGTTGGCTT TCCTGATTGC GAAGCGAGGC
GATGCGAGCT ACTCGGGTCG CTGCTCTGCT GTGGAGCATG CCGTGCTAAA GATTTTGTAT
GCGGATGCGC GGTTTTTAGC GCCCCGTCTC GGAAATTCTT ACCCCAATAA CCATCTTCTT
TTGGATAACT TTGCGGGCTG GTATATAGGG CTTTTGTTTC CAGAATTTAT CGAAGAGCGG
GGATGGCTAG AACGCTATGA GTCCTTATGG TTGCGTGAAC TGGCCCGGCA GATTTATGAG
GACGGGACGG GTTTTGAGCA TTCGACTCAC TATCAAGAAT ATACCTGTGA AATGACCGCC
GCCTATGTAA TCTTGAGTCG GCGGAATAAC CGCCCCGTAC CAGATTGGAT ACTGGAGCGA
TTGAAACGGA TGTTGGCATT CCAGATAGAT TTAAGCGGGC CAAAAACAAC GCCGTTAGCG
ATTGGCAATG GAACGGAGGA CCCTCTTTTT CCCTTAGATA TTGGAGAGGG GTGGGGATGT
GCGGCTATTC GAGAGCTATA TCGAGCGCTT TTTGACTCTA GCCTTTCTCC CTCGCCTTAT
GAAGATTCAG CGGCTGAGCG TGCGTTTTGG CTGCTAGGCG GCGAGCTTGC TTCTTCGCCT
GTAGAACTGA AAAGATCTCC GCGCTTTAAG TCTTACCCCG TAGGCGGGTT TTTTGTTTTC
TCTGAGCCTG ATGAGGGTAC TCGCCTAGTT TTTCGTACCG GACCAGCAAC TGGCAAGTTG
CTTATGGGGG GGCATATGCA TGCAGATCTG TTAGGCATCT ATGTTAGTGT GGCTGGAAAT
CCCATGGTAG TTGATGGAGG AACTTATACT TATCGGACAC AGGCGGAACG ATGGCCGCCC
GATTCTCCCC GGTGGCGCGC TTACTTTGCC GGACCTGAAG CCCATAACGG TTTATCTATT
CGGGGTGTCG ACCCGCTGGG TTCTCTTAAG CGGGATTTCC GAGATCGGGA AGTGGCCGCT
CGGGTAGCCA CGACACGCCT AACTGCGGGT GCTGTAGGAG CTTGGGTTGA GGGAAGAATC
GAAAGCCAGA ACGCCTATGA TAGTTATTGC CGAGGGGTAG TCCACGTACT CGGCGAATAT
TGGCTCGTTT ATGACCAATT GCCGGAAGGT ATCTCCTCCG ATGCGGCGAG TTTCGGGCTA
CAATGGGTGC CGGGAGCAAA GGTCGAAAGG AGATCGCCCC ATGAGATTTG TGTGGTTACG
AATGAAGAGT CGCTGGGCAT CACTTTTAGC GAGGGTTTAA CCCCCCCCCA GGTTCTTGAA
GGCAGTTTGG CTCCTTTGGG AGGGTGGGTA TCACCTCACT ATGGCCACTT AGAGGCGGCC
CCTCAGATAA GAACAAAATG TGAGGGAAAC CGTGAATTAA CGGCCTTTTT GCTGGCAGCA
GGTAGAAAAC CAACAGAGAC CGTTTCGCTC CAGAGAATGC AGGGCCACCA GGGCGGGTTG
GCTTTCCGAA TTACCTGTGG CGAACGGATG GATTACCTTT TTCTAGGACC GCAAGAAGCG
TCACCGGGGA TAGAGGCGTG GGGCATTCGC TTTGAGGGTG TCTGGCTATG GTTACGGACC
ATCGCTGGAG TGCCGACGAT ACTGCGCTGG CTCGAAGGCC AGTCATTGCA GGGTGAGGCA
ATGGGCTTGT CCTTGCGAGC CCCAGCAAGG ACACCGGTCC TTGAGCTCCG GCATTCAGGT
GCGGTTCTGG AGAATCCCCA TGGTACCTTG GATGGGCTCT CTCTCTGCTG GCCGGGCAGT
TGA
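
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be cleaned before any computation. The sketch below shows one way to strip the layout and compute a GC percentage; the helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first and last lines of the sequence are pasted in to keep the example short. The excerpt's value is therefore not comparable to the 54% reported for the whole gene; paste the full sequence to make that comparison.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks, and upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Excerpt only: the first and last lines of the gene sequence.
excerpt = """
ATGAAATCAG TTTCGTCACG TTTTCGCAGT TGGTTTGGGC GAGAATCTGC CGTTCCTTAT
TGA
"""

seq = clean_sequence(excerpt)
print(len(seq), round(gc_percent(seq), 1))
```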
 
Protein sequence
MKSVSSRFRS WFGRESAVPY CPTFSASRVI ERKVGIDAEP AFFADPVSLR AAAEQVAAER 
PQWRGRLLER AYSDCSTGLI TYGRRGDLLG KAFPWNSLTN GPGEDALYAA RPHRFAFAPR
LALAILYGAP LEFILAATLE GWMQVASAGC SKLPYLSNLV VIQRFIALSW ALAFLIAKRG
DASYSGRCSA VEHAVLKILY ADARFLAPRL GNSYPNNHLL LDNFAGWYIG LLFPEFIEER
GWLERYESLW LRELARQIYE DGTGFEHSTH YQEYTCEMTA AYVILSRRNN RPVPDWILER
LKRMLAFQID LSGPKTTPLA IGNGTEDPLF PLDIGEGWGC AAIRELYRAL FDSSLSPSPY
EDSAAERAFW LLGGELASSP VELKRSPRFK SYPVGGFFVF SEPDEGTRLV FRTGPATGKL
LMGGHMHADL LGIYVSVAGN PMVVDGGTYT YRTQAERWPP DSPRWRAYFA GPEAHNGLSI
RGVDPLGSLK RDFRDREVAA RVATTRLTAG AVGAWVEGRI ESQNAYDSYC RGVVHVLGEY
WLVYDQLPEG ISSDAASFGL QWVPGAKVER RSPHEICVVT NEESLGITFS EGLTPPQVLE
GSLAPLGGWV SPHYGHLEAA PQIRTKCEGN RELTAFLLAA GRKPTETVSL QRMQGHQGGL
AFRITCGERM DYLFLGPQEA SPGIEAWGIR FEGVWLWLRT IAGVPTILRW LEGQSLQGEA
MGLSLRAPAR TPVLELRHSG AVLENPHGTL DGLSLCWPGS
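
The protein sequence can be related back to the gene sequence by translating codons. The sketch below builds the codon-to-amino-acid mapping used by translation table 11 (its amino acid assignments are the same as the standard code) and translates the first line of the gene sequence. The `translate` helper is a hypothetical name, and the sketch ignores the special handling of alternative start codons, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) and their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence and the first 20 residues of the protein.
first_line = "ATGAAATCAG TTTCGTCACG TTTTCGCAGT TGGTTTGGGC GAGAATCTGC CGTTCCTTAT"
expected_start = "MKSVSSRFRSWFGRESAVPY"

print(translate(first_line) == expected_start)
```

Passing the full gene sequence to the same function is a way to compare the result against the complete 760-residue protein sequence.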