Gene Noc_0653 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_0653 
Symbol 
ID3706885 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp701746 
End bp703311 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content58% 
IMG OID637737161 
Producthypothetical protein 
Protein accessionYP_342702 
Protein GI77164177 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGTCG ACAGCTATCT GGAACTCTTT ACTTCCCTTT TCGGCTGGAC CTTCTACGGC 
ATTCTCTGGG ATGTGTTGGT CAGTACCGGC ATCGTCTATT TGCCGTTCCT GGGGATTCTG
ATCGATAACT GGCGCGAACC AGCGCAGGGC GGTGAGGTCG GTCATGCCAG CGGCCTCTCG
CTGCGGCGCA TGGAGATCGA ACTGTTTATC GCGTTACTGG TGGTGGTCCT GGCCGGGCAA
CCCGCTGCCC TCACGCCACT CAATGCCGGT ACCTTGTCCT ATTCCCCGCC ACCAACGCTA
TTGGATCCAA CGCCGGCTGC AGCGACGGTG GCTGCGCCCC AGAGTACCTA TGGCACAACG
GGCTTCACCG GTACCGCCGC AACCGTCAAT GTTCCCGTCT GGTGGTACGG TGTCATCGCA
CTGAGTTCGG GTTTGAACCA CGCTATCGTC GAAGGCCTGC CGACCGTGGC GGATATGCGG
ACGTTCGAGC AGCAGGCGCA TTTAGCCACC ATTGCTGATC CACGCCTCAG GCAGGAAGTG
AGCGAGTTCT TCAGCCAGTG CTACATTCCG GCGCGTTCCA AATACCAGGC TGAACGACCA
AACATAGCGG CCATCAACGG CATTCTCGCC ACGTACGGTG TCGATGACCC GGATTGGATG
GGGTCCCACG TCTATCGGGA CACCTCGGGC TATTACGACA CCCTGCGCCC TGCCAGCCCA
ATCACTGGCT GGGCATACAT TGCGGCCAGA GATACCGAGT ATGACGCCAC ATCACCACCG
GCCTGGGGAA AACCCTACTG CAAACAGTGG TGGGAAGATG GGGCAATCGG CTTGCGTGAG
AAGCTGATCA ATGAGGCAGA TGCCACCTCT GCCGGCTTCT CCGGACTGGT AGTCGCCATC
GCGCCCGCTC TGGCCAGCGA ACAGCAAAAC GACGCCGTCG CCAAAACCGT ACTGACCAAC
GCGCCGCCCT CCTGGTCGAA CAATGAGTTA ATCGCCAACA ACGCATCCGG AGCAGGGTTG
GTCAACACCG CCGGATCTAT CATCAAAGGT GGTCTCGCCA CCGGTGGTGT GATCACCGCA
TCAGCCCTGT TTTCCGTAAC CATGACGGCC GTCTTACAAT CACTGCCCAT GGTGCAGGCC
ATTATGCTGC TGGGTATCTA TGCACTGCTC CCATTGGTGG TGGTTTTGTC CCGCTACTCC
ATAGCCATGA TGGTAGTGGG CGGTATGGCG ATTTTCACCA TCAAGTTCTG GACAGTGCTC
TGGTACCTGG CCATGTGGGT GGATCAGAAC CTGATTCTGT CCATGTATCC CGACGTTAAC
GTTTTCCTGC AGATCTTCGC CAACCCTGGC GAACACGACG CCAAGCGCAT GTTGCTGAAT
ATGATTACCA CCAGCCTCTA TCTGGGGCTG CCGTTGCTGT GGAGTGGGAT GATGGCGTGG
GCGGGGGTGA AAGTCGGTAG GTCGATTGAT TCGGCAGCGA ACCCGATCAA GGCGCCAGCG
CAGGATGCTG GCAATCAGGG AGGAAGTATC GGCAAGATGG TGCTGACCAA AGGTAAGAAA
CGTTAG
 
Protein sequence
MSVDSYLELF TSLFGWTFYG ILWDVLVSTG IVYLPFLGIL IDNWREPAQG GEVGHASGLS 
LRRMEIELFI ALLVVVLAGQ PAALTPLNAG TLSYSPPPTL LDPTPAAATV AAPQSTYGTT
GFTGTAATVN VPVWWYGVIA LSSGLNHAIV EGLPTVADMR TFEQQAHLAT IADPRLRQEV
SEFFSQCYIP ARSKYQAERP NIAAINGILA TYGVDDPDWM GSHVYRDTSG YYDTLRPASP
ITGWAYIAAR DTEYDATSPP AWGKPYCKQW WEDGAIGLRE KLINEADATS AGFSGLVVAI
APALASEQQN DAVAKTVLTN APPSWSNNEL IANNASGAGL VNTAGSIIKG GLATGGVITA
SALFSVTMTA VLQSLPMVQA IMLLGIYALL PLVVVLSRYS IAMMVVGGMA IFTIKFWTVL
WYLAMWVDQN LILSMYPDVN VFLQIFANPG EHDAKRMLLN MITTSLYLGL PLLWSGMMAW
AGVKVGRSID SAANPIKAPA QDAGNQGGSI GKMVLTKGKK R