Gene Noc_2819 details

Gene Information

Locus tag: Noc_2819
Symbol:
ID: 3705568
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 3194679
End bp: 3195872
Gene Length: 1194 bp
Protein Length: 397 aa
Translation table: 11
GC content: 50%
IMG OID: 637739295
Product: hypothetical protein
Protein accession: YP_344796
Protein GI: 77166271
COG category:
COG ID:
TIGRFAM ID:
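
The gene length and protein length listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length (the variable names are illustrative, not part of the record):

```python
# Values copied from the Gene Information table.
start_bp = 3194679
end_bp = 3195872

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases.
codon_count, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and adds no amino acid.
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the 1194 bp and 397 aa reported in the table.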


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCACTAA CCGTTCAAGC TAACGAGCTA GCTGGTATCC GCCCTTATTT ACAACCTCGA 
TTGGGGTTCG AAAACTCTTT TTATAATTAT ACGAAACCCG ATGCGGTCTC TGATGTGCGC
CTTGAATCCC CCTCGGAAGA ACCCCATATT GGTGCCACCT TAGGCGCGGA TCTAGGGCGC
TACTGGGGCA TTGAATTTGC CTACGATTAT ATTAAAACCA ACCTTTTGCA AACCTCTGGC
AAAAAGGCAG GGGACTACGC TACAACGACT TGGCTAGGCC AGTTAAGATT CCGCTACCCG
TTACTCCAGG ATCGGCTGGT TCCCTACTTA TTGGCGGGTG GTGGTATCGG TATCGGAGAA
TTTAGCGGCC GTGAGGATTT CTCCTTTACG GGGGGCGGTA GCGACACAGT GCCTCTGGGG
GTGGTTGGTG GCGGCGCTGA ATATTTTATA ACAGATAATA TTGCCCTTGG AGTCGAAGCA
AAATATTATT TCGGCTTTCA TCCTGAAATC TCAATTTCCA CAGAGGAGCG GGAACTTACC
TTGGATGCTG TTGGTGTCAC GGCCAACATG CGTGTTTATC TTGATCAGCT AGCGACGGGT
AAGTATGCTT GGCTTGGAGA GCAGCGACCG GCTCGGGATA AGGATGCCAT GCGAGGTTAT
CTTAGCTTGC GAGGTGGCGT TGCTTTTCTT ACCGATAGAA ATGCTGTTCC AGAGGCCAGT
TTCGATAGCA CATCTGGACC TTGGCCCAGT GGGTCAATAG GCATGAATTT CAATAAGCAT
TGGGGCGTGG AGCTTGCTGG CGATTATGGC CGAACCCAGT TGCGATCGCC CGTCCTTGGC
AAAATTACAG GATATCCCAT ATGGACTATC TCAGCCCTAG GGCGTTTTCG CTATCCTTTG
CTTAATGACA AACTTAGTCC CTATCTATTA GCAGGTCCCG GTCTTGGTTT CGCCGAGATA
GGCGATCCGG ATCAACCGCT TTCCGTCACC GGACTTTCTG GTGGTCAGGA TAATTCTATT
GTGGCCATCT TCGGTGCTGG AGTTGATTAC TTTATCGGCT ATAATGTAGC TTTTAACCTT
GAGGTCAAGC GGGCTGCCTT TTTTAACACC CAGGTCAAGA TCAACGGCCA ATCCGAAACG
TTATCGCCAG AGTTCGTCTC CCTAACAGCG GGTATCCGTG TTTTTTTTCC TTGA
 
Protein sequence
MPLTVQANEL AGIRPYLQPR LGFENSFYNY TKPDAVSDVR LESPSEEPHI GATLGADLGR 
YWGIEFAYDY IKTNLLQTSG KKAGDYATTT WLGQLRFRYP LLQDRLVPYL LAGGGIGIGE
FSGREDFSFT GGGSDTVPLG VVGGGAEYFI TDNIALGVEA KYYFGFHPEI SISTEERELT
LDAVGVTANM RVYLDQLATG KYAWLGEQRP ARDKDAMRGY LSLRGGVAFL TDRNAVPEAS
FDSTSGPWPS GSIGMNFNKH WGVELAGDYG RTQLRSPVLG KITGYPIWTI SALGRFRYPL
LNDKLSPYLL AGPGLGFAEI GDPDQPLSVT GLSGGQDNSI VAIFGAGVDY FIGYNVAFNL
EVKRAAFFNT QVKINGQSET LSPEFVSLTA GIRVFFP
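
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first and last few codons only. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code; `translate` and `CODON_TABLE` are hypothetical helper names, not part of this record or of any library.

```python
from itertools import product

# Codon table built in the conventional TCAG order.
# Assumption: translation table 11 uses the same amino acid assignments
# as the standard genetic code ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring whitespace."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 21 bases and last 24 bases of the gene sequence above.
head = translate("ATGCCACTAA CCGTTCAAGC T")
tail = translate("GGTATCCGTG TTTTTTTTCC TTGA")
print(head)  # MPLTVQA
print(tail)  # GIRVFFP*
```

The two fragments match the beginning (`MPLTVQA`) and end (`GIRVFFP`) of the listed protein sequence, with the trailing `*` marking the TGA stop codon.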