Gene Noc_1861 details


Gene Information

Locus tag: Noc_1861
Symbol:
ID: 3705125
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 2114576
End bp: 2116819
Gene Length: 2244 bp
Protein Length: 747 aa
Translation table: 11
GC content: 49%
IMG OID: 637738340
Product: Phage tail protein
Protein accession: YP_343857
Protein GI: 77165332
COG category: [S] Function unknown
COG ID: [COG3391] Uncharacterized conserved protein
TIGRFAM ID: [TIGR02242] phage tail protein domain
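
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the gene length includes the stop codon; the function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon is counted in the gene length but encodes no residue.
    return codons - 1 if has_stop else codons


# Values copied from the record above.
length_bp = gene_length(2114576, 2116819)
print(length_bp, protein_length(length_bp))  # prints: 2244 747
```

Both results agree with the Gene Length and Protein Length fields of this record.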


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGTTTGGAT GGGCGTTGAG GGTTAGCTAT GAATCAAGAT CACAATGCAG CTTTTCTGTA 
TTAAATCGAG ATAATGAATG GCCCGACTTT CAATGGGAAG GACTGGAATT GCTCCAGGAT
GGTACACTGC GGCTCTATTC TATTCCACTG CTAAAGGAAA AACTGCCAGA AGAGATAGAA
GCTATTGCTC CGTCCAGAGC GCCCGCTGGT ATTACAGTCG ATCTTGATGG AACTGTCTAC
TTTAGCGATC CAGCTGCACA TAGACTGCTG AAAATTGACG GTTGTAATAG CGAGCTTAAG
ACTGTTCCCT GTATTGGTAG TAAAAACGGT AAACCGACTC AACTCAACGG CCCGCGTGGG
CTATTGATTC CTCCCCATCG GCGCTCGCTA TTGGTGGTGG ACAGCGGCAA TCACCGCATT
CAGATATTCG ATATTGCTTC GCTGCAACTG GTTGCTATAT GGGGCCAGCA GGATCCCTTC
AGCTTACCTC AACCAAGTGA CGCGCCCGGA TATTTTAACA CGCCGTGGAC TTTGGCAGCG
GATACCAAAG GCAATGTTTA CGTGGTGGAT TACGGCAATC AGCGCGTGCA GAAATTCAAT
TTCTTAGGAG AAGTAATTCC TGATTTTTGG GAGACTCTGC AAGCCGCTAA CTTACAACAA
CCCAGTGACA TCGCAGCGGG CGCAATAGGA GAAGAGCTCT ATTTTTATAT TGTTGCACAA
GATGCCAAAG GTGCCTGGAA AATTTTTGTA GTTGATAATA ATGGCCATCC CGTACTCGAT
ACTTCAGGTC AATCCATTGC CTTTGGTGAA GAATACCTTG AGCAGCCCAT GGGCATTGCT
GTCGACAAGG ATACAATCTA TGTAGGCGAT AATAATCGGA GGCGCGTACT CACCTTCAAA
AAAAAATCTG ATACTTTCGA GTTTGCCGGT GAGGCACTGG GCTACGAGGG GCCAGTAGCT
GCACTTGCAT TGGATGGAAA AGAGGGTCTG CTTATCCATA GTGGTATTGC TCTTGCGCCG
CTGCGCCTGA CCCTTGACAG TGGTTATCGG AATAAAGGGA TGTTATGGAG CCGTGTTATC
AAATCGGCCG AGTCCAAAGT GCAATGGCAT CGGTTACATA CCATAGTTGA TTCACTTGAA
TCCGGCGCGC ATATTCAGTT CTTTGTGCAT ACCTCGGATC AGGAGGATGA TCCGCCGCTT
GTTGATCCAA GCAGCCCCAA CCCATTTAGC GATGCGAAAT GGCGTGCCTT GCCGCTCAAT
GTGTCCGACT TTTTCATAGG AGATACTCCT GCCTGTTGCC TGTGGATAGG TGCGGTGTTT
TCTGGAGATG GATCGGCAAG CCCAATCATT TCCCAGATGC GAGTGGAATT TGATCAAGAA
ACTTACCTGA AACACCTGCC GGCAATTTAC ATCAATAGCG TCCACTCTCG AGAATTTTTA
GTGCGTTTTC TCCCCCTCTT CGAAAGCTTC TTTAATGAAG TGGAAGGAAC CATAGCCCAT
CTCCCCGCTC TGTTTGATCC AAATGCGATT CCCAAAGAAA TGCTGTCCTG GCTTGCCGGT
TGGTTGGCAA TGGAGCTGGA TGAAGATTGG GATGGAGCCA TGCAACGTCA GGTGATTGTT
GAAGCTTTCG AAAATTATGC TTGGCAGGGG ACTGCTGAGG GCTTGCGTCG ATCATTGCGC
CTTTTTGCCG GTGTTCATGC CATTATCGAG GAACCGAATC TCAACTCCGC CTGGTGGGTA
TTGCCGATAA GAGAAGAGAT GGAAGATAAA ATATCTGATC CAAGCTATCT ATCCTGGGGA
AATGAAGAAA ACTCGATCCT CGGGTTTACT ACCAGGCTTG CGTCAGCTGA GCCACAGGGC
GCAGTGGTGG GTACTACCAC CATCTTAGAT CAGTCTCATT TGATTACCAG TAAAGAGTTC
GGGGCACCGC TTTTTGAAGA TGTGGCCTAT CAGTTCAGCG TGCTTCTCTA TCGAGGCGAG
CTGCGGTGCG CGGATACGCT ATTGCGTGTA CGCGCTGTAA TCGAGCGGGA GAAACCTGCG
CATACGAGTT ATCAAGTTTG CATTATCGAA CCTCGCATGC GCGTGGGTTA TCAGGCACGA
GTGGGTGTTG ACACCGTCAT TGCTGGTCCA CCATCGGTTT CCAGACTTGG AGAAAATAGT
GCTGGAGGGG TAACGCTCGG TGGAGAACCT GCCGGCCGAA TTGGGGAACG AAACCAAGTT
GGGTTGGCTA CGCGTGTAGG TTAG
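
The GC content field in the Gene Information block is presumably the percentage of G and C bases in the gene sequence above. A minimal sketch of that calculation follows, assuming the sequence is pasted in the same blocked layout (groups of ten bases separated by spaces); `gc_percent` is a hypothetical helper, and the rounding applied by the database is not stated in this record.

```python
def gc_percent(blocked_seq: str) -> float:
    """Percentage of G and C bases in a whitespace-blocked DNA sequence."""
    # Remove the spaces and line breaks used for display.
    seq = "".join(blocked_seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence above; paste all rows to check the full gene.
first_row = "TTGTTTGGAT GGGCGTTGAG GGTTAGCTAT GAATCAAGAT CACAATGCAG CTTTTCTGTA"
print(round(gc_percent(first_row), 1))
```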
 
Protein sequence
MFGWALRVSY ESRSQCSFSV LNRDNEWPDF QWEGLELLQD GTLRLYSIPL LKEKLPEEIE 
AIAPSRAPAG ITVDLDGTVY FSDPAAHRLL KIDGCNSELK TVPCIGSKNG KPTQLNGPRG
LLIPPHRRSL LVVDSGNHRI QIFDIASLQL VAIWGQQDPF SLPQPSDAPG YFNTPWTLAA
DTKGNVYVVD YGNQRVQKFN FLGEVIPDFW ETLQAANLQQ PSDIAAGAIG EELYFYIVAQ
DAKGAWKIFV VDNNGHPVLD TSGQSIAFGE EYLEQPMGIA VDKDTIYVGD NNRRRVLTFK
KKSDTFEFAG EALGYEGPVA ALALDGKEGL LIHSGIALAP LRLTLDSGYR NKGMLWSRVI
KSAESKVQWH RLHTIVDSLE SGAHIQFFVH TSDQEDDPPL VDPSSPNPFS DAKWRALPLN
VSDFFIGDTP ACCLWIGAVF SGDGSASPII SQMRVEFDQE TYLKHLPAIY INSVHSREFL
VRFLPLFESF FNEVEGTIAH LPALFDPNAI PKEMLSWLAG WLAMELDEDW DGAMQRQVIV
EAFENYAWQG TAEGLRRSLR LFAGVHAIIE EPNLNSAWWV LPIREEMEDK ISDPSYLSWG
NEENSILGFT TRLASAEPQG AVVGTTTILD QSHLITSKEF GAPLFEDVAY QFSVLLYRGE
LRCADTLLRV RAVIEREKPA HTSYQVCIIE PRMRVGYQAR VGVDTVIAGP PSVSRLGENS
AGGVTLGGEP AGRIGERNQV GLATRVG
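
The protein sequence can be related back to the gene sequence by translating it with translation table 11. That table uses the same codon-to-amino-acid assignments as the standard code but allows alternative start codons, which is why the gene begins with TTG while the protein begins with M. The sketch below is a hand-rolled illustration, not the database's own pipeline; it assumes the input starts at the first base of the CDS, and the start-codon set is the one assumed here for table 11.

```python
from itertools import product

BASES = "TCAG"
# Standard amino-acid assignments in TCAG codon order (shared by table 11).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
# Start codons assumed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(blocked_seq: str) -> str:
    """Translate a whitespace-blocked CDS, stopping at the first stop codon."""
    dna = "".join(blocked_seq.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            # An initiating codon is read as methionine regardless of its
            # usual meaning.
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence in this record.
first_row = "TTGTTTGGAT GGGCGTTGAG GGTTAGCTAT GAATCAAGAT CACAATGCAG CTTTTCTGTA"
print(translate(first_row))  # prints: MFGWALRVSYESRSQCSFSV
```

The printed residues match the first twenty residues of the protein sequence above. Pasting every row of the gene sequence into one string should, under the same assumptions, reproduce the full protein.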