Gene Noc_1869 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1869 
Symbol 
ID3705443 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2125409 
End bp2126629 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content51% 
IMG OID637738348 
Producthypothetical protein 
Protein accessionYP_343865 
Protein GI77165340 
COG category[R] General function prediction only 
COG ID[COG3500] Phage protein D 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCACGCT ACCCGCGCTA TGCGCCCACT TTCGAGATCA AGATCAATGG TGAGAAACTG 
CCGATAGCCA TGCGGGCCTC CGTAGTATCG GTAAGTTATC AGGATGGCAT TGAAGGTGCC
GACCGGGTGG AAATCACTTT GGCGAATGAT AATTTGCGCT GGCTAGATCA TTCGTTGCTG
CAGGTGGACA ACGGTTTCAC GCTTTCCATT GGCTATGCGC CGGATCCCTT AGAAGAAGTT
TTTGTGGGAG AAATTACTGG GGTGAATGCT TCGTTCCCAA ATGGCGGAAT GCCCACGCTC
ACTGTGGTGG CTCATGATTT TTTACAGCGT TTGACAATGG GCACCAAAGA CCGAGCTTTT
GCTTTGAATG TACCCTGTAT CGGAAAATTC TCGCTTCCTG ATCCTCATGT AGTGACCTTG
GTGAGTGCAG TGGATTTATT GATTCCTGTG GTCGATCCGG CTGGTGCTGC GCTCTCATTT
CTGACGCTGC TGGTGGCTTA CGCCCTTGAT CCCTTGGAAG CCAAGCAGGG CATTCGCCTG
CAGCAGAGCC AAAGTGATTT TGATTTTTTA TCTATGGTCG CTAAGGAAAA CGGCTGGGAG
ATGTATATCG ACCATGCGAT GGAGCCAAAG GGCTATGTGC TGCGGTTTCA ATTTTTAATT
CAGGATTATG CGCCAAGTGC CACGCTGAAA TGGGGTGAAT CGCTGAGCGA GTTCACGCCG
CGTCTATCCA CGGTCGGCCA GGTGGCTGGG ATTTCCACGC GTATTTGGGT TCCTAGCATC
AAGATGGAGT TCGTGCTCGT TTTATCTTGG GACTTTGATC GTGCCGCATT TGATCTCATG
GTGTTTCCAG GACTTGGCAG CCTGGAAGAG TTACTTGGCT CTACTAAGGC GCAGGGTGTC
TTAAAAATCG ATGCAATTGG GCCGGCCACA GCGCCAAAGA AGATCTTGAG CGAATTATTA
CCCCGCCTTA ACAACCGGTT AACCTGTAGC GGCAGCACTA TCGGAGATCC ACGTATCAAA
GCTAGTAGAG TGGTTAGCTT CGAAGGTTTG GGTGAGCAGT TCAGCGGTCT TTATCGCGTG
ACTTCCGCAA CTCATACGAT GGATGGCAGT GGTTACCGGA CTCAGTTTGA AGCTAGAAAA
GAAGTATGGT TTGGATCGAT ACCCGTGCCG AAAGGGGTAG ATGGATTAGT GCGTGTGCAA
GGCCAGAGAG TCGGCCAATA G
 
Protein sequence
MARYPRYAPT FEIKINGEKL PIAMRASVVS VSYQDGIEGA DRVEITLAND NLRWLDHSLL 
QVDNGFTLSI GYAPDPLEEV FVGEITGVNA SFPNGGMPTL TVVAHDFLQR LTMGTKDRAF
ALNVPCIGKF SLPDPHVVTL VSAVDLLIPV VDPAGAALSF LTLLVAYALD PLEAKQGIRL
QQSQSDFDFL SMVAKENGWE MYIDHAMEPK GYVLRFQFLI QDYAPSATLK WGESLSEFTP
RLSTVGQVAG ISTRIWVPSI KMEFVLVLSW DFDRAAFDLM VFPGLGSLEE LLGSTKAQGV
LKIDAIGPAT APKKILSELL PRLNNRLTCS GSTIGDPRIK ASRVVSFEGL GEQFSGLYRV
TSATHTMDGS GYRTQFEARK EVWFGSIPVP KGVDGLVRVQ GQRVGQ