Gene Noc_1874 details


Gene Information

Locus tag: Noc_1874
Symbol:
ID: 3705448
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 2129319
End bp: 2131118
Gene Length: 1800 bp
Protein Length: 599 aa
Translation table: 11
GC content: 50%
IMG OID: 637738353
Product: Phage tail sheath protein
Protein accession: YP_343870
Protein GI: 77165345
COG category: [R] General function prediction only
COG ID: [COG3497] Phage tail sheath protein FI
TIGRFAM ID:
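
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2129319, 2131118)
print(length)                     # 1800
print(protein_length_aa(length))  # 599
```

Both results agree with the Gene Length (1800 bp) and Protein Length (599 aa) fields.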


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGACCAACT GGACACAATT TATCAACACC TTTGGTGTTC AAGATCAGCT TGGTCCATAT 
ATCACAGCCC CGCAAATTTA CGTAACTCAT GCGGTGCGGG GATTTTTCGA TAATGGCGGG
GCAGCCTGTT ATTTTGTTCG CGTTGGTACA GCAATACGTG CTTCGCTTAC CCTCAACGAT
CGCGCCACGC CCACCGATAG ACCCGCCCTA GTCGTGACAG CCAAGGAAGA AGGAGTTACT
GGTAACGCAA TCACCGTTGA GGTGCAAGAT GCAAGTATTG TTACGTCAGT TGCTGCTGTA
CGCGCACAGG CTACATTATC TACCGCTTCT AATGGTGAGG CAACGGTCAC TTCCGCCTCT
GACGCGGAAA ATTTCCGACC GGGAGATATT GTTTTCCTGG AGCAGGGTAC AACCAGCGAG
CGGGCAACGA TTGCAAGCAT TAGCGATGTG ACGATTAAGT TTGCAACGAA CTTGGCTAAC
AGCTATACCG GGGGCACTAT CCGCATTGCC GATCTCGCCC CGGCACAAAC TAAGATCCGC
GTTGCTGATA CAACGAGCAT TGAACCTGGT ACATACATCA GCATTACTCA AGACGGGACA
ACGGAATCCC GAGTTGTGCA ATCGGTAGAG CCGATCAATA AATTCCTGAC ACTCACGCAA
GGACTCACTA ACACCTATAC CATGGCGACA GGGGATACGG AGGTGAATCT CCAAACGCTG
GAGTTTACGC TCATCATTAA CAAACCTGGC TTTGGAGCGG AAAATTTCCC CACCCTTTCC
ATGGATCCTC GTCACAGTCG CTATTTCTCT AGAATCGTAA ATTCTCTAAA TGCCGATGTT
ACGTTGGCTG ATCCACCTAC TCCAAGTGCG CCGCCAGACA ATCTTCCTAC GGTACTAGCT
GCAACTCCAC TAGCCGGCGG TCAAGACGAT GATGTCACGC AACTCCAAAC ATCCCATTAC
CGTAATGGCA TTGACGCATT AGAAAAAGTC GATGAGGTCA GTATTCTTTG TGTCCCTGAC
CGTACGGATC AAGACGTTCA AAAATATATG ATCGAGCACT GTGAAAAGAT GCAAGATCGT
TTTGCGGTTC TTGATCCACA AAGAAATGCT ACGCTTACTG ATATCAAAAC CCAACGAGGA
TTAGTCAGTT CTGATCGCGG CTATGCAGCG CTTTATTATC CGTGGATTAT TATTTCCAAT
CCGGTTGCTG AAGGTCGGCT CCCGGTACCT CCATCAGGTC ATATCGCTGG AATCTATGCA
CGTGTAGATG ATTCGCGAGG CGTTCACAAG GCGCCCGCAA ATGAGGCGGT ACGGGGGGTG
CTGGATTTAG AAAGGATATT GACTGATGAT GAACAGGGGC CACTGAACGA AGAAGGCATA
AATGCGATCC GCTCTTTCTT AGGCAGTGGT ATACGGGTAT GGGGTGCCCG AACTATCGCC
CCCAAAGATC GCACCCAGTG GCGCTACGTC AATGTCCGAA GACTCCTGCT GTTTATTGAA
GAATCGCTCC AGGAAGGGAC GCAATTTGCC GTATTTGAGC CAAATAACCG CTCACTATGG
GGAAAGCTAA GGCGGCAAGT TACAGAGTTT CTCAATAGAG TATGGCGCGA TGGAGCCCTT
TTTGGGGCCA CGGCTGAAGA AGCATTCCGA GTACGAATAG ACGAGGAGTT AAACCCACCG
GAAGTCCGCG CCTTGGGGCA ACTTATTATT GAAGTCATAC TTGTGCCGAC CACACCAGCG
GAGTTTGTCG TATTTCGCAT CATTTCGGAC ACAACCGGAA AGTCCCTAAT TGAGGAATAA
 
Protein sequence
MTNWTQFINT FGVQDQLGPY ITAPQIYVTH AVRGFFDNGG AACYFVRVGT AIRASLTLND 
RATPTDRPAL VVTAKEEGVT GNAITVEVQD ASIVTSVAAV RAQATLSTAS NGEATVTSAS
DAENFRPGDI VFLEQGTTSE RATIASISDV TIKFATNLAN SYTGGTIRIA DLAPAQTKIR
VADTTSIEPG TYISITQDGT TESRVVQSVE PINKFLTLTQ GLTNTYTMAT GDTEVNLQTL
EFTLIINKPG FGAENFPTLS MDPRHSRYFS RIVNSLNADV TLADPPTPSA PPDNLPTVLA
ATPLAGGQDD DVTQLQTSHY RNGIDALEKV DEVSILCVPD RTDQDVQKYM IEHCEKMQDR
FAVLDPQRNA TLTDIKTQRG LVSSDRGYAA LYYPWIIISN PVAEGRLPVP PSGHIAGIYA
RVDDSRGVHK APANEAVRGV LDLERILTDD EQGPLNEEGI NAIRSFLGSG IRVWGARTIA
PKDRTQWRYV NVRRLLLFIE ESLQEGTQFA VFEPNNRSLW GKLRRQVTEF LNRVWRDGAL
FGATAEEAFR VRIDEELNPP EVRALGQLII EVILVPTTPA EFVVFRIISD TTGKSLIEE
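
The relationship between the gene sequence and the protein sequence above can be reproduced with a short translation routine. This is a minimal sketch, not the tool the database used: `translate_cds`, `gc_content`, and `clean` are hypothetical helper names. It assumes the amino acid assignments of translation table 11 (which match the standard genetic code) and reads an alternative start codon such as GTG as methionine only in the first position. Ambiguous bases such as N are not handled.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # any start codon initiates with methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGACCAACT GGACACAATT TATCAACACC"))  # MTNWTQFINT
```

The printed residues match the first ten residues of the protein sequence, including the GTG start codon read as M. Passing the full gene sequence to `gc_content` gives the fraction that the GC content field summarizes.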