Gene Noc_2889 details


Gene Information

Locus tag: Noc_2889
Symbol:
ID: 3707443
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 3267209
End bp: 3268378
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 56%
IMG OID: 637739365
Product: glycosyl transferase, group 1
Protein accession: YP_344865
Protein GI: 77166340
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
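
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(3267209, 3268378)
print(length_bp)                  # 1170
print(protein_length(length_bp))  # 389
```

Both results match the Gene Length and Protein Length fields of this record.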


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.228881
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTACGGG CTCTCCACAT TGGCAAATTT TTCCCGCCTT TCGCTGGTGG GATGGAGTAT 
TTTCTGCGGG ATTTGTTAGG AGCTTTATCC CGGCAAGGTA TTGAGGTGGC GGCGTTGGTT
CACGATCATC TTATGCCCCG GCAACGGCGC TGTTCCCATC ATCCCGATCC CGCCGAATGG
CCCTTTCCCG TTTATCGCGC CCCCTGCCAT GGCCGTTTTC TCTATGCCCC GGTCAGCCCT
CAGTTTCCTT TCTGGCTGCA AAAAACTATC CGGGATTTTA AACCAGACCT TTTGCATCTG
CACCTTCCCA ATACCTCCGC TTTTTGGGCC ATGGTGGTGC CGGTAGCCCG GCGGCTGCCC
TGGATCATCC ACTGGCATGC CGATGTTGTT GCATCCCGTC ACGACAAGTT CCTTGCTCCC
GCTTATCTTT TTTACCGCCC TTTTGAACAA AGTCTTCTGG GGGGCGCTTC GGCTATCATC
GCCACTTCGC CTCCCTACCT TAATAGCAGC CTGGCGCTAA GGCTCTGGCG AGAGAAGTGC
CACACCATTC CCCTTGGCCT CGATCCGTCC CGCTTGCCGG GACCTAGTGA AACCGAGCAA
GCAGACGCCC ATCGGCTCTG GGGAGATGGA ACGTCCTTGC GAGTACTTAC TATTGGCCGT
CTGACCTACT ACAAAGGGCA TGAGGTACTC TTACATGCCA TTAAAGCTTT GCCAGAAGCC
CGTTTGGTGG TGGTTGGCGC CGGCGCTGGC GAAGGGAAAC TGCGGGCGCT GATTGCAAAG
CTAGCCTTGG AAGGGCGGGT CAGCTTGCAG GGTGGCTGCA CGGAGGCGCA GCGCAATGCG
CTATTGGCAA CCTGCGATGT CTTTTGCTTG CCTTCCATCG AGCGGACCGA AGCCTTTGGA
GTCGTGCTTT TGGAAGCCAT GAAGTTTGCA AAGCCGGTAG TCGCCAGCAG GATAGAGGGC
TCTGGCGTGG GCTGGGTTGT CGCCGATGGA GAAACAGGAA TATTGTGCCC CCCTCAAGAC
CCGGCTAGCT TAACCCAAGC CCTCGGAGAT TTATTGCACA CTCCCGAAAA ACGGGAATCA
CTTGGTAAGG CGGGGGAGCA GCGTTTTCGT CAGTATTTTC AAATCGATCG CATTGCGGAA
AGAACAGCCG TGCTTTATCC TCGCGTGTGA
 
Protein sequence
MLRALHIGKF FPPFAGGMEY FLRDLLGALS RQGIEVAALV HDHLMPRQRR CSHHPDPAEW 
PFPVYRAPCH GRFLYAPVSP QFPFWLQKTI RDFKPDLLHL HLPNTSAFWA MVVPVARRLP
WIIHWHADVV ASRHDKFLAP AYLFYRPFEQ SLLGGASAII ATSPPYLNSS LALRLWREKC
HTIPLGLDPS RLPGPSETEQ ADAHRLWGDG TSLRVLTIGR LTYYKGHEVL LHAIKALPEA
RLVVVGAGAG EGKLRALIAK LALEGRVSLQ GGCTEAQRNA LLATCDVFCL PSIERTEAFG
VVLLEAMKFA KPVVASRIEG SGVGWVVADG ETGILCPPQD PASLTQALGD LLHTPEKRES
LGKAGEQRFR QYFQIDRIAE RTAVLYPRV
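
The sequences above are printed in blocks of 10 characters, so the whitespace has to be stripped before they can be used. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared with the protein sequence. It assumes the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs only in the start codons it permits, and this gene starts with ATG). The helper names are illustrative.

```python
# Codon table in the conventional TCAG ordering used for genetic code tables.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence as printed above.
first_row = "ATGTTACGGG CTCTCCACAT TGGCAAATTT TTCCCGCCTT TCGCTGGTGG GATGGAGTAT"
print(translate(first_row))  # MLRALHIGKFFPPFAGGMEY
```

The translated first row matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.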