Gene Noc_1988 details

Gene Information

Locus tag: Noc_1988
Symbol:
ID: 3704872
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 2285796
End bp: 2287067
Gene Length: 1272 bp
Protein Length: 423 aa
Translation table: 11
GC content: 52%
IMG OID: 637738464
Product: putative glycosyl transferase
Protein accession: YP_343980
Protein GI: 77165455
COG category: [C] Energy production and conversion; [G] Carbohydrate transport and metabolism
COG ID: [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase
TIGRFAM ID:
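
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the protein length excludes the stop codon; the function names `gene_length` and `protein_length` are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Noc_1988.
length = gene_length(2285796, 2287067)
print(length)                  # 1272
print(protein_length(length))  # 423
```

Both results agree with the Gene Length and Protein Length fields reported for this gene.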


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0156552
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTGACT CCAGCACCGG GCCAATTAGC TCGATAAGCA AGCGGCGGCG AATACTTTTT 
TTTGCTGAAG CCGTGACATT GGCGCATGTG GCGCGGCCAG TGGCCCTGGC AAAGAATCTG
AACCCGGCTC TTTATGAGGT TCATTTTGCG TGCGACGCAA GGTATCACAA ACTACTGGGC
AAGCTCCCAT TTATCTGGCA CCCCATTCAT TCTCTTGCTA GCGAAAAATT TCTTGAGGCG
CTCTCCAAGG GCAGTCCCGT TTATAGGGCT GATACACTAC GCGCTTATGT CAACGAAGAC
GCGAAGGTAA TCAAGGAAGT AAGCCCGGAT GTGATCGTAG GGGATTTTCG TCTGTCCCTA
GCCGTTAGCG CGCCGCTCGC TCAAATTCCT TATATGACAA TTGCTAATGC TTATTGGAGT
CCGTATGCTA AACGGCGTTT TCCCGTACCG GATATTCCTT TAGCAAAGAT AATCGGAATC
AAGGCGGCAC AATACCTGTT TAACGCCATC CAGCCCCTGG CCTTTGCCTA TCATGCTCTG
CCCTTAAATA AGATCAGGCA CGAATATGGC TTACCCAAGA TAAGCTTGGA TTTACGCCAT
ATCTACACCT ATGCCGATCA CACACTTTAT GCCGATATCC CAGCGCTGGC GCCGACCATC
GATCTCCCCT CCGGGCATCA TTATCTCGGT CCGGTTCTTT GGTCGCCCGC AGTCCCTCTT
CCTGCTTGGT GGGAGAAAAT ACCTGCGGAC AAACCCGTTC TCTATGTGAG CTTGGGCAGT
TCTGGGCAAA GTCAATTATT ACCGGAGATG TTAAAGGCGC TAGCCGATCT GCCTATCACT
CTCCTGGTGG CAACGGCGGG GCGAATCAAA CTCCCTAGCC CGCCAAAAAA TGCCTTTATA
GCGGATTATC TTCCCGGCGA TAAAGCAACA GCCCGCGCCA GCCTCGTGAT CTGTAATGGC
GGCAGCCTTA TGACCCAACA AGCGCTCATA AAGCGTGTGC CAGTGTTGGG AATTGTCAAT
AACCTTGACC AGCACCTCAA TATGGAGGCG GTGCAAAGCG CAGGCGCGGG CGAACTCCGG
CGGGCGGCAA ACGTGACCAC GGCGCATATT CTCGCCACCA CACGCCAAAT GCTAGACCAG
CCTCGTTATG CTCAGGCCGC TACCCGTCTA GCGGACTTAT TGTCCAACTA CAACGCCTCC
GATAAGTTTA ACTCCATTTT AGGCCGGATG TTCTCGAGGA AATTATGCGC AAAATCGGTT
TCAGCCAGAT AA
 
Protein sequence
MIDSSTGPIS SISKRRRILF FAEAVTLAHV ARPVALAKNL NPALYEVHFA CDARYHKLLG 
KLPFIWHPIH SLASEKFLEA LSKGSPVYRA DTLRAYVNED AKVIKEVSPD VIVGDFRLSL
AVSAPLAQIP YMTIANAYWS PYAKRRFPVP DIPLAKIIGI KAAQYLFNAI QPLAFAYHAL
PLNKIRHEYG LPKISLDLRH IYTYADHTLY ADIPALAPTI DLPSGHHYLG PVLWSPAVPL
PAWWEKIPAD KPVLYVSLGS SGQSQLLPEM LKALADLPIT LLVATAGRIK LPSPPKNAFI
ADYLPGDKAT ARASLVICNG GSLMTQQALI KRVPVLGIVN NLDQHLNMEA VQSAGAGELR
RAANVTTAHI LATTRQMLDQ PRYAQAATRL ADLLSNYNAS DKFNSILGRM FSRKLCAKSV
SAR
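
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch under stated assumptions, not the database's own method: it uses the standard codon assignments, which translation table 11 shares with table 1 (the two differ only in permitted start codons, and this gene begins with ATG). The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGATTGACT CCAGCACCGG GCCAATTAGC"))  # MIDSSTGPIS
print(translate("TCAGCCAGAT AA"))                     # SAR
```

The two fragments reproduce the first ten and last three residues of the protein sequence. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field.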