Gene Noc_1517 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1517 
Symbol 
ID3705853 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1682778 
End bp1684004 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content55% 
IMG OID637738003 
Productglycosyl transferase, group 1 
Protein accessionYP_343532 
Protein GI77165007 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTCATG TGCTTAGTAT TGTTCATTAT CCGGTCTTTG GGGGACCACA TAACCGCAAC 
ATGCGTCTTG CGCCTGTTCT CGTGGAGAAA GGTGTGCGGA CGACGGTGCT GTTGCCGGAT
GAGCCGGGTA ATGCTGCCGA GCGGTTGCGC AATGGGGGCG TCGAAGTGGT GCAGCTACCA
CTGCAGCGAA TTCGAGCGAC TTCGAATCTG GGTACGCATC TTCAATTCTT TATAAACTTA
CCAGCTCAAG TGCGTGCTAT CCGCGAACTT ATCAAGCGTC TGCAGATCGA TGTTGTCCAG
CTTAATGGAT TGGTAAATCC GCACGGCGCC ATTGCCGCAC GCGGCGTCGG TATTCCGGTC
GTGTGGCAGA TTCTCGATAC CTATCCGCCT ATGCTCCTGC GGCGCTTGAT GGGGCCGTTA
TTGCAGCGCT ACGCGGATGT AGTGATGTGC ACTGGCAAGC GGGTCGCAGC GGAACATCCT
GGTGCTACGA AACGGCCGAA TCAGTTAGTG CATTTTTTCC CTCCCGTCGA TCTGAACTGC
TTCACGCCTA ATCCAGCCGT TCGGAAGCAG GCGCGAATGA ATCTGGGGAT AGCCGATACT
TCTTTGGTAA TTGGCACTGT GGGCAATATC AATCTCCAGA AGGGACACGA TAACTTCATC
CGGGCGGCTG CCCATTTAAA GGCTAAAGTT CCTGATGCGC GTTTCGCAAT CCTTGGCGCC
ATGAACGAAA ATCATCGCAG CTATACCGAG GGGCTATGGG CGCTTGCGGA TCAACTTGGG
TTGCAGGTTG GTCGTGATTT GTTTGCCTTC GACCCGACCG GGCGCGTGCA TGAGTTGGTG
CAGGCCATGG ATGTATTTTG GATGACGCCA CGGCCGCGCT CGGAAGGTAT TCCAACCGCC
ATGGAAGAGG CGATGGCACT TGCGTTGCCG GTCGTGAGTT TCGATGTGGG CTCGATCGGA
GAGTTAATTG AGCATGGCCG CACAGGCTAT TTGGTGCATG ACCAAGATCC AAAGGCGGTT
GCTGAGTATA CCTTCGACCG CTTGCTTGAG CGACAGGTTC GTACCGTGAT GGGTAACCGA
GGGCGCCAAT TCGTTGAGGA ACATGCCTCT CTTGAAGCCT GCGCGAATCG GCATATGAAG
GCCTATAGCC TTGCACTACG TCTTGGCCCG GACGAGGCTG CAGCGCCTAT TGTTAAATCC
CCGGAGGAAT CATCATCAGA AACTTGA
 
Protein sequence
MIHVLSIVHY PVFGGPHNRN MRLAPVLVEK GVRTTVLLPD EPGNAAERLR NGGVEVVQLP 
LQRIRATSNL GTHLQFFINL PAQVRAIREL IKRLQIDVVQ LNGLVNPHGA IAARGVGIPV
VWQILDTYPP MLLRRLMGPL LQRYADVVMC TGKRVAAEHP GATKRPNQLV HFFPPVDLNC
FTPNPAVRKQ ARMNLGIADT SLVIGTVGNI NLQKGHDNFI RAAAHLKAKV PDARFAILGA
MNENHRSYTE GLWALADQLG LQVGRDLFAF DPTGRVHELV QAMDVFWMTP RPRSEGIPTA
MEEAMALALP VVSFDVGSIG ELIEHGRTGY LVHDQDPKAV AEYTFDRLLE RQVRTVMGNR
GRQFVEEHAS LEACANRHMK AYSLALRLGP DEAAAPIVKS PEESSSET