Gene Noc_2281 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2281 
Symbol 
ID3705073 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2631275 
End bp2633167 
Gene Length1893 bp 
Protein Length630 aa 
Translation table11 
GC content53% 
IMG OID637738760 
Productpolysaccharide biosynthesis protein CapD 
Protein accessionYP_344269 
Protein GI77165744 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1086] Predicted nucleoside-diphosphate sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCCAAAGC GTGAAATCGG CCCTTTTTTG CAGGCCCAAA AGCGACGGGC CGTGGTCATT 
GTGCATGATG TCACTGCCGT CTGTTTGGCC TGGGTGTTAA GTTACCTTAT TCGTTACAAT
TTATCGCCTT ATTGGGTGGA CTGGGAGACT TGCCTCAGCA CCTTGCCGGT GATCGTGGTG
GTCCAAAGCA TCGTCTTGAT TTGGATAGGG CTTTATCGGG GCTTGTGGCG CTTTGCGAGC
ATTCCTGATT TATTAAACAT CATTCGCGCC GTAGGTTACG GAGCCCTTTT AGTTGCGCTA
ACGCTTTTTC TTATCAACCG CCTCAACGGC GTACCCCGCA GTGCATTGCT GCTTTATCCT
ATCTTTTCTT TACTGCTGCT GGCAGGTCCC CGGCTTGCCT ACCGGGTGTG GAAAGATCGC
CGTCTTACCC TGGCTGCAAG TCCTCATCGA AAACGAGTCT TAATTTTGGG AGCTGGCCGT
GCTGGTGATT TGCTGGTCCG TGATATGCTT GTGGAGGGGC ACTACCTGCC GGTGGCTTTT
CTAGATGATC GGCCTGATCT GATAGGCCGG CAAGTGCGAG GAATTCCAGT AGTGGGGTTG
TTAGCGCAAT TGCCTGCCAA GGTGGGTGAG ATGGCTGCAG ATGTGGTGGT GATTGCGATG
CCTTCTGCTA ATAACCAGCA GATGCGCCGT GCTGTTTCGC TCTGTGAGCA GGCCCAAGTA
CCTTATCGGA CTTTACCCCG TCTGCAAGAT TTGGTTGCGG GTCGGGCGAG CATCAAGGAG
CTTCGAGAAG TTGCCATTGA CGATCTATTG GGGCGGGACG AAATATTTTT GGATTGGGAT
TATATCCGCC GAGACTTGGA AGCTAAATCC CTGTTGGTCA GTGGCGGAGG AGGATCGATC
GGATCCGAAC TTTGCCGGCA ATTAGGGCGC CTTGGGCCCG CTTCCCTTAT TGTCATCGAT
AATAGCGAAT ATAAGCTCTA TCGCCTCGAA CAGGAGTTGC GGCAGAAACT CCCGGCTCTC
AATCTCCACG CTTATTTATG TGATGTGCGT GATGAAGCTG GTGTGAAGGA GATCCTAGCG
CGTCATCTGC CAGAGGTGGT GTTCCATGCA GCAGCTTATA AACATGTACC CCTCCTAGAG
AGGCAAGCAC GGCAGGCTGT GCGAAACAAT ATTTTTGGAA CCCGCTGTTT AGCCGAGGCG
GCAAGGCAAG CAGGCTGTGG AACTTTCGTC CTGATTTCAA CCGACAAAGC GGTTAATCCA
ACGAGTGTGA TGGGCACCAG CAAGCGAGTT GCGGAGCTTC TCTGCCAAGA TCTTAACCGA
CACAGCCAAA CTCGGTTTAT CGTCGTGCGT TTTGGCAATG TGCTTGGATC AGCAGGGAGC
GTGGTGCCGT TATTCCAGGC CCAAATTGCC GCCGGTGGGC CGATCACGGT GACCCATCCC
GAAATGACCC GCTATTTCAT GACTATTCCC GAGGCATGCC ACTTGATTAC CCAAGCCGCT
ATGATAGGCA AAGGAGGGGA GATATTCGTC CTGGATATGG GCGAGTCGGT TAAAATTATC
GAACTTGCTG AGCAAATGAT CCGCCTTTCT GGCCAGGAAC CGGCAGTAGA AATCATTTAT
ACGGGGCTGA GGCCGGGGGA AAAACTCCAT GAGGAATTAT TTTATGCCCA TGAATCTCCT
GTACCTACCC AATCGCCGAA GATATTGCTA GCACGGCATG GTGCTACCGA CTGGAGACGC
TTGGAAAGCG GACTGGATGA GTTAGAGAAA GCCTGTGGGC TGGGAGAGGA ATGGCAGGTG
AAGCAGGTGT TAGCGCGCCT ATTGCCGGAG GTGAGGGCGG AGGCTAAATT GCAACAGGTG
GGTTCTAAAG TAATTGCAAT CAACAGGAGT TAA
 
Protein sequence
MPKREIGPFL QAQKRRAVVI VHDVTAVCLA WVLSYLIRYN LSPYWVDWET CLSTLPVIVV 
VQSIVLIWIG LYRGLWRFAS IPDLLNIIRA VGYGALLVAL TLFLINRLNG VPRSALLLYP
IFSLLLLAGP RLAYRVWKDR RLTLAASPHR KRVLILGAGR AGDLLVRDML VEGHYLPVAF
LDDRPDLIGR QVRGIPVVGL LAQLPAKVGE MAADVVVIAM PSANNQQMRR AVSLCEQAQV
PYRTLPRLQD LVAGRASIKE LREVAIDDLL GRDEIFLDWD YIRRDLEAKS LLVSGGGGSI
GSELCRQLGR LGPASLIVID NSEYKLYRLE QELRQKLPAL NLHAYLCDVR DEAGVKEILA
RHLPEVVFHA AAYKHVPLLE RQARQAVRNN IFGTRCLAEA ARQAGCGTFV LISTDKAVNP
TSVMGTSKRV AELLCQDLNR HSQTRFIVVR FGNVLGSAGS VVPLFQAQIA AGGPITVTHP
EMTRYFMTIP EACHLITQAA MIGKGGEIFV LDMGESVKII ELAEQMIRLS GQEPAVEIIY
TGLRPGEKLH EELFYAHESP VPTQSPKILL ARHGATDWRR LESGLDELEK ACGLGEEWQV
KQVLARLLPE VRAEAKLQQV GSKVIAINRS