Gene Noc_3068 details


Gene Information

Locus tag: Noc_3068
Symbol:
ID: 3704484
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 3457408
End bp: 3459795
Gene Length: 2388 bp
Protein Length: 795 aa
Translation table: 11
GC content: 48%
IMG OID: 637739542
Product: sucrose synthase
Protein accession: YP_345039
Protein GI: 77166514
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID: [TIGR02470] sucrose synthase
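
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch; the variable names are illustrative, and it assumes that the start and end positions are inclusive and that the final codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields above.
start_bp = 3457408
end_bp = 3459795
gene_length_bp = 2388
protein_length_aa = 795

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS should contain a whole number of codons (3 bases each).
codons, remainder = divmod(span, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
residues = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("whole number of codons:", remainder == 0)
print("residues match protein length:", residues == protein_length_aa)
```

Under these assumptions all three checks agree with the values in the record.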


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGGAAT TAGCCAATTT TGTAGGCCAG CACAAAGAGA TCGTTTATCT ATTGCTGCGC 
CGCTATTTGG CGCTTCAAAG ACCTTTTTTG CTTAGATCTG ATCTCGTCGA TGAATTTGAT
CTTTTCTGTA AAGAGAATGA TGAGGGGGCA TTGCTCCAAG ATTCTCCCTT AGCAACAATA
ATTCAAGCCG CCCAGGAGGC GGCGGTTGAT CCGGAGTGGA TTTATTTATC AGTGCGTCCC
CGAATTGCAA ACTGGGAATA TTATCGGATT CATACCGAGG TAATGCAGAT TGAAACGGTG
CCTGTCTCCC AATTCCTAGA ATTCAAGGAA CGTCTTGTGC TTGGTCCCAC CCAGCCTCAA
TCGTGGCCGC TGAAAATTGA TATGGGACCC TTCAACCGGG AGTTTCCCCG GCTTAGAGAA
ACCCGCTCCA TTGGGCGCGG AATGGATTTT CTCAATCGGC ATCTTTCTAA CCAGCTCTTT
AATGAGTTAG AGACCGGAGG ACAATATCTG CTCAGTTTCT TAAGTGTCCA CCACTGTCGG
GGACAACCTT TAATGCTGAA TGATCGTATC CAGGATGTAC AGGGGTTGCG CTGTGCTTTG
CGTCTGGCAA TGGATTTTCT TGGTGGTTTT CAGGAAGCTG CTGAATGGGA TGCCGTGGGG
CATAAGTTAC AGGAATTTGG ATTTGAGCGG GGCTGGGGTA GAACTGCGGC CCGAATACAG
GATTCCTTTA GTCTTTTAAT GGATATTCTT GAAGCGCCTG AGCCTGGTAA CTTGGAGCAT
TTCTTGGCGC GCATTCCCAT GATCTTTAAT ATCGTGATTC TTAGTCCCCA TGGGTATTTT
GGCCAAGGTA ATATTCTCGG ATTGCCGGAT ACTGGGGGCC AAGTAGTGTA TATCCTGGAT
CAGGTTCGGG CGCTAGAGAA GGAAATGCAC CGACAGTTAA AAGAGCAGGG ACTTGATGTT
GCACCTCAGA TTTTGGTCGT TACTCGCTTG ATTCCAGAGG CTCAGGGCAC TCGATGCGAT
CAGCGCTTAG AATCCATCGT AGGGACAGAA AATGCCGCTA TCTTAAGAGT CCCTTTTCGC
AATGCTGGGG GCGAAGTACT GCCCTATTGG CTCTCCCGCT TTGAAGTTTG GCCTTACCTT
GAGCGCTATG CCATGGATGC GGAAAGGGAA ATGCTTGCCG AACTTGAAGG GAGTCCAGAT
CTCATCATCG GTAACTACTC GGACGGAAGC TTAGTGGCGA CCCTTCTCTC TCAGCGTTTA
CGGGTGACTC AATGCAATAT TGCCCATGCC TTGGAAAAGG CTAAATATCT TTATTCCGAT
TTATACTGGC GGGAGAATGA TGCCCAATAT CATTTTGCCT GCCAATTCAC TGGCGATCTC
ATCGCTATGA ATAGCGCCGA TTTTATTGTC ACCAGCACTT ATCAGGAAAT TGCCGGGAAT
AAAAACAGCG TGGGCCAGTA TGAGAGTTAT AGCGCCTATA CTTTGCCGGG GTTGTATCAG
GTGATTCATG GCATTGACGT TTTCGATCCT AAATTCAATA TTGTCTCGCC TGGTGCGGAT
GGGGAAGTTT ACTTTCCCTA TACGGATACA AAACGGCGTT TAAGCGGATT GCGCCAGGAA
ATCGAAGCGT TGATATGGGG TGATGAACGC CCCGACGCCC GTGGCAAACT TCAGGATCAC
ACCAAACCTT TATTGTTTAC TATAGCGCGC CTAGACCGAA TTAAAAATAT CACGGGTTTA
GTAGAGTGGT ACGGGCGCTG TGAGCGGCTG CGAAAATTAG CCAATTTAGT GGTCGTGGGC
GGGTATATAG ATAAGAGTCA GTCAGCGGAT AGCGAAGAAC AGGTACAAAT TGCCCGGATG
CATCAGCTGA TAGAAGAATA TAAATTGGAT AGCCAAGTCC GCTGGTTGGG AGTTATGCTG
CAAAAAAACT TGGCAGGAGA GCTTTATCGC TTCATTGCGG ACTCCCGTGG CGCCTTTGTT
CAGCCAGCTC TTTTTGAAGC CTTTGGTTTG ACTGTGATAG AGGCTATGAG TAGCGGGTTG
CCCACCTTTG CTACTTGTTA TGGTGGGCCT TTAGAGATTA TCCAAGAGGG AGTGTCAGGT
TTTCATATTG ATCCTAACCA TGGCGAAAAA GCCGCTGACC GTATCGCTGA TTTTTTTGAA
CATTGTCAGA CTGAAGCGGG ATATTGGGAC AAATTCTCCC AAGGTGCTTT GCGCCGGATT
AAAAACCATT ATACTTGGGA ACTCTACGCC GAGCGGATGA TGACATTGTC CCGCATCTAT
GGTTTTTGGA AATACGTGAC TAATCTGGAG CGGGCGGAGC GCCGACGTTA CCTGGAGATG
TTTTATAATC TGCAATTTCG CCCCCTCGCT CAGCAGATAG AGCATTAG
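
The GC content field in the record can in principle be recomputed from the gene sequence. The sketch below shows one way to do so; the function name `gc_percent` is hypothetical and not part of any database tool, and it assumes the sequence is given as plain text in blocks of ten bases separated by spaces, as shown above. Only the first row of the sequence is used here for brevity, so the printed value describes that row and not the whole gene.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C among the A/C/G/T characters in seq.

    Spaces, newlines and any other characters are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T characters")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First row of the gene sequence above (60 bases), used only as a short example.
first_row = "ATGAAGGAAT TAGCCAATTT TGTAGGCCAG CACAAAGAGA TCGTTTATCT ATTGCTGCGC"
print(round(gc_percent(first_row), 1))
```

Passing the full 2388 bp sequence to the same function would give the figure to compare with the GC content value in the record.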
 
Protein sequence
MKELANFVGQ HKEIVYLLLR RYLALQRPFL LRSDLVDEFD LFCKENDEGA LLQDSPLATI 
IQAAQEAAVD PEWIYLSVRP RIANWEYYRI HTEVMQIETV PVSQFLEFKE RLVLGPTQPQ
SWPLKIDMGP FNREFPRLRE TRSIGRGMDF LNRHLSNQLF NELETGGQYL LSFLSVHHCR
GQPLMLNDRI QDVQGLRCAL RLAMDFLGGF QEAAEWDAVG HKLQEFGFER GWGRTAARIQ
DSFSLLMDIL EAPEPGNLEH FLARIPMIFN IVILSPHGYF GQGNILGLPD TGGQVVYILD
QVRALEKEMH RQLKEQGLDV APQILVVTRL IPEAQGTRCD QRLESIVGTE NAAILRVPFR
NAGGEVLPYW LSRFEVWPYL ERYAMDAERE MLAELEGSPD LIIGNYSDGS LVATLLSQRL
RVTQCNIAHA LEKAKYLYSD LYWRENDAQY HFACQFTGDL IAMNSADFIV TSTYQEIAGN
KNSVGQYESY SAYTLPGLYQ VIHGIDVFDP KFNIVSPGAD GEVYFPYTDT KRRLSGLRQE
IEALIWGDER PDARGKLQDH TKPLLFTIAR LDRIKNITGL VEWYGRCERL RKLANLVVVG
GYIDKSQSAD SEEQVQIARM HQLIEEYKLD SQVRWLGVML QKNLAGELYR FIADSRGAFV
QPALFEAFGL TVIEAMSSGL PTFATCYGGP LEIIQEGVSG FHIDPNHGEK AADRIADFFE
HCQTEAGYWD KFSQGALRRI KNHYTWELYA ERMMTLSRIY GFWKYVTNLE RAERRRYLEM
FYNLQFRPLA QQIEH
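
The relationship between the gene sequence and the protein sequence can be illustrated by translating the nucleotides codon by codon. The following is a minimal sketch, not the database's own method: the helper `translate` is hypothetical, and the codon table is built from the amino acid string that NCBI lists for translation table 11 (whose codon-to-amino-acid assignments are the same as the standard code). Only the first and last stretches of the gene sequence are translated here.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(seq: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# Start of the gene sequence (first 30 bases) and its final 18 bases.
print(translate("ATGAAGGAAT TAGCCAATTT TGTAGGCCAG"))  # MKELANFVGQ
print(translate("CAGCAGATAG AGCATTAG"))                # QQIEH
```

The two outputs match the first ten and last five residues of the protein sequence listed above.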