Gene Noc_1524 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1524 
Symbol 
ID3705821 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1688614 
End bp1689846 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content59% 
IMG OID637738010 
Productputative glycosyl transferase 
Protein accessionYP_343539 
Protein GI77165014 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTATCT TGATTTACGG TATTAACTTT GCTCCGGAAT TGACCGGTAT CGGCAAGTAT 
AGTGGGGAGA TGGCGGCGTG GCTGGCAGGC CGCGGGCATG AAGTCCGGGT AGTGACGGCT
CCTCCCTATT ATCCGGAATG GCGCGTTAGG GAGGGTTATT CCAGTTGGCG TTATGGCCGT
GAAAGCTTTG CCCTTCCAGC CGGCGGTGGT TTCAGCGGCT GGCGCTGCCC CCTTTGGGTA
CCCCGTTCTC CTTCCGGCCT AACCCGGTTA TTGCATCTAG CCACTTTTAT GTTAACCAGC
GCGCCTGTTA TGTTGCGCTA TCTCCTATGG CGTCCCCATG TGGTCGTGCT GATTGCGCCC
ACGCTTTTTT GCGCTCCCGT GGCCTGGACC GTGGCGCGGC TCGCGGGAGG GGCGGCCTGG
CTGCATCTTC AGGATTTTGA ACTGGATGCC GCGGTGGGGC TTAACCTATT GCGTCATGGC
TGGTTGCGTG GAGCCGCCAG AATTTTTGAA CGGCAGTGGC TGTTGGCTTT TGATCGGGTT
TCCAGTATCT CCCATCGGAT GCTCTCCCAT CTGGGGGACA AAGGGGTGGC CGAAACGCGA
TGCGTGCTTT TTCCTAATTG GGTGGATACT CGAGCGCTTT ATCCTCTACC GGGAGTGAGT
GCCTATCGGG CCGAGTTGGG GATCGCCCCG GAGACGGTCG TGGCCCTTTA CTCGGGAAAT
ATGGGGGAGA AGCAAGGGCT GGAGCTTTTG GTGGAGGCGG CCCGGCGGTT GCGGAACCGG
TCAGAAATTG TGTTTGTGCT GGCCGGTGCG GGGGTTGCCC GCGCCCGGCT GGAAGTCCAG
GGAAAGGATT TACCCAATCT TCGATGGTTG CCGTTGCAGC CTGCCGAGCG GTTGAACGAA
TGGCTCAATT TAGCGGATAT TCACTTATTG CCCCAGCGCG CCGATGCGGC TGATCTGGTG
ATGCCTTCCA AATTGACGGG GATGCTGGCC AGTGGCCGGC CTGTGGTGGC CACGGCGCGG
CTGGAGACTC AAATTGGGCA AGTGGTTGCG GGCTGTGGGC AGTTGGTGGC GCCGGGAGAT
GTGGAGGGCT TGGCGGAAGC CATTTATCAA ATGGCGCGGA ACCCCGGTCG GCGGCTTAAG
CTCGGTCAAC AGGCACGGCG TTATGCGGTG GAGCACTGTG ATTATGAGCG CGTGATGTTG
GATTTTGAAA AAAACCTGGG AAGTTTATTA TAA
 
Protein sequence
MRILIYGINF APELTGIGKY SGEMAAWLAG RGHEVRVVTA PPYYPEWRVR EGYSSWRYGR 
ESFALPAGGG FSGWRCPLWV PRSPSGLTRL LHLATFMLTS APVMLRYLLW RPHVVVLIAP
TLFCAPVAWT VARLAGGAAW LHLQDFELDA AVGLNLLRHG WLRGAARIFE RQWLLAFDRV
SSISHRMLSH LGDKGVAETR CVLFPNWVDT RALYPLPGVS AYRAELGIAP ETVVALYSGN
MGEKQGLELL VEAARRLRNR SEIVFVLAGA GVARARLEVQ GKDLPNLRWL PLQPAERLNE
WLNLADIHLL PQRADAADLV MPSKLTGMLA SGRPVVATAR LETQIGQVVA GCGQLVAPGD
VEGLAEAIYQ MARNPGRRLK LGQQARRYAV EHCDYERVML DFEKNLGSLL