Gene Noc_1979 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1979 
Symbol 
ID3705438 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2273335 
End bp2274624 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content51% 
IMG OID637738455 
Productglycosyl transferase group 1 
Protein accessionYP_343971 
Protein GI77165446 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID[TIGR03087] sugar transferase, PEP-CTERM/EpsH1 system associated 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTGATC TTCTATTCCT TGCGCACCGG ATACCTTTTC CCCCCAATAA AGGCGATAAA 
ATACGTTCTT ATCATTTGTT GCGCTTTCTT GCCAGCCGTT ATCGTGTCCA TGTAGGCGCT
TTCGTGGACG ATCCAGTAGA CTGGAAATAT GCTTCAGACC TTCATCGGTT AGGGGTAGAT
GAGCTTTGCC TGCGGCCGCT ACCGCGGGCT CTCGCATTGG CCCGATCCCT GACAGCGTTG
GTAGCCGGTA AGCCGCTTGG TTTAGCTTAC TATAAAGATC GTTGCATGAG CCGCTGGGTG
CAAGACATCG CTGCTCGGCC TTCCTTGGAA GGCGTTGTGG TGTTTTCCTC TGTGATGGCT
CAGTATATAA CTATGCTTCC CCGGCAAGTG CCAGCTATTG TCGATTTTGT GGATGTGGAT
TCAGAGAAAT GGCATGCTTA TAGTCAGACA TCGAGTCCTC CTTTGTCTTG GGTTTATCAG
CGTGAAGCCC GTACTCTTTT GGCCTTCGAG CGAAAAATAG CTGCGCAAGC AAAAGCTGCA
ACATTTGTTT CCTCGGTGGA GGCTGAGTTA TTCCGCCGTT TAGCTCCAGA AGTGGCGAAA
CAAGTATTTG CTGCGCCTAA TGGGGTCGAT ACGGATTTTT TCTCCCCGGA TCGGCATTAT
CCTTCTCCGT ATCCTCCGGA GCAGCGGGTG TTGGTTTTTA CTGGTGCCAT GAATTACCGT
CCTAATATAG ATGCGGTAAT TTGGTTTACT AAAACTATTT TTCCGAAGAT TCTAGCGGTG
GTTCCTGCGG CCTGTTTCTA TATTGTGGGT ACGCAACCGG CTGAAGCTGT ACGGCGTCTC
TCGGCAGAGC GGCAAGTATA TGTGACAGGT ACTGTGGCGG ATATGCGTCC TTATTTAGCC
CATGCCCGAG CTGCGGTAGC GCCTCTGAGG ATTGCCCGTG GAGTTCAAAA CAAGTTGTTG
GAGGCGATGG CCATGGCCCG GCCTGTGATA GCTACTCCAG AGGCCGCCGA GGGTATTGTT
TTGCCTCCGG TATGTGAGAA TCTAGTGAGT GCAACGCCGA ATCAATTTGC AGCAAAAACT
ATTGCCGTAT TGCTGCAGGG AAAGGGAAAG GAGGCAGGTA GGAAGGGCCG GGAGCATGTC
TTACAAAATT ACCATTGGGA TAATCACCTA GAACGTTTCT CGGAGTTGCT TACTCATCCC
TCCCTCGCGC CTATAATTAG TGTGGCAAAA GAGGCCACTG GTGGAGAAAA AAATGCAGAT
GAACCCGACC GAACAGCGGT ACCTAGATAG
 
Protein sequence
MGDLLFLAHR IPFPPNKGDK IRSYHLLRFL ASRYRVHVGA FVDDPVDWKY ASDLHRLGVD 
ELCLRPLPRA LALARSLTAL VAGKPLGLAY YKDRCMSRWV QDIAARPSLE GVVVFSSVMA
QYITMLPRQV PAIVDFVDVD SEKWHAYSQT SSPPLSWVYQ REARTLLAFE RKIAAQAKAA
TFVSSVEAEL FRRLAPEVAK QVFAAPNGVD TDFFSPDRHY PSPYPPEQRV LVFTGAMNYR
PNIDAVIWFT KTIFPKILAV VPAACFYIVG TQPAEAVRRL SAERQVYVTG TVADMRPYLA
HARAAVAPLR IARGVQNKLL EAMAMARPVI ATPEAAEGIV LPPVCENLVS ATPNQFAAKT
IAVLLQGKGK EAGRKGREHV LQNYHWDNHL ERFSELLTHP SLAPIISVAK EATGGEKNAD
EPDRTAVPR