Gene Noc_0742 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_0742 
Symbol 
ID3707008 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp804276 
End bp805385 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content54% 
IMG OID637737244 
Productglycosyl transferase, group 1 
Protein accessionYP_342785 
Protein GI77164260 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.377387 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATATTG GATTTATCAG TAACTGGTGC AATCGTGGGC AAGGGATTGT GACCCGCCAG 
ATTCGAGCTA TTTTCGCCGA GGCGGGACAT GACACCTACG TACTGGCCCG GCCCACCCGG
GCCAGGGCGG CCATGCCCAA TCTCATTGAT AGCCGGCAGG AGTGGCAGGT GCCCCATCTG
ACTCACGGCT CGGCTTATGA TATGCCTGTC GGGGAATATA TGGCTTGGGC CAAGGAGTCC
GCCCTGGATG TGCTTTTTTG CGATATGAAT ATGCAATTCG AGGCCATTGT CGCCATTCGA
AAGTTGGGGG TGCGAACCAT TGGGCGTTTC GTGTGGGAGG CTTTCCATCC GGATTATGTG
GCAGCGGTCA AACAAGCCTA TGATATTGTT TATTCTTTGA CCCGCTGCGA GCAGGAGAAC
TACCGGAAAA TGGGAATTTC CTCCCCCTAT GTCCGGTTTG GTTTGGCGCC CTCATTTACC
GCTTTTTCTC CCATCAAGCG CCCCGATGAT GCCCTTTACT TTTTCTTCCA CGGAGGCACC
CAGGGAACCC GCAAGCCCAT CCAGGCCACG CTCAAGGCGT TCAAGCAGGT AAAGAACCCC
CATATTCGGC TGATTATTAA GAGCCAGTGC ATTGATAAAG CCTCCGAGCC TGTGACCATC
GAGGATGATC CCAGAATCAC CCATATAGTG GCGGATTTGC CCTTCGAGGA GCACCGGCGG
TTATTTTCAA GTTGCCACGT TTGCCTCTGC CCGAGCCGCT GGGAAGGGTT GGGGGTCCAT
TTGTTCGAGG CCCTGGCCTA TGGGATGCCG GTAATTTCCA ATGATATCGC CCCCATCAAT
GAAGTGATCC GCCACGGGCG GAGCGGTTTG CTGGTGCGCA GTTTCTCCAA GCGCAAGAAT
CGTTGTGGCC TTCCCATTTT CGAGCCTGAC GAAGGGCATT TACGGGAATG TATCGAGGAA
CTCAGCAATC CAGTCCGGTT GGCGGCCCTG ATGGCGAGCA CCCGGGAGGA AGCAAAGCAA
TTTGATTGGG CATTGACCCG GCAAGACTAT CTTGAATTAG CCACTTGCAC TAGGGAAAAT
CTTAGCCGGA AACAAAAGAA TGACGGGTAA
 
Protein sequence
MNIGFISNWC NRGQGIVTRQ IRAIFAEAGH DTYVLARPTR ARAAMPNLID SRQEWQVPHL 
THGSAYDMPV GEYMAWAKES ALDVLFCDMN MQFEAIVAIR KLGVRTIGRF VWEAFHPDYV
AAVKQAYDIV YSLTRCEQEN YRKMGISSPY VRFGLAPSFT AFSPIKRPDD ALYFFFHGGT
QGTRKPIQAT LKAFKQVKNP HIRLIIKSQC IDKASEPVTI EDDPRITHIV ADLPFEEHRR
LFSSCHVCLC PSRWEGLGVH LFEALAYGMP VISNDIAPIN EVIRHGRSGL LVRSFSKRKN
RCGLPIFEPD EGHLRECIEE LSNPVRLAAL MASTREEAKQ FDWALTRQDY LELATCTREN
LSRKQKNDG