Gene Noca_3550 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_3550 
Symbol 
ID4599429 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp3764176 
End bp3765345 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content76% 
IMG OID639778158 
Productglycosyl transferase, group 1 
Protein accessionYP_924737 
Protein GI119717772 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAAGAGG CCACACGGGC GAGGTTCACC CGACATCCAC CTGGCCGCAG CGGCCCGGTC 
ACCCCCCGAG GCGATCCTGA CGACATGCGG ATCGCCCTGG TGACCGAGAC GTTCTTCCCC
GCAGCGGACG GCACGACGAC GACCGTCAAG GCCGTCGCCG ACCGGCTCGT CGAGACCGGC
CACGAGGTGC TCGTGGTCGC GCGCGGCCCC GGCCTGGCGT CGTACGGCGG GAGCGAGGTG
GTCCGGGTCC GCCAGCTGGA CCGGCCCGGC GCGCAGGTCC GCGAGGCGCT CGAGCGGTTC
GGCCCCGACC TGGTGCACGT CACGTCCCCG GACGCCGTCG GGCGCAAGGC GCTCAAGCAC
GCCCGCCGGC TCGGCGTCCC CACGCTGGTC GTGGAGCAGT CCGCCCTGAT GGACGTCGCC
GCCGACTACT GGCGCAGCCG GGTCGCCCGG CGCAGCGACC GCGTGCTGGT GACGTCGCGG
TGGATGGTGG GCCGCCTGGC CGAGTTCGAG GTCGACGCCG GCCTGTGGCC TCCCGGCACG
GACCCGGCCG CGTTCACCCC CGCCCTGCGC GACGAGTGGC TGCACGAGCG GTGGTCGCGG
GCGCGGTCCC GCACCGGCCC CCTGGTCGTC GTGGGGTATG TCGGCAGCCT CCGCAAGCGC
CACGACGTGC GCCGGCTGGC GGCGCTCGTC CGGGTGCCGG GCATCCGCAC GGTCGTCGTC
GGCGACGGCC CGCAGCGCGC GTGGCTCGAG GCCCGGTTGC ACGGCGCGAA GTTCACCGGG
GAGCTCGGCA CCGGCGACCT GGCCGCCGTG CTGCCGACGC TCGACGTGCT GGTCCATCCC
GGTGAGCACG AGACCTGCTG CCATGCGCTG CGTGAGGCGG CCGCCGCGGG CGTGCCGGTC
GTCGCGCCGC GCTCGGGCGG CGCTCCAGAC GTGGTGGTGT CCCTCGAGAC CGGCCTCCTC
TACGACCCGA CCGACGAGCA CGCGCTGGCC CGTGCGGTCG CCGCCATCGC CGCGGACCGG
CACCGCTCCC TGCTCGGCGC GCGCGCCCGC GAGCTCGCGA CGCGCACCTG GCGACAGGCG
GTCGACGAGC TCGTGGAGCG GCACTACGTC CCGCTCGCGG CGTCGCGGAG GGCCCCCGGC
GCGGAGGAGA AGGTCCTGAT TTCTCCGTAA
 
Protein sequence
MKEATRARFT RHPPGRSGPV TPRGDPDDMR IALVTETFFP AADGTTTTVK AVADRLVETG 
HEVLVVARGP GLASYGGSEV VRVRQLDRPG AQVREALERF GPDLVHVTSP DAVGRKALKH
ARRLGVPTLV VEQSALMDVA ADYWRSRVAR RSDRVLVTSR WMVGRLAEFE VDAGLWPPGT
DPAAFTPALR DEWLHERWSR ARSRTGPLVV VGYVGSLRKR HDVRRLAALV RVPGIRTVVV
GDGPQRAWLE ARLHGAKFTG ELGTGDLAAV LPTLDVLVHP GEHETCCHAL REAAAAGVPV
VAPRSGGAPD VVVSLETGLL YDPTDEHALA RAVAAIAADR HRSLLGARAR ELATRTWRQA
VDELVERHYV PLAASRRAPG AEEKVLISP