Gene Noca_1338 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_1338 
Symbol 
ID4598577 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp1408004 
End bp1410322 
Gene Length2319 bp 
Protein Length772 aa 
Translation table11 
GC content72% 
IMG OID639775933 
Productglycosyl transferase family protein 
Protein accessionYP_922539 
Protein GI119715574 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCCGCCC TGCTGCTGGC GGCGATCGTC GCCGCCGCGA TCTCGACCGG GTCCATGTTC 
GTCGGACGGG CCCGCGACCG GGCGGTCGGT CGCGTGATGT GGACCCGGCG GCTGCTCTAT
GCCTGCGTCG GGACCGCGAT CGTCGCCGCG GTCGCCTCGT GGGTCGCATT CCTGGTGGAC
GGGCGCGCGG GATCCGCTCG CCTCTCGTTC GTGCTGATCC TGCTCGCCAG CATCCTCTGG
TTGCCGGTGA CCCGGCGATG GAACGCGCGG GCGCACCTGT GCTGGTCGAC CAACGTCTAC
CTGTACGTCG TCTACCTGGT GTTCATGTTC GAGTGGACCT TCGCCAGTCC GTTGGGCGCG
ATGGGCCGGG CGGGGGCGCT GTCGCTGTGG GGGCTCGAGG TCCTCGCGGC GCTTCTCGGC
TGTGCCTACC TCTGGGAGCT GTGCGACGCC ATGGGCAGCG AGCAGTGGCG ACGTCGGGTA
TCGGACGGCG TGGAGCCGGC GCGAGCAGCG GACGCCGGCA TCCGGCCCTT CGTGAGCCTG
CACGTCCCGG CGCACGAGGA GCCCCCCGAG ATGGTGATCG AGACGCTGGA GTCGCTGCGC
GGGCTCGACT ACGAGCACTA CGAGATCATC GCGATCGACG ACAACACGAC CGACGAGTCG
CTCTGGCGCC CGGTGGAGGC CTGGTGCGCC GCGCACGGCG TGAAGTTCGC GCACCTCGAG
GACTGGCCCG GCTACAAGTC CGGAGCGCTC AACTACGCGC TGCGCGAGAT GACCGACGAC
CGGGCCGAGC TGATCGGCGT GGTCGACTCC GACTACCAGC TGGAGCCGGA CTTCCTCGCC
CGGTGCGCTC CGCTGTTCGC CGACCCGCGG GTGGGCTTCA TCCAGTCGCC GCAGGACTAC
CGCGACTGGG AGGGCGCGCC GTTCTACCGA CGGCTGTACT ACTCCTACAA GTACTTCTTC
TCGGTCTCGC AGCCCTCCCG CAACGAGCGG GACGGGGCGA TCTTCGCGGG CACCATGGGC
CTGATCCGCC GTCAGGCGCT CGAGGACGTC GGCGGCTGGG ACGAATGGTG CATCACCGAG
GACGCCGAGC TCTCCCTGCG GGTGCTCCGG GCGGGCTGGT CCGGCATGCA CGTCGACGCG
TCGTTCGGCC ACGGCGTAAT GCCGCTGACC TTCGAGGCAC TCAAGGGACA GCGGTTCCGC
TGGTGCTTCG GCGGCATCCA GATCCTGCGG ATGCACTGGC GCTCGCTGCT GCCGGGCCTC
CGGGGCGGGG GCAACCGGCT GACCCTGGGG CAGCGCTGGG CCTACCTCAG CGGCGGGCTG
CAGTGGTACG GCGACCTGGT CGGCGTGCTG TTCTTCCTGT TCCTGCTCGG CGGCGCCACC
AACCTCGCAC TCGGCGGCGG CCTGCTCTTC CGCAAGCTCT CGCCGTTCCT CCTCGCCGTG
ATCCCGCTGC TCGTGCTCCT GGGCTTCCTG CGTGCCGTGT CCCTGATCCG CCGGGGGACC
GGCGCCGGCT GGTCGGACGC GATCGGGGCG TTCCTGATCT GGCAGTCGAC GACCCTGGTC
GTGGCCCGTG CGTCGGTGCA GGGATTGTTC GCGCGCAAGG CCGAGTTCCT CCGGACGCCG
AAGACGGAGG AGGGGCTCAA CCTCTGGAAG GCGCTCTCCG CCAACAAGGG CGAGGTCCTG
CTCGCCGTGC TCGGGCTGGC CGGCATCGTC GTCGGCCTTC TGCACATCGA CACCCTCAGC
GGGGTCCTCA CCACGGGGCT CCTCGTGCTC CCGACCCTCG CGTTCGCGAG CGCGCCCGCC
AACAGCATCG CCGCGCAGCG CGCCGCCCTG CCGGCCGTGC TCGCCGAGCG CCGCCGCCTG
GAGTCGCGTC GTTCGTTGGC CGTGCGCGGC AGCACGGCCG CCGCCACCAT CGGCATCGCT
GCAGCCGCCA GCGTGGTGCT GGCGCTGGTC GCCCCGGGAG GCGAGGACGT GCGGCCGCCG
AGCCTGGTGC GTCCCGGGCG CGAACCGACC GTGACGCCCG GTCCGAGCCC CACGGACGAC
CCGGCCGGGC CGCCGAGCAC CTCCGGGCCG GCGTCGCCGT CCCCGCGCGC TTCGTCACCG
GCGAGCGCGC CTGCGGGCAG CCCGCCCGCG ACCGGCTCGT CGGCGTCGTC CAGCGGCTCA
CCCAGCTCAT CACCGAGCTC CTCGCCGAGC TCCTCGCCGA GTTCCTCGCC GAGCGCGTCG
GCGACCCGGT CGCCCGCGAC CTCGCCGAGC GCGTCCCCGA CCTCGACGGC TCCGACGCCC
TCGCCCAGCC CGACCTCGCC CTCGAGCCCG TCCCCCTGA
 
Protein sequence
MSALLLAAIV AAAISTGSMF VGRARDRAVG RVMWTRRLLY ACVGTAIVAA VASWVAFLVD 
GRAGSARLSF VLILLASILW LPVTRRWNAR AHLCWSTNVY LYVVYLVFMF EWTFASPLGA
MGRAGALSLW GLEVLAALLG CAYLWELCDA MGSEQWRRRV SDGVEPARAA DAGIRPFVSL
HVPAHEEPPE MVIETLESLR GLDYEHYEII AIDDNTTDES LWRPVEAWCA AHGVKFAHLE
DWPGYKSGAL NYALREMTDD RAELIGVVDS DYQLEPDFLA RCAPLFADPR VGFIQSPQDY
RDWEGAPFYR RLYYSYKYFF SVSQPSRNER DGAIFAGTMG LIRRQALEDV GGWDEWCITE
DAELSLRVLR AGWSGMHVDA SFGHGVMPLT FEALKGQRFR WCFGGIQILR MHWRSLLPGL
RGGGNRLTLG QRWAYLSGGL QWYGDLVGVL FFLFLLGGAT NLALGGGLLF RKLSPFLLAV
IPLLVLLGFL RAVSLIRRGT GAGWSDAIGA FLIWQSTTLV VARASVQGLF ARKAEFLRTP
KTEEGLNLWK ALSANKGEVL LAVLGLAGIV VGLLHIDTLS GVLTTGLLVL PTLAFASAPA
NSIAAQRAAL PAVLAERRRL ESRRSLAVRG STAAATIGIA AAASVVLALV APGGEDVRPP
SLVRPGREPT VTPGPSPTDD PAGPPSTSGP ASPSPRASSP ASAPAGSPPA TGSSASSSGS
PSSSPSSSPS SSPSSSPSAS ATRSPATSPS ASPTSTAPTP SPSPTSPSSP SP