Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_1338 |
Symbol | |
ID | 4598577 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 1408004 |
End bp | 1410322 |
Gene Length | 2319 bp |
Protein Length | 772 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 639775933 |
Product | glycosyl transferase family protein |
Protein accession | YP_922539 |
Protein GI | 119715574 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTCCGCCC TGCTGCTGGC GGCGATCGTC GCCGCCGCGA TCTCGACCGG GTCCATGTTC GTCGGACGGG CCCGCGACCG GGCGGTCGGT CGCGTGATGT GGACCCGGCG GCTGCTCTAT GCCTGCGTCG GGACCGCGAT CGTCGCCGCG GTCGCCTCGT GGGTCGCATT CCTGGTGGAC GGGCGCGCGG GATCCGCTCG CCTCTCGTTC GTGCTGATCC TGCTCGCCAG CATCCTCTGG TTGCCGGTGA CCCGGCGATG GAACGCGCGG GCGCACCTGT GCTGGTCGAC CAACGTCTAC CTGTACGTCG TCTACCTGGT GTTCATGTTC GAGTGGACCT TCGCCAGTCC GTTGGGCGCG ATGGGCCGGG CGGGGGCGCT GTCGCTGTGG GGGCTCGAGG TCCTCGCGGC GCTTCTCGGC TGTGCCTACC TCTGGGAGCT GTGCGACGCC ATGGGCAGCG AGCAGTGGCG ACGTCGGGTA TCGGACGGCG TGGAGCCGGC GCGAGCAGCG GACGCCGGCA TCCGGCCCTT CGTGAGCCTG CACGTCCCGG CGCACGAGGA GCCCCCCGAG ATGGTGATCG AGACGCTGGA GTCGCTGCGC GGGCTCGACT ACGAGCACTA CGAGATCATC GCGATCGACG ACAACACGAC CGACGAGTCG CTCTGGCGCC CGGTGGAGGC CTGGTGCGCC GCGCACGGCG TGAAGTTCGC GCACCTCGAG GACTGGCCCG GCTACAAGTC CGGAGCGCTC AACTACGCGC TGCGCGAGAT GACCGACGAC CGGGCCGAGC TGATCGGCGT GGTCGACTCC GACTACCAGC TGGAGCCGGA CTTCCTCGCC CGGTGCGCTC CGCTGTTCGC CGACCCGCGG GTGGGCTTCA TCCAGTCGCC GCAGGACTAC CGCGACTGGG AGGGCGCGCC GTTCTACCGA CGGCTGTACT ACTCCTACAA GTACTTCTTC TCGGTCTCGC AGCCCTCCCG CAACGAGCGG GACGGGGCGA TCTTCGCGGG CACCATGGGC CTGATCCGCC GTCAGGCGCT CGAGGACGTC GGCGGCTGGG ACGAATGGTG CATCACCGAG GACGCCGAGC TCTCCCTGCG GGTGCTCCGG GCGGGCTGGT CCGGCATGCA CGTCGACGCG TCGTTCGGCC ACGGCGTAAT GCCGCTGACC TTCGAGGCAC TCAAGGGACA GCGGTTCCGC TGGTGCTTCG GCGGCATCCA GATCCTGCGG ATGCACTGGC GCTCGCTGCT GCCGGGCCTC CGGGGCGGGG GCAACCGGCT GACCCTGGGG CAGCGCTGGG CCTACCTCAG CGGCGGGCTG CAGTGGTACG GCGACCTGGT CGGCGTGCTG TTCTTCCTGT TCCTGCTCGG CGGCGCCACC AACCTCGCAC TCGGCGGCGG CCTGCTCTTC CGCAAGCTCT CGCCGTTCCT CCTCGCCGTG ATCCCGCTGC TCGTGCTCCT GGGCTTCCTG CGTGCCGTGT CCCTGATCCG CCGGGGGACC GGCGCCGGCT GGTCGGACGC GATCGGGGCG TTCCTGATCT GGCAGTCGAC GACCCTGGTC GTGGCCCGTG CGTCGGTGCA GGGATTGTTC GCGCGCAAGG CCGAGTTCCT CCGGACGCCG AAGACGGAGG AGGGGCTCAA CCTCTGGAAG GCGCTCTCCG CCAACAAGGG CGAGGTCCTG CTCGCCGTGC TCGGGCTGGC CGGCATCGTC GTCGGCCTTC TGCACATCGA CACCCTCAGC GGGGTCCTCA CCACGGGGCT CCTCGTGCTC CCGACCCTCG CGTTCGCGAG CGCGCCCGCC AACAGCATCG CCGCGCAGCG CGCCGCCCTG CCGGCCGTGC TCGCCGAGCG CCGCCGCCTG GAGTCGCGTC GTTCGTTGGC CGTGCGCGGC AGCACGGCCG CCGCCACCAT CGGCATCGCT GCAGCCGCCA GCGTGGTGCT GGCGCTGGTC GCCCCGGGAG GCGAGGACGT GCGGCCGCCG AGCCTGGTGC GTCCCGGGCG CGAACCGACC GTGACGCCCG GTCCGAGCCC CACGGACGAC CCGGCCGGGC CGCCGAGCAC CTCCGGGCCG GCGTCGCCGT CCCCGCGCGC TTCGTCACCG GCGAGCGCGC CTGCGGGCAG CCCGCCCGCG ACCGGCTCGT CGGCGTCGTC CAGCGGCTCA CCCAGCTCAT CACCGAGCTC CTCGCCGAGC TCCTCGCCGA GTTCCTCGCC GAGCGCGTCG GCGACCCGGT CGCCCGCGAC CTCGCCGAGC GCGTCCCCGA CCTCGACGGC TCCGACGCCC TCGCCCAGCC CGACCTCGCC CTCGAGCCCG TCCCCCTGA
|
Protein sequence | MSALLLAAIV AAAISTGSMF VGRARDRAVG RVMWTRRLLY ACVGTAIVAA VASWVAFLVD GRAGSARLSF VLILLASILW LPVTRRWNAR AHLCWSTNVY LYVVYLVFMF EWTFASPLGA MGRAGALSLW GLEVLAALLG CAYLWELCDA MGSEQWRRRV SDGVEPARAA DAGIRPFVSL HVPAHEEPPE MVIETLESLR GLDYEHYEII AIDDNTTDES LWRPVEAWCA AHGVKFAHLE DWPGYKSGAL NYALREMTDD RAELIGVVDS DYQLEPDFLA RCAPLFADPR VGFIQSPQDY RDWEGAPFYR RLYYSYKYFF SVSQPSRNER DGAIFAGTMG LIRRQALEDV GGWDEWCITE DAELSLRVLR AGWSGMHVDA SFGHGVMPLT FEALKGQRFR WCFGGIQILR MHWRSLLPGL RGGGNRLTLG QRWAYLSGGL QWYGDLVGVL FFLFLLGGAT NLALGGGLLF RKLSPFLLAV IPLLVLLGFL RAVSLIRRGT GAGWSDAIGA FLIWQSTTLV VARASVQGLF ARKAEFLRTP KTEEGLNLWK ALSANKGEVL LAVLGLAGIV VGLLHIDTLS GVLTTGLLVL PTLAFASAPA NSIAAQRAAL PAVLAERRRL ESRRSLAVRG STAAATIGIA AAASVVLALV APGGEDVRPP SLVRPGREPT VTPGPSPTDD PAGPPSTSGP ASPSPRASSP ASAPAGSPPA TGSSASSSGS PSSSPSSSPS SSPSSSPSAS ATRSPATSPS ASPTSTAPTP SPSPTSPSSP SP
|
| |