Gene Information |
Locus tag | Noca_2966 |
Symbol | |
ID | 4595750 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 3151249 |
End bp | 3152475 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 639777571 |
Product | glycosyl transferase family protein |
Protein accession | YP_924155 |
Protein GI | 119717190 |
COG category | [C] Energy production and conversion; [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | [TIGR01426] glycosyltransferase, MGT family |
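The coordinate and length fields above are related by simple arithmetic, which can be used as a consistency check on the record. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Expected protein length in aa for an unspliced CDS.

    Assumes the CDS length is a multiple of 3 and that the final
    codon is a stop codon, which is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Noca_2966.
length_bp = gene_length(3151249, 3152475)
print(length_bp)                  # 1227
print(protein_length(length_bp))  # 408
```

Both results agree with the Gene Length and Protein Length fields in the record.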
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.63894 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAGGTCC TGTGCTACAC CTCGCCCGCT CGCGGGCACC TCTTCCCCAC CGTGCCGATC CTGCTCGAGC TGCGCGGCCG CGGCCACGAC GTCGTCGTCC GGACCCTGGA CGCCGAGGTG GCCCGGCTGC GCGACCTCGG CCTGAAGGCC GAGCCGATCA GCCCGGAGAT CGAGGCGATC GAGCTCGACG ACTACCGGGC CCGCACCAGC CAGGCCGGCC TGAAGCGCAC CGTGCGCACG TTCGCCGCCC GGGCCCGGAT CGAGGTGCCG GAGGTCCAAC GCCTGGTCGC GGAGGAGCAT CCCGACCTGC TGCTGGTCGA CGGCAACACC TGGGGCGCCG CCGCGGTCGC GGAGGCCTCC GGGTTGCCGT GGGCGCTGCT CCAACACTTC CCCACCCCCC TCCCGGCAAG GGACGTGCCA CCGTTCGGTC CGGGTCTGCG GCCCCTCGCC GGGCCGCTCG GCCGCCTGCG CAACCGGCTG CTGCGCCCGG TGGTCCTCGG CGTGGTCGAG CGGGCCTTCC TGCCGCCCCT CAACGAGGTC GTCCGGCCGC TGGCCGGGGC GAGCCCGATC CGCGACGCGC ACGACCTCTA CACCCGGGCC CCGCTGACGC TCTACCCCAC GTCCCGGGCC TTCGAGCACC CGCGCACCGA CTGGCCCGAC TCGTTCTGCT TCGTCGGCCC GCTCGTCTGG GAGCCACCGA CGGCCCCGCC CGAGTGGCTG GCCGAGCTGG GCCGGCCGGT CGCGCTCGTG ACCACGTCCT CGGAGTACCA GGACGACGGC GCGCTCGTCG ACGCCGCGCT CGCCGGCCTC GCTCACGAGG ACCTCGACGT CGTGGCGACC CTGCCCGGCG GGTCGAGGCC CCGCCGCCTG GTACCTGCGA ACGCGCACGT CGAGCAGTTC GTGCCGCACT CGCCCGTCCT GGCCCGGGCG GCGGTCGCCG TCACCCACGG CGGGATGGGC GCCACCCAGA AGGCGCTCGC CGCCGGCGTA CCGGTCGTGG TCGTCCCCTG GGGCCGCGAC CAGGCCGAGG TCGCCCGCCG GGCGGAGGCG ACCGGCGCCG CCGTGCTCCT TCCCCGCCGG CGGTTGTCGC CGACCAGCCT GCGCGACGCC GTACGCCGAG CGCGGCGACT CCGGCCCGCC GCGGTGGCCC TGGCGGCCGC GATGGCGCAG GACGGCGGCG CCCCGCTCGC CGTGGACCGG TTGGAGTCGC TCGCCGGCCG TGGCTAG
|
Protein sequence | MKVLCYTSPA RGHLFPTVPI LLELRGRGHD VVVRTLDAEV ARLRDLGLKA EPISPEIEAI ELDDYRARTS QAGLKRTVRT FAARARIEVP EVQRLVAEEH PDLLLVDGNT WGAAAVAEAS GLPWALLQHF PTPLPARDVP PFGPGLRPLA GPLGRLRNRL LRPVVLGVVE RAFLPPLNEV VRPLAGASPI RDAHDLYTRA PLTLYPTSRA FEHPRTDWPD SFCFVGPLVW EPPTAPPEWL AELGRPVALV TTSSEYQDDG ALVDAALAGL AHEDLDVVAT LPGGSRPRRL VPANAHVEQF VPHSPVLARA AVAVTHGGMG ATQKALAAGV PVVVVPWGRD QAEVARRAEA TGAAVLLPRR RLSPTSLRDA VRRARRLRPA AVALAAAMAQ DGGAPLAVDR LESLAGRG
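The gene and protein sequences above are printed in space-separated blocks of 10 characters, and the gene begins with GTG rather than ATG. The following is a minimal sketch of how such a record could be checked: it strips the block spacing, computes GC content, and translates the CDS. It assumes that the amino acid assignments of translation table 11 match the standard code and that an alternative start codon at the first position is read as methionine; the helper names are hypothetical.

```python
# Standard codon table in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: start codons recognised under translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            # An alternative start codon is still read as methionine.
            protein.append("M")
            continue
        amino_acid = CODON_TABLE.get(codon, "X")  # X marks an unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
prefix = clean_sequence("GTGAAGGTCC TGTGCTACAC CTCGCCCGCT")
print(translate(prefix))  # MKVLCYTSPA
```

The translated prefix matches the first block of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.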