Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_4161 |
Symbol | |
ID | 4596675 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 4395081 |
End bp | 4396085 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639778767 |
Product | UDP-galactose 4-epimerase |
Protein accession | YP_925345 |
Protein GI | 119718380 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1087] UDP-glucose 4-epimerase |
TIGRFAM ID | [TIGR01179] UDP-glucose-4-epimerase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00395253 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGGTTC TCGTCACCGG CGGCGCCGGC TACATCGGCT CGACCACCGC CAAGGCGCTC GAGGAGGCCG GCCACACGCC GGTGATCCTC GACTCGCTGC TCACCGGCCC CCTGGCGTTC GTGCGCGACC GGATCTTCTA CGAGGGCGAC ATCGCCGACC GGGCCCTGGT CCGCCGGGTC TTCGACGAGC ACCCCGACAT CGACGCCACC ATCCACATGG CCGCCCGGAT CGTGGTGCCC GAGTCGGTCG AGAAGCCCTA CGAGTACTAC CGGGACAACG TCGCCAAGTC CCTGGAGCTC TTCGACGAGC TCAACACCCT CGGCAAGGGC CGGGTGCTGT TCTCCTCCTC CGCGTCGATC TACGCCCTCA AGGACGACTT CGAGGTCTCC GAGGGCGACC GGCTCGAGCC GGCCTCGCCG TACGCCCGGA CCAAGCGGAT GATGGAGGAG GTGCTCCAGG ACATGTCGGC GGCGACCGAC CTGCGGGCGA TCATCCTGCG CTACTTCAAC CCGATCGGCT CCGACCCGGA CCTGGAGTCC GGCATCTACG CCAAGGAGCC CTCGCACGTG CTCGGCCAGC TGGTGATGGC CGCGCGCGGG CAGAAGGACG CCTTCACGAT CACCGGCACC GACCACCCCA CCCGCGACGG CACCGGCATC CGCGACTACA TCCACGTCTG GGACCTGGCC CGCGCGCACG TCCGCGCGGT CGAGCGGTTC GACGAGGTGA TCGACGCCGC CGGCGAGCCC AGCGTGATCA TCAACGTCGG CACCGGCTCC GGGGTGACCG TGCGCGAGCT GGTCACCGCG TTCCAGAACG TGTTCGGCCA GGAGGTGCCG GTCCGGGAGG CGCCGCCACG CCCCGGCGAC GCGGTGGGGG CGTTCGCGAA CGTCGACCGC TCCGGGCGGC TGCTCGACTG GCGCACCGAG CTCTCCCTCG AGGACGCGAT CGCCTCGGCG CTGGCGTGGG GCGCGAAGCG CCAGGAGATC CTGGGCTACG AGTGA
|
Protein sequence | MKVLVTGGAG YIGSTTAKAL EEAGHTPVIL DSLLTGPLAF VRDRIFYEGD IADRALVRRV FDEHPDIDAT IHMAARIVVP ESVEKPYEYY RDNVAKSLEL FDELNTLGKG RVLFSSSASI YALKDDFEVS EGDRLEPASP YARTKRMMEE VLQDMSAATD LRAIILRYFN PIGSDPDLES GIYAKEPSHV LGQLVMAARG QKDAFTITGT DHPTRDGTGI RDYIHVWDLA RAHVRAVERF DEVIDAAGEP SVIINVGTGS GVTVRELVTA FQNVFGQEVP VREAPPRPGD AVGAFANVDR SGRLLDWRTE LSLEDAIASA LAWGAKRQEI LGYE
|
| |