Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_4184 |
Symbol | |
ID | 4596698 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 4422049 |
End bp | 4423881 |
Gene Length | 1833 bp |
Protein Length | 610 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 639778790 |
Product | polysaccharide biosynthesis protein CapD |
Protein accession | YP_925368 |
Protein GI | 119718403 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1086] Predicted nucleoside-diphosphate sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCGACT TGGTCGCGCG TATGAGGCGG CGCCGCGCGC CGCTCGCCAT CGCGTTCGAC GTGTGTGCCT GGTTGCTCGG CTACGCGGCG TTCGCCTGGC TTCGGCTCGA CAGTGACGCC TCGAGCGTGC CGTGGCCGGA GGTGCTCGCC TTCGCGACCG TGACTGCGAC GCTGTACGTG GCACTGGCGG CGCCCCTCCG GCTCCACCAG GGCCGCGCCC GCACCGCGAG CCTGGAGGAG ATGGTGCTCC TCGGGATGCT GATGGGCGGC GTCGGTGGCG GCGTCTTCCT GGTGAACCTG TTCGGCCAGT GGATCCCCCG CAGCATCCCG GCCGGGGCGA CCTGCGGTGC GCTCGTGCTC GCCGCGTGGG CGCGGGCGAG CTGGCGGACC CTGCAGGAGC GCGACGAGCG CACCGGCGCC GGTGAGGACA CCGAGCGGAC GCTGGTGATG GGCGCGGGTG AGGCCGGCCG CGAGCTGATC ACCTCGATGC AGCGCGACCC GCTGCGGCGC TACCTGCCGG TCGGACTGCT CGACGACGAT CCCTACAAGC GGCACCGCCG GCTGCGCGGC GTACCCGTCC TCGGGACCAG CTGCGACCTG GAGAAGGAGG TCGCGCGCAC CGGGGCGACC ATGGTCGTGA TCGCGATCCC CAGCGCCAGT GCCGAGACGG TCAACCGGCT CCGGCTCACG GCTCTGGACG CCCATGTGTC GGTCAAGGTG TTGCCGTCCA CCACGCAGCT CCTCACCGAC AGCGTCGGCA TCCGCGACCT GCGGGACATC AACATCACCG ACGTACTCGG CCGCAACCAG CTGGACACGG ACGTCGCCTC GATCGCCGGC TACCTGGCCG GGCGGAAGGT GCTGGTCACC GGTGCCGGCG GCTCCATCGG CTCCGAGCTG TGCCGCCAGA TCTACCGCTA CCAGCCGGCC GAGCTGATGA TGCTCGACCG TGACGAGTCC GCGCTCCACT CGGTCCAGCT CTCGATCCAC GGCCGTGCCC TGCTGGACTC GGACGACGTC ATCCTGTGCG ACATCCGGGA CGAGAAGGCG GTGCGAACCA TCTTCGCGAA CCGTCGCCCC GACGTCGTCT TCCACGCCGC CGCGCTCAAG CACCTGCCGA TGCTCGAGCA GTACCCGGCC GAGGCCGTGA AGACCAACGT GATCGGCACC CGCACGGTTC TCGATGCCGC AGACCTCGTC GGTGTCGACA GGTTCGTGAA CATCTCCACG GACAAGGCGG CGAACCCGTC CAGCGTCCTG GGCTACTCCA AGCGGGTCGC CGAGCGGATC ACAGCCGCCC AGGCGCGCGA GGCTTCCGGG ACGTATCTCT CGGTGCGCTT CGGGAACGTG CTCGGCAGCC GTGGCTCGGT GCTGGCGGCG TTCGCCCGGC AGATCGCCGC CGGTGGGCCG ATCACGGTCA CCCACCCGGA CGTCAGCCGC TTCTTCATGA CGATCGAGGA GGCCTGCCAG CTGGTCATCC AGGCGGCTGC GATCGGCGGG CCGGGGGAGG CGCTCGTCCT CGACATGGGC GAGCCGGTGA AGATCGTGGA CGTCGCCGAG CAGCTGATCG AGCAGGCCGG CACGCCGGTG CCGATCGAAT ACACCGGGCT GCGCGAGGGC GAGAAGTTGC ACGAGGAGCT CTTCGGCGAA GGCGAGCCGT GCGACGTCCG GCCGCGGCAC CCGCTGGTCT CGCACGTGCC GGTGCCACCG ATCACCGACG GCGAGGTCCT GGGCCTCACC CTCGTCGGCG AGCCTGACGA CGTGCGGCAG GCGCTGCACG ACGCGTGCCT GGTGTCGATC GAGGCCGACG ACCCGTCCTC GCTGCGGAAC TGA
|
Protein sequence | MSDLVARMRR RRAPLAIAFD VCAWLLGYAA FAWLRLDSDA SSVPWPEVLA FATVTATLYV ALAAPLRLHQ GRARTASLEE MVLLGMLMGG VGGGVFLVNL FGQWIPRSIP AGATCGALVL AAWARASWRT LQERDERTGA GEDTERTLVM GAGEAGRELI TSMQRDPLRR YLPVGLLDDD PYKRHRRLRG VPVLGTSCDL EKEVARTGAT MVVIAIPSAS AETVNRLRLT ALDAHVSVKV LPSTTQLLTD SVGIRDLRDI NITDVLGRNQ LDTDVASIAG YLAGRKVLVT GAGGSIGSEL CRQIYRYQPA ELMMLDRDES ALHSVQLSIH GRALLDSDDV ILCDIRDEKA VRTIFANRRP DVVFHAAALK HLPMLEQYPA EAVKTNVIGT RTVLDAADLV GVDRFVNIST DKAANPSSVL GYSKRVAERI TAAQAREASG TYLSVRFGNV LGSRGSVLAA FARQIAAGGP ITVTHPDVSR FFMTIEEACQ LVIQAAAIGG PGEALVLDMG EPVKIVDVAE QLIEQAGTPV PIEYTGLREG EKLHEELFGE GEPCDVRPRH PLVSHVPVPP ITDGEVLGLT LVGEPDDVRQ ALHDACLVSI EADDPSSLRN
|
| |