Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_1997 |
Symbol | |
ID | 7268913 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | - |
Start bp | 2441979 |
End bp | 2443931 |
Gene Length | 1953 bp |
Protein Length | 650 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643566828 |
Product | polysaccharide biosynthesis protein CapD |
Protein accession | YP_002463321 |
Protein GI | 219848888 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1086] Predicted nucleoside-diphosphate sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.521686 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAAGCTCG AATTATCCTC ACCAACCCGC AATCGTTATT TTTTCTTGCT TGATACGATC TTGTTACCTA TTGCCGCCTA CTTGAGCTTT GTGGTACGTC TTGACGAATT ACCGTCAGGC GCAGCACTTA TGGGTTGGTT TATACTGGCA GCGCTTGCCA CACCGATCCA CCTCTTTGTC TTTCGACAAT TGGGTGTCTA TGCACGCTAT TGGCGCTACG CTTCCCTTGA CGAACTACTA TTGCTTATTT CGGCGGTGTC ACTCGCTATG ATCATCGCCG CACCAACCAC GTTCATCGTT GCCGGTATCA TACCAATGGC TCTCCTCCCG CGGTCGGTAC CGGTTATCTT CTTCTGTTTT AACCTCGTTA TGACGATTGG GCCACGCCTG TTATCACGCC TGCACTGGCA CCATCAGCTA ATGCAGCGCA AGCGTAAGGG TGAAATCAAC GGTAAGCAGC AACGAGTGTT GATTATGGGT GCCGGCTCGG CTGGTACGAT GATCGCGCGT GAATTGCGTG ATAATCCGCA ACTTGGTATG GTAGCGGTTG GGTTTCTCGA TGATGACCCG CTTAAGTTCG GTATGTGCAT CTATGGCATA CCGGTACTCG GAAATCGCTT CGACATACCG CGCCTGGCCC GTGAATATAG TGCGCATCTG GTCATCATCG CGATGCCTTC GGCCACCGGC AAAGAGATTC GTAGCATTGT CGAACTCTGT GAGCGAACCG GGGTCAAAAC CAAAATTATG CCCGGTCTCT ACGAAATGCT CGACGGGAAG GTTAGCGTTA ATCAATTGCG TAACGTGCAG ATTGAAGACC TATTGCGTCG GAAACCGGTG CAGACCGACA TTGTCGCCGT CCATAATCTG TTGCGTGGCA AGCGCGTGAT GGTGACCGGT GGGGGTGGCT CGATCGGTTC CGAACTCTGC CGTCAGATCC TGCGCGCCGA GCCGGCCGAG TTGATAATCC TCGGTCACGG CGAAAACTCG GTGTTTACCA TCGAACAGGA GCTACGCCGG CAAGTAACGT CTGCTACCCG TCTCACTACG ATCATCGCCG ATATTCGCTT TGCCGAGCGT GTTATGCATA TCTTTGAACA ATACCAACCC GAGATCGTTT TTCACGCGGC AGCGCACAAG CACGTGCCGT TGATGGAATT GCATCCGTCT GAAGCGGTGA CCAATAACGT ACTCGGGACG CGCAATCTCC TGAGCGCGGC AATGCAGGTT AACGTGAGCC ACTTTGTGAT GATCTCGAGC GACAAAGCTG TTAATCCCAC CAGTGTGATG GGGGCCACGA AACGGGTAGC CGAGCTGCTG GTTCACGAAG CGGCCCGATT AACCGGACGA GCATACGTCG CAGTACGTTT CGGTAATGTG CTCGGTTCAC GCGGATCGGT CGTATTGACG TTCAAGCAGC AGATCGCCGC CGGTGGTCCG GTCACCGTTA CCCATCCGGA GATGCGCCGG TTCTTTATGA CCATCCCTGA AGCGGTGCAA TTGACACTCC AAGCCTCGGT GTTGGGGAAG GGCGGCGAAG TGTTTGTCCT CGATATGGGT GAACCGATCC GCATCGTCGA TCTCGCGCGC GATATGATCG AGTTGTCCGG TTTACAGGTC GGACGCGATA TTGATATCGT CTTTACCGGC CTCCGTCCCG GCGAGAAGCT GTACGAAGAG CTGTTTATCG AGGGGGAAGA ATATCAGCGC ACCGAGCACG CCAAAATCTT TATCGCACGC AACGCTTCAC AGTTTGTCCC TCGCGCACTC GCCGATCAGA TCCGTATCCT TGAGATGGCA GCCTTTCATG AAGATACGGC CTTGCTCCTC CGTACCTTGC ATCGGCTCGT TCCGACCTTC AAACAACCGA TGCCACTACC GATGACTGAA CCCAAGCCGC GCGAACAGGC TGTGGGTGAA CCGCTGTGGA AACGACAGAT GGCAAGCGAT TAA
|
Protein sequence | MKLELSSPTR NRYFFLLDTI LLPIAAYLSF VVRLDELPSG AALMGWFILA ALATPIHLFV FRQLGVYARY WRYASLDELL LLISAVSLAM IIAAPTTFIV AGIIPMALLP RSVPVIFFCF NLVMTIGPRL LSRLHWHHQL MQRKRKGEIN GKQQRVLIMG AGSAGTMIAR ELRDNPQLGM VAVGFLDDDP LKFGMCIYGI PVLGNRFDIP RLAREYSAHL VIIAMPSATG KEIRSIVELC ERTGVKTKIM PGLYEMLDGK VSVNQLRNVQ IEDLLRRKPV QTDIVAVHNL LRGKRVMVTG GGGSIGSELC RQILRAEPAE LIILGHGENS VFTIEQELRR QVTSATRLTT IIADIRFAER VMHIFEQYQP EIVFHAAAHK HVPLMELHPS EAVTNNVLGT RNLLSAAMQV NVSHFVMISS DKAVNPTSVM GATKRVAELL VHEAARLTGR AYVAVRFGNV LGSRGSVVLT FKQQIAAGGP VTVTHPEMRR FFMTIPEAVQ LTLQASVLGK GGEVFVLDMG EPIRIVDLAR DMIELSGLQV GRDIDIVFTG LRPGEKLYEE LFIEGEEYQR TEHAKIFIAR NASQFVPRAL ADQIRILEMA AFHEDTALLL RTLHRLVPTF KQPMPLPMTE PKPREQAVGE PLWKRQMASD
|
| |