Gene Cagg_1997 details


Gene Information

Locus tag: Cagg_1997
Symbol:
ID: 7268913
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 2441979
End bp: 2443931
Gene Length: 1953 bp
Protein Length: 650 aa
Translation table: 11
GC content: 54%
IMG OID: 643566828
Product: polysaccharide biosynthesis protein CapD
Protein accession: YP_002463321
Protein GI: 219848888
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1086] Predicted nucleoside-diphosphate sugar epimerases
TIGRFAM ID:
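
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which is consistent with the reported gene length) and that the final codon is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2441979, 2443931)
print(length)                  # 1953, matching "Gene Length"
print(protein_length(length))  # 650, matching "Protein Length"
```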


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.521686
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAAGCTCG AATTATCCTC ACCAACCCGC AATCGTTATT TTTTCTTGCT TGATACGATC 
TTGTTACCTA TTGCCGCCTA CTTGAGCTTT GTGGTACGTC TTGACGAATT ACCGTCAGGC
GCAGCACTTA TGGGTTGGTT TATACTGGCA GCGCTTGCCA CACCGATCCA CCTCTTTGTC
TTTCGACAAT TGGGTGTCTA TGCACGCTAT TGGCGCTACG CTTCCCTTGA CGAACTACTA
TTGCTTATTT CGGCGGTGTC ACTCGCTATG ATCATCGCCG CACCAACCAC GTTCATCGTT
GCCGGTATCA TACCAATGGC TCTCCTCCCG CGGTCGGTAC CGGTTATCTT CTTCTGTTTT
AACCTCGTTA TGACGATTGG GCCACGCCTG TTATCACGCC TGCACTGGCA CCATCAGCTA
ATGCAGCGCA AGCGTAAGGG TGAAATCAAC GGTAAGCAGC AACGAGTGTT GATTATGGGT
GCCGGCTCGG CTGGTACGAT GATCGCGCGT GAATTGCGTG ATAATCCGCA ACTTGGTATG
GTAGCGGTTG GGTTTCTCGA TGATGACCCG CTTAAGTTCG GTATGTGCAT CTATGGCATA
CCGGTACTCG GAAATCGCTT CGACATACCG CGCCTGGCCC GTGAATATAG TGCGCATCTG
GTCATCATCG CGATGCCTTC GGCCACCGGC AAAGAGATTC GTAGCATTGT CGAACTCTGT
GAGCGAACCG GGGTCAAAAC CAAAATTATG CCCGGTCTCT ACGAAATGCT CGACGGGAAG
GTTAGCGTTA ATCAATTGCG TAACGTGCAG ATTGAAGACC TATTGCGTCG GAAACCGGTG
CAGACCGACA TTGTCGCCGT CCATAATCTG TTGCGTGGCA AGCGCGTGAT GGTGACCGGT
GGGGGTGGCT CGATCGGTTC CGAACTCTGC CGTCAGATCC TGCGCGCCGA GCCGGCCGAG
TTGATAATCC TCGGTCACGG CGAAAACTCG GTGTTTACCA TCGAACAGGA GCTACGCCGG
CAAGTAACGT CTGCTACCCG TCTCACTACG ATCATCGCCG ATATTCGCTT TGCCGAGCGT
GTTATGCATA TCTTTGAACA ATACCAACCC GAGATCGTTT TTCACGCGGC AGCGCACAAG
CACGTGCCGT TGATGGAATT GCATCCGTCT GAAGCGGTGA CCAATAACGT ACTCGGGACG
CGCAATCTCC TGAGCGCGGC AATGCAGGTT AACGTGAGCC ACTTTGTGAT GATCTCGAGC
GACAAAGCTG TTAATCCCAC CAGTGTGATG GGGGCCACGA AACGGGTAGC CGAGCTGCTG
GTTCACGAAG CGGCCCGATT AACCGGACGA GCATACGTCG CAGTACGTTT CGGTAATGTG
CTCGGTTCAC GCGGATCGGT CGTATTGACG TTCAAGCAGC AGATCGCCGC CGGTGGTCCG
GTCACCGTTA CCCATCCGGA GATGCGCCGG TTCTTTATGA CCATCCCTGA AGCGGTGCAA
TTGACACTCC AAGCCTCGGT GTTGGGGAAG GGCGGCGAAG TGTTTGTCCT CGATATGGGT
GAACCGATCC GCATCGTCGA TCTCGCGCGC GATATGATCG AGTTGTCCGG TTTACAGGTC
GGACGCGATA TTGATATCGT CTTTACCGGC CTCCGTCCCG GCGAGAAGCT GTACGAAGAG
CTGTTTATCG AGGGGGAAGA ATATCAGCGC ACCGAGCACG CCAAAATCTT TATCGCACGC
AACGCTTCAC AGTTTGTCCC TCGCGCACTC GCCGATCAGA TCCGTATCCT TGAGATGGCA
GCCTTTCATG AAGATACGGC CTTGCTCCTC CGTACCTTGC ATCGGCTCGT TCCGACCTTC
AAACAACCGA TGCCACTACC GATGACTGAA CCCAAGCCGC GCGAACAGGC TGTGGGTGAA
CCGCTGTGGA AACGACAGAT GGCAAGCGAT TAA
 
Protein sequence
MKLELSSPTR NRYFFLLDTI LLPIAAYLSF VVRLDELPSG AALMGWFILA ALATPIHLFV 
FRQLGVYARY WRYASLDELL LLISAVSLAM IIAAPTTFIV AGIIPMALLP RSVPVIFFCF
NLVMTIGPRL LSRLHWHHQL MQRKRKGEIN GKQQRVLIMG AGSAGTMIAR ELRDNPQLGM
VAVGFLDDDP LKFGMCIYGI PVLGNRFDIP RLAREYSAHL VIIAMPSATG KEIRSIVELC
ERTGVKTKIM PGLYEMLDGK VSVNQLRNVQ IEDLLRRKPV QTDIVAVHNL LRGKRVMVTG
GGGSIGSELC RQILRAEPAE LIILGHGENS VFTIEQELRR QVTSATRLTT IIADIRFAER
VMHIFEQYQP EIVFHAAAHK HVPLMELHPS EAVTNNVLGT RNLLSAAMQV NVSHFVMISS
DKAVNPTSVM GATKRVAELL VHEAARLTGR AYVAVRFGNV LGSRGSVVLT FKQQIAAGGP
VTVTHPEMRR FFMTIPEAVQ LTLQASVLGK GGEVFVLDMG EPIRIVDLAR DMIELSGLQV
GRDIDIVFTG LRPGEKLYEE LFIEGEEYQR TEHAKIFIAR NASQFVPRAL ADQIRILEMA
AFHEDTALLL RTLHRLVPTF KQPMPLPMTE PKPREQAVGE PLWKRQMASD
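
The gene sequence begins with TTG while the protein begins with M. Under translation table 11 (the bacterial code named in the record), TTG can act as an initiation codon and is then read as methionine. The sketch below illustrates this with a hand-built codon table. The `translate` function and the `START_CODONS` set are hypothetical helpers written for this example, and the set lists only three common bacterial start codons, not necessarily every start codon the table allows.

```python
# Standard codon-to-amino-acid assignments, with bases ordered T, C, A, G.
# Table 11 uses the same assignments; it differs in which codons may start.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))

# Assumption: a subset of start codons, enough for this record.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a CDS, reading a recognized start codon as M and
    stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base spacing and newlines
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("TTGAAGCTCG AATTATCCTC ACCAACCCGC"))  # MKLELSSPTR
```

The output matches the first ten residues of the protein sequence listed above.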