Gene Cagg_2090 details

Gene Information

Locus tag: Cagg_2090
Symbol:
ID: 7267597
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 2556079
End bp: 2559438
Gene Length: 3360 bp
Protein Length: 1119 aa
Translation table: 11
GC content: 56%
IMG OID: 643566924
Product: trehalose synthase
Protein accession: YP_002463413
Protein GI: 219848980
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases; [COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis
TIGRFAM ID: [TIGR02456] trehalose synthase; [TIGR02457] trehalose synthase-fused probable maltokinase
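
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the Start bp and End bp values are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length.

```python
def coordinate_span(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are 1-based and inclusive (assumption)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a complete CDS, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 2556079, 2559438
gene_length_bp, protein_length_aa = 3360, 1119

print(coordinate_span(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Both comparisons hold for this record: the coordinate span is 3360 bp, and 3360 / 3 - 1 gives 1119 residues.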


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCCGCA CACGGCCTGC CAAGACACCG ACGATTATCG CCGATGATCC ACTCTGGTAC 
AAAGACGCCA TCATCTACGA GGTGCATGTC CGCGCGTTCT GCGATAGCAA CGGTGACGGT
ATCGGCGATT TTCCCGGTCT GACCAGCAAA CTCGATTATT TGCAGGATCT CGGCGTCACA
GCAATCTGGT TGCTCCCCTT TTATCCCTCA CCGCTGCGTG ATGACGGGTA CGATATTGCC
GACTATACCA ACATTCACCC CAATTACGGC ACCTTATCCG ATTTCAAGGT CTTCATACGC
GAAGCGCATC GGCGCGGTGT GCGCGTCATT ACCGAGTTAG TCTGTAACCA CACCTCGGAT
CAACATCCGT GGTTTCAACG CGCCCGCCGC GCGAAACCCG GTTCGTCGGC ACGTAATTTT
TATGTGTGGT CGGATACCCC CGACCGCTAT AAAGACGCAC GGATTATTTT TAAAGATTTC
GAGACAAGCA ACTGGACGTG GGACCCGGTG GCCCAAGCCT ACTATTGGCA CCGTTTTTAT
AGCCATCAGC CCGATCTGAA CTTTGAGAAC CCGGCGGTAC AACGCGCCGT GTTTAAAGCG
ATGGAGTTTT GGCTCGATCT CGGTGTTGAC GGTATGCGGC TCGATGCCAT CCCCTATCTC
TACGAAGCCG AAGGTACCAA TTGTGAGAAT CTGCCCGAAA CCCATGCGTT TCTCAAGCGC
CTACGCCGGC ATATGGATGA AAAATACCAC GGCCGTATGT TTCTGGCCGA AGCCAATCAA
TGGCCGGAAG ATGCGGTGGC CTATTTCGGT GATGGCGATG AGTGCCACAT GGCCTTTCAC
TTCCCGGTCA TGCCACGCCT CTTTATGTCG GTTCATCTCG AGGATCGCTA CCCGATCATT
GATATTATGC GGCAGACACC ACCCATTCCT GAGAACTGCC AGTGGGCGAT CTTCTTGCGC
AACCACGATG AGTTGACGCT TGAGATGGTT ACCGATGAAG AACGTGACTA TATGTACCGG
GTATACGCCC GCGATCCACA AGCCCGGATC AATCTCGGCA TTCGTCGCCG ACTCGCGCCG
CTGCTCGGCA ACCATCGCCG TAAAATCGAA TTGATGAACG GTCTCCTCTT CTCATTACCC
GGTACACCGG TGATCTACTA CGGTGATGAG ATCGGGATGG GTGACAATAT TTACCTCGGT
GATCGCAACG GGGTACGCAC ACCGATGCAA TGGAGCGGCG ACCGTAACGC CGGTTTCTCA
CGCGCGAATC CACAGCAGCT CTATTTGCCG GTCATTACCG ATCCGGAATA TCACTACGAG
ACGGTGAACG TCGAGACGCA GAGTGCAAAT CAGCATTCAC TCCTCTGGTG GACAAGACGG
CTGATTGCAC TGCGCAAGCG GTATTCGGCA TTCGGACGCG GTACGCTCGA GTTTCTTTAC
CCCGAAAACC GCAAAGTGCT CTGCTTCTTA CGCAAAACTG CCGATCAGAT TTTACTCGCC
GTCTTCAATC TGTCACGCTT TGTCCAAGGG GTCGAGATCG ATCTCTCACC CTATCGGGGA
TTGATGCCGG TTGAGCTATT TGGGCAAGTA GAGTTTCCGC CTATCGGCGA TCAGCCGTAC
TTTTTAACCC TCGGCCCGCA CAGTTTCTAC TGGTTTACCC TGACCCCACA GCGGGTTGAG
GGGGTACGAG TCACGACGTC ACCACCTGAA ACTGAGCTAA CCGAGATACC GGTTGATGCT
ATCGAGTGGG ATGCGATCTT TTACGATGGT CGGCAAGTCC GTCTCGAACA GATTTTACCC
GATTACCTCC GCTACCGACG CTGGTTTGGC GCCAAGACGC GCAAGATCAA GCAAGTAAAC
ATCATCGAGT TTGCCCGCCT CGATTATGCC GGCGGCCCGG CCTACCTGAC CTTACTGAAC
GTGCAGTATG TCGAGGGCCC CCCCGAGCTG TACATGCTGC CGATGGCTTA CGTCGAGGGA
GAGCGGGCCG ATCAGATACT TGCCGATCAG CGGCATATGG TCATCGCCCG CCTGAGAGTT
GGCCGCCGAC CCGCAGCGGG TATTCTGTAC GATCCGCTTG GTGAGCGTCG GTTTGCTTCG
GCACTTCTCG AGTTGACGAT CGGGCGACGG CGCCTGCGCG GCGAAGCGGG TGGGGAACTG
GTCGGCGGGA CAACCCGTGC GCTGCGTAAA CTGCTCACCG GCAGTGATGG TCTTGAACCG
AGCTTGATGC GCGGTGAGCA GAGCAATTCA TCGATCAACT TTGGCAGTCG CCTGATTATG
AAGCTCTTCC GCAAGATCGA ACCGGGTCGT AATCCCGATC TCGAACTTGG GCGTTTCCTG
ACCGAAGAAG TCGGTTTTCC ACACACACCG CCGGTAGCCG GCTTTATCGA ATATCAGCGT
GGTAAAGACG AACCATTGAC CCTTGCGATT GTGCAGGGCT ATGTTCAAAA TGAAGGAGAC
GCCTTTGATT ACGCCCTCGA TGTGGTGCGC CGGTATTACG ATACGATATT GACCCGCGCC
GATCTGACAC CACCGTCGGT GAAAGCCAGC GTCGCCGAGT TGGTTGCTGC GGCGTTCAAC
CCACCTGCTC CGCTTGCCGA AGAACTCATC GGCGGTTACC TCGAATCGGC ACGGTTGCTT
GGTCAACGTA CCGCCGAGAT GCATCGGGCA TTGGCAAAAG GGAGCGGGCC GGCAATGGCG
CCGGAACCCT TCTCAACCCT CTATCAGCGC TCGATCTACC AGAGTGTCCG CAGCCTGATC
GGACGAACTC TGCAAGACCT GCGCAAACTA TTGCCATCGT TACCACCGGC GGTGCGTCCG
GCTGCCGAAC AAGTTGCGCA GAGTGAAGAA GCGCTGCTCG CCCGCCTTCA TCGCATCACC
GGGGACAAAA TCGAGACGGT TCGCATCCGC ATCCACGGTG ATTACCATCT CGAACAGGTG
CTCTTTACCG GTAAAGACTA CATGATCATT GACTTCGAGG GTGAGCCACT CCGCCCGATC
AGCGAGCGGC GGATCAAGCG CTCACCACTA CGCGATGTCG CCGGGATGTT ACGCTCATAT
CAGTATGCAG CTTACGCTGT GCTCTTCAGC CGCAATGGCA CGACAAACCA TCACGAAGAG
ATCGAACGGC TGCAACAATG GGCCGACTTC TGGAGCTTTT GGGTCTGCGC TGCCTTCCTC
GAAGGCTATC TATCTACGGC CCGCAATGAG CCGTTCATCC CCACCGACCG TGCTGACCTT
GAAGCACTGC TCGAAACCTT CGTGATTGAG AAGGCTATCT ATGAACTCAG TTACGAGATG
AATAACCGAC CCGATTGGCT ACCGATCCCG ATCAACGGCA TTTTACGCCA ATTGGAGTAA
 
Protein sequence
MPRTRPAKTP TIIADDPLWY KDAIIYEVHV RAFCDSNGDG IGDFPGLTSK LDYLQDLGVT 
AIWLLPFYPS PLRDDGYDIA DYTNIHPNYG TLSDFKVFIR EAHRRGVRVI TELVCNHTSD
QHPWFQRARR AKPGSSARNF YVWSDTPDRY KDARIIFKDF ETSNWTWDPV AQAYYWHRFY
SHQPDLNFEN PAVQRAVFKA MEFWLDLGVD GMRLDAIPYL YEAEGTNCEN LPETHAFLKR
LRRHMDEKYH GRMFLAEANQ WPEDAVAYFG DGDECHMAFH FPVMPRLFMS VHLEDRYPII
DIMRQTPPIP ENCQWAIFLR NHDELTLEMV TDEERDYMYR VYARDPQARI NLGIRRRLAP
LLGNHRRKIE LMNGLLFSLP GTPVIYYGDE IGMGDNIYLG DRNGVRTPMQ WSGDRNAGFS
RANPQQLYLP VITDPEYHYE TVNVETQSAN QHSLLWWTRR LIALRKRYSA FGRGTLEFLY
PENRKVLCFL RKTADQILLA VFNLSRFVQG VEIDLSPYRG LMPVELFGQV EFPPIGDQPY
FLTLGPHSFY WFTLTPQRVE GVRVTTSPPE TELTEIPVDA IEWDAIFYDG RQVRLEQILP
DYLRYRRWFG AKTRKIKQVN IIEFARLDYA GGPAYLTLLN VQYVEGPPEL YMLPMAYVEG
ERADQILADQ RHMVIARLRV GRRPAAGILY DPLGERRFAS ALLELTIGRR RLRGEAGGEL
VGGTTRALRK LLTGSDGLEP SLMRGEQSNS SINFGSRLIM KLFRKIEPGR NPDLELGRFL
TEEVGFPHTP PVAGFIEYQR GKDEPLTLAI VQGYVQNEGD AFDYALDVVR RYYDTILTRA
DLTPPSVKAS VAELVAAAFN PPAPLAEELI GGYLESARLL GQRTAEMHRA LAKGSGPAMA
PEPFSTLYQR SIYQSVRSLI GRTLQDLRKL LPSLPPAVRP AAEQVAQSEE ALLARLHRIT
GDKIETVRIR IHGDYHLEQV LFTGKDYMII DFEGEPLRPI SERRIKRSPL RDVAGMLRSY
QYAAYAVLFS RNGTTNHHEE IERLQQWADF WSFWVCAAFL EGYLSTARNE PFIPTDRADL
EALLETFVIE KAIYELSYEM NNRPDWLPIP INGILRQLE
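
The sequences above are printed in blocks of ten characters separated by spaces. A minimal sketch of how such a block could be cleaned and summarized follows. The helper names are hypothetical, the stop-codon set is the one used by translation table 11 (TAA, TAG, TGA), and only the first line of the gene sequence is used as sample input.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons of translation table 11


def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a table 11 stop codon."""
    return seq[-3:] in STOP_CODONS


# First line of the gene sequence above, used only as sample input.
first_line = "ATGCCCCGCA CACGGCCTGC CAAGACACCG ACGATTATCG CCGATGATCC ACTCTGGTAC "
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2), ends_with_stop(seq))
```

Running the same helpers on the full gene sequence would allow the reported gene length and GC content to be compared against the listing itself.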