Gene Cagg_2092 details

Gene Information

Locus tag: Cagg_2092
Symbol:
ID: 7267599
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 2561737
End bp: 2563515
Gene Length: 1779 bp
Protein Length: 592 aa
Translation table: 11
GC content: 58%
IMG OID: 643566926
Product: malto-oligosyltrehalose trehalohydrolase
Protein accession: YP_002463415
Protein GI: 219848982
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0296] 1,4-alpha-glucan branching enzyme
TIGRFAM ID: [TIGR02402] malto-oligosyltrehalose trehalohydrolase
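
The coordinates and lengths listed above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function name `check_gene_record` is hypothetical and not part of any database tool.

```python
def check_gene_record(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check the coordinate and length fields of a CDS record.

    Assumes 1-based inclusive coordinates and a gene length that
    includes the terminal stop codon (both are assumptions).
    """
    span = end_bp - start_bp + 1            # inclusive span on the replicon
    codons, remainder = divmod(gene_length_bp, 3)
    return {
        "span_matches_gene_length": span == gene_length_bp,
        "length_is_multiple_of_3": remainder == 0,
        # One codon is the stop codon, so it encodes no amino acid.
        "protein_length_matches": codons - 1 == protein_length_aa,
    }


# Values copied from the Gene Information fields of this record.
result = check_gene_record(2561737, 2563515, 1779, 592)
for name, ok in result.items():
    print(f"{name}: {ok}")
```

For this record all three checks hold: 2563515 - 2561737 + 1 = 1779, and 1779 / 3 = 593 codons, one of which is the stop codon, leaving 592 amino acids.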


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCAATC TGCCACAACC CGGTGCGGTT TACCGCAGTG ATGGTACAAC TGCCTTTACC 
TTATGGGCGC CAACTGCCGC GACCGTTGAA CTGATCGTTC TTGATCCGAA ACCCCGGACA
GTGACGATGA AACCGATCGG TTATGACTGC TGGCACGTAG TCACGGAAGC GCCACCCGGC
ACACGCTACC GCTACCGACT CGATGGTCAG CGGGAGCGAC CCGATCCGGC ATCGCGCTGC
CAACCCGAAG GCGTGCATGG CCCATCGGCA GTTGTAGATC ATCACTTCGA CTGGAGTGAT
GCGGCGTGGC GCGGCGTACC ACTCACCGAT CTTGTCATCT ACGAGCTACA CGTCGGCACC
TTTACCCCAG AAGGAACCTT CACGGCCATC ATCCCCCATT TACCGATACT CCGTGACCTC
GGCGTGACGG CTATCGAACT CATGCCGGTA GCCCACTTCC CCGGCCAACG TAATTGGGGC
TACGATGGCG TCTACCTCTA CGCTCCACAC ACGGTCTATG GTGGCGTCAA GGGTCTGAAG
CAATTGGTCG ATGCTGCCCA TGCCCACGGC ATCGCCGTCA TCCTCGATGT GGTCTACAAT
CACTTTGGCC CCGAAGGTAA CTATTTGTGG GACATCGCAC CGCCGGCGTT TACCGATCGC
TACCGCACAC CGTGGGGATC GGCGATCAAT TACGATGGTC CCGACAGTGA TCTGGTCCGC
TGGTTGATCA TCGAGAACGC CCTGGAATGG CTGCGTGAGT ATCATATTGA CGGCCTGCGG
CTCGATGCAA CGCATGCGAT CTTCGATGTA TCGCCGTACC ACGTGCTTGA AGAGTTAGCC
GACCGGGTGC GCGAACAAGC GATCCGCCTC GGTCGTCCGG CGTATCTCTT CGCCGAGCAT
CCGCTCAACG ATCCGCGCTT TGCCCGCCCT AAGGTCCTCG GCGGGTACGG TTTAAGCGGC
ATTTGGTCGG ACGATTTTCA TCATGCCCTG CATAGTTTTC TCACCGGTGA ACAAAACGGG
TACTACGCAG GCTTCGGTAG TTTGGCACAG ATAGCGACGG CAATCGAGCG TAGCTTTGTC
TTTGCCGGTG AATATTCACC GCACGCCCGC CGTCGCTTTG GTCGCGATCC TTCCGAACTT
GCACCCGAAC AATTTGTGGT CTTTTTGCAA AACCACGACC AAGTCGGTAA TCGTGCTATT
GGTGACCGAC TAGGGGCAAC GTTGAGTGAA GCGCAATTAC GAGTTGCTGC CGCGACGGTA
CTGCTCAGCC CGTATACACC GTTGATCTTC ATGGGTGAGG AGTATAACGA GCCGGCGCCC
TTTCAATATT TTACCGACCA CAGTGACCCG GCACTGATTA CCGGAGTACG CGAGGGCCGT
AAACGTGAGT TTGCCTACTT CTTACGTCCC GGCCAGGAAG TGCCTGACCC ACAAGATCCG
ACCACGTTTA CCCGATCGAA ACTCAATCAC GCTTTACGCA CGGTCGGCAG GCACGCGGCC
CATCAGGCCT TTTACCGCGA ACTGTTGCGG TTGCGGCGTG AGCTACCGGG GTTACGTCAA
CGGCCCCGTA CCCGTGTGCA AGGTCAGACA ATCGTGGTAG AATGGCCGCG CATCCGGCTA
CTGCTCAATT TCGGCCCGGA TCCAATACGG ATCGAACTGC CGGTAGCATC CTGGCAGATC
CGGCTCGACA GTGGCGATCC GCCCGCAACA ATCCTAGACG GCATGAGAGT CACGTGCAGC
GGGTACAGTG CTGTGCTGCT AACCACACAC AATGAGTAA
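
The GC content field of this record can be recomputed from the gene sequence. The sketch below assumes the sequence is pasted as text in the 10-base blocks shown above; the helper names `clean_sequence` and `gc_content` are hypothetical, and the example call uses only the first line of the sequence, so its result is not the whole-gene value.

```python
def clean_sequence(text):
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_content(text):
    """Return the fraction of G and C bases in a block-formatted sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence only; paste all lines for the full gene.
first_line = "ATGTTCAATC TGCCACAACC CGGTGCGGTT TACCGCAGTG ATGGTACAAC TGCCTTTACC"
print(len(clean_sequence(first_line)))       # number of bases on this line
print(f"{gc_content(first_line):.1%}")       # GC fraction of this line only
```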
 
Protein sequence
MFNLPQPGAV YRSDGTTAFT LWAPTAATVE LIVLDPKPRT VTMKPIGYDC WHVVTEAPPG 
TRYRYRLDGQ RERPDPASRC QPEGVHGPSA VVDHHFDWSD AAWRGVPLTD LVIYELHVGT
FTPEGTFTAI IPHLPILRDL GVTAIELMPV AHFPGQRNWG YDGVYLYAPH TVYGGVKGLK
QLVDAAHAHG IAVILDVVYN HFGPEGNYLW DIAPPAFTDR YRTPWGSAIN YDGPDSDLVR
WLIIENALEW LREYHIDGLR LDATHAIFDV SPYHVLEELA DRVREQAIRL GRPAYLFAEH
PLNDPRFARP KVLGGYGLSG IWSDDFHHAL HSFLTGEQNG YYAGFGSLAQ IATAIERSFV
FAGEYSPHAR RRFGRDPSEL APEQFVVFLQ NHDQVGNRAI GDRLGATLSE AQLRVAAATV
LLSPYTPLIF MGEEYNEPAP FQYFTDHSDP ALITGVREGR KREFAYFLRP GQEVPDPQDP
TTFTRSKLNH ALRTVGRHAA HQAFYRELLR LRRELPGLRQ RPRTRVQGQT IVVEWPRIRL
LLNFGPDPIR IELPVASWQI RLDSGDPPAT ILDGMRVTCS GYSAVLLTTH NE
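
The protein sequence corresponds to the gene sequence read codon by codon. The following is a minimal sketch of that translation, assuming the standard codon-to-amino-acid assignments (which translation table 11 shares with the standard code; table 11 differs in the set of permitted start codons, which this sketch does not model). The `translate` function is a hypothetical helper, not the database's own tool.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G) paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text):
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":          # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGTTCAATC TGCCACAACC CGGTGCGGTT"))  # MFNLPQPGAV
```

The output matches the first ten residues of the protein sequence listed above.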