Gene Cagg_0406 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0406 
Symbol 
ID7266574 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp501673 
End bp502959 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content59% 
IMG OID643565273 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002461787 
Protein GI219847354 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2182] Maltose-binding periplasmic proteins/domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00219397 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAAGCAA TGAAGAGTGC CCTGATCGCT ATGTTTCTCG CCGTAGCCAT GATGCTTGCG 
GCATGTGGCG GTGGTCAAAC CGCACAACCG ACTACTGCGC CGGCGGAGCA ACCGACTACT
GCGCCGGCGG AGCAACCGAC TACTGCGCCT GCTGCCCAAC AGGTGACGAT TAAGATTTGG
CACCAGTGGG ATGGTGCCTA CCTGACCGCA ATCGAGCAAG CGTTCCGTGA TTACGAGGCT
GCTCACCCGA ATGTCAAGAT CGACCTCTCG AAGCCGGAAG ATGTGAGCAA CGCGCTCAAT
GTGGCCATCC CGGCCGGTGA AGGTCCCGAT ATTATCGGTT GGGCCAACGA CCAGATCGGC
CAGCAGGCGC TGGTGGGTAA CATCGTCGCG CTCAACGATT ATGGGATCAC CGAAGAATTC
CTGCGCAGCA CCTACGAGCC GGCGGCTGTG AATGGCGTCA TCTGGCAGGG CAAGATCTGG
GCGTTGCCGG AGACGCAGGA AGGTATTGCC TTGATCTACA ACAAGGCCGT CATTGGTGAC
ATGCAGTTGC CGACCAACCT CGATGAGCTG CTGGAAATGG CGACGAAGTT CCGCGCTGAG
AACCCTGATA AGACGCTTGT CTGCAATCAG GGATTCGGTG GGAACGATGC TTATCACGTC
GCTCCGATCT ACTTCGGGTA TGGCGTGCCG AGCTACGTGG ACGATCAGGG CAATGTCTAC
GTCAATACGC CGGAGATGAT CAAGGGCGGT GAGTGGCTGG CCGCGATGAG CAAAGTCTCG
TTCAGCGAGC AGAGCTACGA TATCTGCAAG GCAGCATTGG CCGAGGGTAA GGCTGCCATG
TGGTGGACCG GCCCGTGGGC GATTGCCGGT ATCGAGCAGG ATGGCGTTGA TTACGGTATT
CTGCCGTTGG GCAAGCCCTT CGTCGGTATC AAGACCTTGA TGCTGACCCG CAACGCGGTT
GAGCGTGGCA ATGCCGAGGT CGCGTTGGAC ATTATGAAGT ACTTCACCAG TGCGGAGGTG
CAGACCAAAC TGGCGCTGAC CAACAAGACG GTGCCGGCGG CGACGGCTGC ACTCAAGAAT
CCAGAGGTGG CTGCCCTTCC GACCCTGGCC GGGTTTGGTG CTGCCCTGAA TGCGGGTGTA
CCGATGGCGA ATACACCTTA CGCTTCGGCC CAGTGGGGTC CGGTGGGTGA GGCCAGCGTT
GCGATTTGGA CCGGTGCGCA GACGCCTGCT GATGCGCTAG CTGCCGCTGC TAAAGCGATT
GAAGAAGCCA TCATGCAGAT GAAGTAG
 
Protein sequence
MKAMKSALIA MFLAVAMMLA ACGGGQTAQP TTAPAEQPTT APAEQPTTAP AAQQVTIKIW 
HQWDGAYLTA IEQAFRDYEA AHPNVKIDLS KPEDVSNALN VAIPAGEGPD IIGWANDQIG
QQALVGNIVA LNDYGITEEF LRSTYEPAAV NGVIWQGKIW ALPETQEGIA LIYNKAVIGD
MQLPTNLDEL LEMATKFRAE NPDKTLVCNQ GFGGNDAYHV APIYFGYGVP SYVDDQGNVY
VNTPEMIKGG EWLAAMSKVS FSEQSYDICK AALAEGKAAM WWTGPWAIAG IEQDGVDYGI
LPLGKPFVGI KTLMLTRNAV ERGNAEVALD IMKYFTSAEV QTKLALTNKT VPAATAALKN
PEVAALPTLA GFGAALNAGV PMANTPYASA QWGPVGEASV AIWTGAQTPA DALAAAAKAI
EEAIMQMK