Gene Cagg_3183 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3183 
Symbol 
ID7269932 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp3862409 
End bp3863644 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content58% 
IMG OID643568004 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002464477 
Protein GI219850044 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2182] Maltose-binding periplasmic proteins/domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.641192 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.908857 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGCGTT CGACGTTTTC ACTGCTTACC CTGCTCGCTC TGTTCGCTAC CTTCCTCGCT 
GCTTGCGGTT CAGCTCCCAC TCCCGCTCAG CCAACATCGG CCCCCGCTCA ACCGACTGCT
GCTAGCCAAC CTGCTACCGG CGGAATGGCG ATTACCGGTA CTGTCACGCT CTGGCACGCC
TACGGGACCG GCAGTGCCGA AGAGAAGGCC ATCAATATAC TGATCGACCG TGCCCGTGCC
GCGTATCCGC AGGCTACCAT CAATGTGTTG CAAATCCCGT TCGACCAGAT TTTCAACAAG
TTCAACAACG AAGTATCGTC CGGTGGTGGG CCTGATATGT TCATTGCCCC GAACGATAGT
CTCGGTAGCC AGATTCGCGC CGGCGTCTTG GCCGATCTCA GCGAGTATCA GAGCATGCTG
ACCGACGTCG CGCCGACCGG TGTGGCCGGT ATGTCGCTCA ATGGCAAGCT GTACGGTATT
CCCGAGTCGT TCAAGGCGGT AGCACTCTAT TACAACAAGA GCAAGATTAC AAACCCGCCA
ACAACGACCG ATGAGCTGTT GGCCATGGTC AAAGAGGGCA AGACGCTGGT GCTCAACCAG
AATGCCTACC ACAATTTCGG CTGGTTGCAG GCATTTGGCG GCCAACTGAT GGATAATAAC
GGCAAGTGCA TTGCCGATCA GGCCGGTGGT CCTGAGTGGT TCGCCTACCT CAAGGCGTTG
AAGGAGGTGC CAACCGTCAC CTTCTCGACC GACGGTGGGC AGGCCGATTC GTTGTTCAAG
GATGGCAAGG CCGACATGAT CATCAACGGC CCTTGGGTAC TCGGTGACTA CCGCGCCGTG
TTAGGCGATA ACCTCGGTGT GGCACCGATG CCGGGCGCTA CCAAACCTGC CGGGCCGCTC
ACCGGTGTTG ATGGCTTCTA CGTGAGCATC AACAGCCAGA ACATTGCCGG TGCCGTCGCA
TTAGCAATGT TCCTGACCAG CCCTGAGTCG ATGAAGGTAT ATGTCGACGA GGCCGGTCAT
GTGCCGGTAA GCACCAAAGT TCAGATTTCC GACCCACTGG TGCAAGCGTT TGCCCAGGCT
TCGGCAACCG GTGTACCACG GCCACAGATT CCTGAACTCG ACAACTACTG GGGCCCCTTT
GGCGACGCTA TGACGAAGGT GCTCGATGGT GGCGCCGATC CGGCTGCGGC TGTGGCCGAA
GCCTGTGCGC TGATGAACAC CGCAAACGGT AAGTAA
 
Protein sequence
MKRSTFSLLT LLALFATFLA ACGSAPTPAQ PTSAPAQPTA ASQPATGGMA ITGTVTLWHA 
YGTGSAEEKA INILIDRARA AYPQATINVL QIPFDQIFNK FNNEVSSGGG PDMFIAPNDS
LGSQIRAGVL ADLSEYQSML TDVAPTGVAG MSLNGKLYGI PESFKAVALY YNKSKITNPP
TTTDELLAMV KEGKTLVLNQ NAYHNFGWLQ AFGGQLMDNN GKCIADQAGG PEWFAYLKAL
KEVPTVTFST DGGQADSLFK DGKADMIING PWVLGDYRAV LGDNLGVAPM PGATKPAGPL
TGVDGFYVSI NSQNIAGAVA LAMFLTSPES MKVYVDEAGH VPVSTKVQIS DPLVQAFAQA
SATGVPRPQI PELDNYWGPF GDAMTKVLDG GADPAAAVAE ACALMNTANG K