Gene Cagg_0729 details


Gene Information

Locus tag: Cagg_0729
Symbol:
ID: 7268048
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 907390
End bp: 908721
Gene Length: 1332 bp
Protein Length: 443 aa
Translation table: 11
GC content: 57%
IMG OID: 643565580
Product: extracellular solute-binding protein family 1
Protein accession: YP_002462089
Protein GI: 219847656
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
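
The length fields above follow from the coordinates by simple arithmetic. The sketch below shows one way to check them; it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(907390, 908721)
print(length_bp)                  # 1332
print(protein_length(length_bp))  # 443
```

Both results agree with the Gene Length and Protein Length fields of this record.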


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000254304
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGCTTC GGAAACACAC TCTGTCGTGG ATTGCGCTGA TTGCCCTGTT TACCGTGATG 
CTAGCGGCGT GTGGCGGTGG TCAGCCTACC ACAGGGAGTG GCAGTGGTGG GCAAAGCGGC
AGCAGTGCGA ATACCGGTGG CAGCGGCCAA GCGGTTACCA TTCGCTGGCG GACTCGCCCC
GGTGATGCTG CCGAGCAGCG TGTCTATGAA GAGTTAAATA CCCTTGTCAA CGAAAAACTC
AAGGATAAAG GGATCACCGC AGTATACGAT CCGGCGCCCA ATCAGGGCTA CTTCGAGAAG
CTGAAGACCG AGTTGGCAGC CGGCAATGCC CCTGACATCT TCTGGATCGG TGGTGTCGAG
TTAGCGGATT TTGTCAATAC CGGTCAGATT CTCGATCTGA AGCCACTGAT CGATGCCGAT
AGCAGCTTCC AGTTGAGCAA CTTTTACCCG AACGTGATCG AGCAGTTGAC GCGCGATGGG
AAGATCTACG GTCTGCCGCG CGACATCTCG ACGATGGTCG TGTATTACAA CGAAGACCTG
TTCAAAGCCG CAGGCTTGAA GACGCCGAAA GAGTTGGCGG CTGAGGGTAA CTGGAATTGG
GATACTATGC TCGAAGCGGC ACGCAAACTG ACCGATCCGG CGAAGCAGCA GTACGGCCTC
GGGTTTGGTA ACTGGTGGGG ACCGGCTTGG GGTTACTTTG TTAACGCTGC GGGTGGTAGT
CCCTTCACGC CTGACCGTCG CGGGTGTGCG TTGAATTCAC CAGAAGCGAT CAACGGCGCC
AAGATGGTGC GGATGCTCTA CGATGAGAAG CTCCTGCCGG CCGGTGATGC GGATGGTGAG
GCACTCTTCA ATGCCGGTAA GGTAGCGATG TATTTCAATG GCCGCTGGTT TACCCCCGGT
GTCCGCACCA ATGCCCAGTT CAACTGGGAC GTGGCGGTGA TGCCGGAGGG CAAGGTGAAG
AGTACATGGC TCTTCTGGGG GCCGTATCTG GTTAATGCAA AGACCGCTAA CGCGCAGGCA
GCTTGGGAGG TGCTGAAGGT ACTGACCAGC GCCGAGGCCA CGGCTAAGGT CGCGGCGTTA
GGGACAAACA TCCCGCCACG CAGCGATCAA GAGGCGGTCA ATGCATTCCT CGCCTCGACG
CCACCGGCCA ATAATCAGGC TTTCCTTGAT GGGATCCCCT ATGCAGCACT GGAAGCACCG
GTGTGGGATG GAAGCTGGGC AGATTTCAGT GGTATTGTCC AGAGCCTCTG GGACCAGATG
ATCGCCGGAC AGATCACGCC TGAGCAATTT GGTCAGCAGG CATGTGAACA GGCGGCCAGC
ACCTTTAAGT AG
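
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any statistic such as GC content is computed. The following is a minimal sketch of that step, not the method used to produce the GC content field above; `clean_sequence` and `gc_percent` are hypothetical helper names.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence above: six blocks of 10 bases.
first_row = ("ATGATGCTTC GGAAACACAC TCTGTCGTGG "
             "ATTGCGCTGA TTGCCCTGTT TACCGTGATG")
print(len(clean_sequence(first_row)))  # 60
print(gc_percent("GGCC AATT"))         # 50.0
```

Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field.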
 
Protein sequence
MMLRKHTLSW IALIALFTVM LAACGGGQPT TGSGSGGQSG SSANTGGSGQ AVTIRWRTRP 
GDAAEQRVYE ELNTLVNEKL KDKGITAVYD PAPNQGYFEK LKTELAAGNA PDIFWIGGVE
LADFVNTGQI LDLKPLIDAD SSFQLSNFYP NVIEQLTRDG KIYGLPRDIS TMVVYYNEDL
FKAAGLKTPK ELAAEGNWNW DTMLEAARKL TDPAKQQYGL GFGNWWGPAW GYFVNAAGGS
PFTPDRRGCA LNSPEAINGA KMVRMLYDEK LLPAGDADGE ALFNAGKVAM YFNGRWFTPG
VRTNAQFNWD VAVMPEGKVK STWLFWGPYL VNAKTANAQA AWEVLKVLTS AEATAKVAAL
GTNIPPRSDQ EAVNAFLAST PPANNQAFLD GIPYAALEAP VWDGSWADFS GIVQSLWDQM
IAGQITPEQF GQQACEQAAS TFK
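
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this under stated assumptions: it uses the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs in its permitted start codons, which does not matter here because the gene begins with ATG), and it handles only the unambiguous bases A, C, G and T. `translate` is a hypothetical helper name.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = ("FFLLSSSSYY**CC*W"
               "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR"
               "VVVVAAAADDEEGGGG")

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(text: str) -> str:
    """Translate a block-formatted DNA sequence up to the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGATGCTTC GGAAACACAC TCTGTCGTGG"))  # MMLRKHTLSW
print(translate("ACCTTTAAGT AG"))                     # TFK
```

The two calls use the first 30 bases and the final row of the gene sequence; their output matches the start and end of the protein sequence above.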