Gene Cagg_1686 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1686 
Symbol 
ID7268988 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2057643 
End bp2059124 
Gene Length1482 bp 
Protein Length493 aa 
Translation table11 
GC content57% 
IMG OID643566528 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002463023 
Protein GI219848590 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATCTT TGAAAATGCT ACTCGTCAGC TTCACGTTGA TCGTCGTTGC CCTCGTCAAT 
GCGGCATGCG GCAACAGCCC GACGACTCCT GCGACCCAAC CCACCTCTGC GCCTGCCGAA
CCAACCGCTG CTGCTGCCCC CACCTCAACA CCTGCTGCAA CCCAAGCAAC CGCCGGTGAC
CGTATCAAAG TGCGCTGGTT CGTCGGCTTG GGCGCCGGTA CAGATGAAGG TGCTATTCCG
CCCCAGAATG CCTTTGTCGA ACGATTTAAT GCCGGCCAAG ACAAGATCGA ACTGGTGCTT
GAGATCGTTG ACAACAACGT CGCCTTCGAC ACCCTTGCCA CCCAGATTGC TGCCGGCAAC
GCACCGTGCA TCGTCGGCCC AGTCGGTATC CGTGGACGCG ACAGCTTCAA AGGTGCGTGG
CTTGACTTGC AGCCATTCAT TGACAAATAC AACTACGATC TGAGCGACTT CGATCCCAAT
CTGGTCAAGT TTTATCAGGT GAAGGAGGAG GGTCAACTTG GCATTCCGTT CGCCATCTTC
CCCTCGTTTA TCATCTACAA CAAAGATCTA TTCGATGAAG CCGGCCTCCC TTACCCACCC
GCACGCCATG GCGAGCCGTG GATTGATGAG AACGGCGTCG AGCATGAGTG GAACATTGAG
ACGCTGACCG AACTGGCCAA GAAGCTGACG GTTGACGTCA ACGGCAACGA TGCTACCTCA
CCCGATTTCG ATCCGACCAA GATTGTGCAG TTTGGTTGGA TGAACCAATG GACCGACCCG
CGCGGCATTG GTACCTTCTT CGGCGCCGGT TCGCTGGTCG ATGAGAACGG CAATGCGCAG
ATCCCCGAGC ACTGGAAGGC AGCGTGGAAG TGGACCTACG ACGGTTGGTG GAAGGATTGG
TTCATTCCGA ACGGCCCCTA CGGCGGCGCC GACTTCCTGC AAGGCCCCGG TGGACCCTTC
TCGTCGGGCA ATCTGGCGAT GATTCACATC CACATGTGGT ACGTCGCGCC ATGGGCACTC
GGTAATGTCG ATTTCGACTG GAACTTGGCA GCAACCCCCA GCTACAACGG CAAGATCACG
GCCAAGATGC ACGCCGACAC CTTTGGCATC CTCAAAGGGT GCCCGTACCC CGATGCCGCC
TTCGAGGTAT TGAGCTACAT GCTCAGTCCC GAGCACGTCA ATGAGCTGCT GACCATCTAC
GGCGGTATGC CGGCCCGCCT CTCGCTGCAA GACAACTACT TTGCGCAGTA TAATCAGACC
AGCTTCCCCA ACAAGACCGA CATCAACTGG GACGTGGTCG TGGAGGCGAT GGCCTACGCC
GACAACCCCA ACCACGAAAG CTGGATGCCC AGCTTCCAAG AGACGACCGA CCGCTACAAC
GAGTTCTGGA ACTACCTCGC CAACACGCCC GATGCCGATT TCGAGGCCGA GGTAGCGAAG
CTGCAAGCGG ATTTGCAGAA GATCTTCGAC GCAGCAAAGT AG
 
Protein sequence
MKSLKMLLVS FTLIVVALVN AACGNSPTTP ATQPTSAPAE PTAAAAPTST PAATQATAGD 
RIKVRWFVGL GAGTDEGAIP PQNAFVERFN AGQDKIELVL EIVDNNVAFD TLATQIAAGN
APCIVGPVGI RGRDSFKGAW LDLQPFIDKY NYDLSDFDPN LVKFYQVKEE GQLGIPFAIF
PSFIIYNKDL FDEAGLPYPP ARHGEPWIDE NGVEHEWNIE TLTELAKKLT VDVNGNDATS
PDFDPTKIVQ FGWMNQWTDP RGIGTFFGAG SLVDENGNAQ IPEHWKAAWK WTYDGWWKDW
FIPNGPYGGA DFLQGPGGPF SSGNLAMIHI HMWYVAPWAL GNVDFDWNLA ATPSYNGKIT
AKMHADTFGI LKGCPYPDAA FEVLSYMLSP EHVNELLTIY GGMPARLSLQ DNYFAQYNQT
SFPNKTDINW DVVVEAMAYA DNPNHESWMP SFQETTDRYN EFWNYLANTP DADFEAEVAK
LQADLQKIFD AAK