Gene Cagg_3483 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3483 
Symbol 
ID7267506 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp4247644 
End bp4248957 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content63% 
IMG OID643568292 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002464759 
Protein GI219850326 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATTGCTT GCATTGCCCG CGCCGCGGCG CTGATCGGCG TTGCGGCGCT GATCTGCATT 
GCATGCGCTG CGCCGGCACC CGCGCCAACG CCGACGGTGA CCGTGACCGC AGCGCCCACA
ACGACCACAG CGCCTACGGC GACCGCAATG CCCGCTGCTA CGCCTACAGC ACCGCCAACG
GCCACACCGG TCACCCGTAA TTTAACCATT TGGGTTGCCG AACTCCCTGA TGCACAGGCG
GTAGTTGCTA GCGAACTACA CCGAGCAGCA CAGATCGGCG ATCTCCACGT AACAATTGTG
CCCCGTGATC CTGATGGGTT GCTCATTAGT CTCGCCACCG ATCACCTGCT CGGCCTGCCG
CCGCCCGACC TGATCTGGGC CGATCAAGAG GCACTGGTAG GGTTGTTAGC CGATAACGCG
CTAGCACCGA TTGGCATCGA GCTGCCGGCT GACCTCTTGC CCGGCTTGCG TACTCTGGCG
AGTAGCAACA ATACGCTGTG GGGAGCACCG ATCACCGCCC AAGATATGCT GCTGCTGCTG
TACCAGCTCG ATTGCTCACC GCCGGCGACG GCTGCCGATT TGGTGGCGGC AGCGCAAGCG
GCCCGCACAT CCGAACGGGC CGGATTTGTG CAAGGCTGGG GGGCGGCGCG CTGGCTAGCG
CCATGGTTCT ACGTTGCCGG CGGCGCCTTT ACCACCCCCG ACGGCGCTGA GCCAACCCTC
GACACACCGG CAATGACGAC CACCCTTACC CTCCTTCGCG ATCTCTACCG GGCTGCGCCG
CACAACGGCG ATAGTTATAC GCGCGGTCAA CGTCTGCTGA CCCAAGGGTT TGCCGCCTAT
GCAATCGATG GTGACTGGGC TTGGGCGACC TATCGCGCCA TCAGCGATAC GTTCCAGATC
GGGATCGCAC CACTCCCCTC GTTCAACGGT AATCCGACCC GCCCACTGAT CGGCGGCAGC
CTGTTGATGC GCCATCGGGA TGGACAAGCC ACACCAGAGG ATGTGACAAC ACTGATCACA
ACCCTCTATC AACCAGAGGT GCAACTGCGG CTGAGCGCAG CTTTAGGCCG CCTACCGGCC
CGGCGTGAGC TGCTAACCGA TGCCAGCATT CAAACCGATC CAGTCCGTGC TATAGCCGCG
ATGCAAGTGA CCGATGCCCC CGGCCTTCCC CCAACACCTG CCGTCCGCTG CGCCATTTTC
GGCGTTGAAT CGTACCTGTA CAGCGCCACC ACCGGCGATC TCCCCATCAA TGAAGCCCCT
CTCCGTATGC AACGCGAAGC GTTGGCCTGC TTGCGCCAAT TTGTGCAACC CTAA
 
Protein sequence
MIACIARAAA LIGVAALICI ACAAPAPAPT PTVTVTAAPT TTTAPTATAM PAATPTAPPT 
ATPVTRNLTI WVAELPDAQA VVASELHRAA QIGDLHVTIV PRDPDGLLIS LATDHLLGLP
PPDLIWADQE ALVGLLADNA LAPIGIELPA DLLPGLRTLA SSNNTLWGAP ITAQDMLLLL
YQLDCSPPAT AADLVAAAQA ARTSERAGFV QGWGAARWLA PWFYVAGGAF TTPDGAEPTL
DTPAMTTTLT LLRDLYRAAP HNGDSYTRGQ RLLTQGFAAY AIDGDWAWAT YRAISDTFQI
GIAPLPSFNG NPTRPLIGGS LLMRHRDGQA TPEDVTTLIT TLYQPEVQLR LSAALGRLPA
RRELLTDASI QTDPVRAIAA MQVTDAPGLP PTPAVRCAIF GVESYLYSAT TGDLPINEAP
LRMQREALAC LRQFVQP