Gene Cagg_0090 details


Gene Information

Locus tag: Cagg_0090
Symbol:
ID: 7266828
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 127748
End bp: 129217
Gene Length: 1470 bp
Protein Length: 489 aa
Translation table: 11
GC content: 59%
IMG OID: 643564963
Product: extracellular solute-binding protein family 1
Protein accession: YP_002461479
Protein GI: 219847046
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
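
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names are hypothetical and are not part of any database API.

```python
def cds_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Amino-acid count for a CDS whose length includes one stop codon (assumption)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
gene_len = cds_length(127748, 129217)
print(gene_len)                  # 1470
print(protein_length(gene_len))  # 489
```

Both results agree with the Gene Length and Protein Length fields of this record.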


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.271567
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACCCG CTGCCAAAGG TGGCTTCCTG CTCGTGATAC TGATGCTGCT CGCGGCCATG 
GCCGGCTGTA CGCCAGCGGC TATACCACCG ACTCCCCAAG CGCCAACCGC TGCGCCACAG
ACACCCCCTG AGCAACCGAC CGCCGCACCG ACCGCCGCAC CGACCGCTGC ACCGACTGCT
GCTCCCACTG CTGCCGCCAA GCAACCGGTG ACCCTGCGCT ACGCCAACTG GAACCTCGGC
ACTGAGGAAG AGAACAACAT TCAGCGCCGC TTGGTCAAGG CGTATACTGA GATGAACCCG
CACGTGACGA TCGAGTTCGT TGATATGTCG GGTGGTGGCT GGGACGATAT GCTCAACACC
TATGCAGCCC GCGGTGAGCT ACCCGATGTC TTTATGGCCA ACAACATGCC GCTCTACGTT
AAGAACGGCT GGCTGGCCGA TTTGACCGAG CTGGTGGCGA ATGATCCCGA TTGGGCGCTC
ATCCCGCAAG TGCTGCGGTC AGGTGTCACC TATAACGGCA AGGTGATGGG TTTGCCGGCG
GCGCAGTTCA TTATGGGCTA TTTCGTCAAC CGCGATCTCT ACGAAGCGGC TAACCTTGAT
GCGCCTGAGT ACGGTTTCAC GCTCGACGAG TTCAACGCGG CGGTGACCGG CTTACACAAC
CCGTCCCGAG GCGTTCTCGG TCTCGACGAG ATGGAGTTCG TGATGGGCTG GTACCCGCAC
GTGCTCGACA ACAAGTTGCA GTGGTTCAGC TTCGATGGCG TTCACATGAA CTACAACTCA
CCGGCGTTCA AAGACACGGT GGCGCGGGTG GCCGAGCTGA AGCCCTACAC ATGGCAGGGC
TTGACCGATG AGCAGAAGGT CAACTTCAAA TCGGCCGGAC CGTGGGAGCT GTTCCTGAAC
CAAGAAGTCG GCATGCGCTG GGAAGGCGGT TGGGCCATTC CGCAGATTGC GCAGAACGCT
ACCTTTGACT GGGACTTCGT CGGCATCCCC GGCGGTAATC AGGCGATTGT GATGGACATC
ATCGCTGTCT CGAAGACGGC GCCGAATCTG GAGGAAGCCT ACCAATTCGC GCGCTGGATG
ACCTTTGCCC GCGCCGCTTA CGCCAAAGAG GTGGAACTGG CCCGCGAGAT GGGTAGCGTG
CCAAGCAAAA TGCCGGTCGC GATTGACACT GAGTCGCTGG CGCTCTACCG CCAATTCTTC
GACAAGCCGG GTCTCAATGC AGCCCTTGAG AATCTGAACA ATAGCCTTGT CGAGTCACTG
GCCAAACTCG TACCGGGTTA TATCCAGGCA CGCTGGGAAG GCAAACCCGG CATCGACATC
GGCGAAGATA AAGATGTGAA CATGTGGTTC ATGTTCGCCC ATGCCGGCGA TGGCATCTAC
AAGTACGAGG ATTACGCACC GAAATTAGAG ACGTTCGCCA ACAATATCCT CGATACGGCG
CGGGCCGAGG TTGACGCCGC CTTGCGATAG
 
Protein sequence
MKPAAKGGFL LVILMLLAAM AGCTPAAIPP TPQAPTAAPQ TPPEQPTAAP TAAPTAAPTA 
APTAAAKQPV TLRYANWNLG TEEENNIQRR LVKAYTEMNP HVTIEFVDMS GGGWDDMLNT
YAARGELPDV FMANNMPLYV KNGWLADLTE LVANDPDWAL IPQVLRSGVT YNGKVMGLPA
AQFIMGYFVN RDLYEAANLD APEYGFTLDE FNAAVTGLHN PSRGVLGLDE MEFVMGWYPH
VLDNKLQWFS FDGVHMNYNS PAFKDTVARV AELKPYTWQG LTDEQKVNFK SAGPWELFLN
QEVGMRWEGG WAIPQIAQNA TFDWDFVGIP GGNQAIVMDI IAVSKTAPNL EEAYQFARWM
TFARAAYAKE VELAREMGSV PSKMPVAIDT ESLALYRQFF DKPGLNAALE NLNNSLVESL
AKLVPGYIQA RWEGKPGIDI GEDKDVNMWF MFAHAGDGIY KYEDYAPKLE TFANNILDTA
RAEVDAALR
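
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is one way to do that in plain Python, assuming the sequences are pasted with their 10-character spacing intact. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in the start codons it permits; since this gene begins with ATG, the standard codon-to-amino-acid mapping is used here.

```python
from itertools import product

# Codon order TTT, TTC, TTA, TTG, TCT, ... paired with the standard amino-acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence from this record.
first_line = "ATGAAACCCG CTGCCAAAGG TGGCTTCCTG CTCGTGATAC TGATGCTGCT CGCGGCCATG"
last_line = "CGGGCCGAGG TTGACGCCGC CTTGCGATAG"
print(translate(first_line))  # MKPAAKGGFLLVILMLLAAM
print(translate(last_line))   # RAEVDAALR
```

The two outputs match the first 20 and the last 9 residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field.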