Gene Cagg_2195 details

Gene Information

Locus tag: Cagg_2195
Symbol:
ID: 7266768
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 2689993
End bp: 2691345
Gene Length: 1353 bp
Protein Length: 450 aa
Translation table: 11
GC content: 57%
IMG OID: 643567026
Product: extracellular solute-binding protein family 1
Protein accession: YP_002463514
Protein GI: 219849081
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
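
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions (the record does not state its coordinate convention) and that the last codon of the CDS is a stop codon that is not counted in the protein length.

```python
# Assumption: Start bp and End bp are 1-based, inclusive coordinates.
start_bp = 2689993
end_bp = 2691345

gene_length = end_bp - start_bp + 1   # inclusive span, in base pairs
codons = gene_length // 3             # number of complete codons in the CDS
protein_length = codons - 1           # assumes the final codon is a stop codon

print(gene_length, "bp")    # 1353 bp
print(protein_length, "aa") # 450 aa
```

Under those assumptions the computed values agree with the Gene Length and Protein Length fields listed above.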


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTATGA AACGTACATT CACCCGGCGC GAGTTGTTGC GCCTGATGGT GGCCGGGAGC 
GGTGCGGCGG TATTGGCGGC GTGTGGTACG CAAGGCGGGC AGACGGGAAC GCAGGCCACC
CAAGCACCGG CTGTGGTCAG CCAGCCCGGA TCGAAGGTCA AGATTACCTA CTGGGGTTCG
TTCAGTGGGA ATCTGGGCGA AGCTGAGCAG GCGATGGTCA AGGCGTTTAA CGAGGAGCAG
GATGAGGTCG AGGTTGAGTA TCAGTTTCAA GGCAGTTACG AAGAGACGGC ACAGAAGTTC
ACCGCTGCTT TGCAAGCTAA TACCACGCCT GATGTCATCC TGCTCTCGGA TGTCTGGTGG
TTTGGCTTTT ATCTGGCCGG TGCGATTACA GCACTCGATG ACCTCGCCAG GCAGGTGAAT
CTCGATTTCA ATGATTACGA ACCGGTGTTG CTCAATGAAG GTGTGCGCAA AGGTGTCCAT
TACTGGATCC CATTTGCGCG CAGCACACCG CTCTTCTACT ACAACAAAGA CATTTGGGCC
GAGGCGGGTC TGCCCGATCG CGCCCCAGAG ACGTGGGCCG AGTTTAGCGA GTGGGCGCCG
AAGTTGGTCA AGAGCGATGG CAGCCGGTCC GCGTTTGGTC ACCCTAACGG TGCGAGCTAC
ATTGCGTGGC TCTTCCAGGG GGTGGTGTGG CAGTTTGGCG GTCAGTACTC GCAACCTGAC
TTCACCATGA CGATGACCGA TCCGAATACG TTGCGCGCGG CTCAGTTCTA CCAGGATACG
GTGGTCAAAA ATAAGTGGGC TATCTTGTCG CCCAACCTTA ATCAAGATTT CATCGGTGGG
GCGATTGCCT CGATGATGGC CTCAACCGGT TCATTAGCCG GGATTCAGGC TAACGCTACC
TTCCCGGTGG GAGTCGGCTT CTTGCCGCGA GAGACCAACT TTGGTTGCCC GACCGGTGGC
GCCGGTTTGG CGATTGTCAG CCGTGCTCCT GCCGAGAAGC AACTGGCGGC GATGAAGTAT
ATCGCGTTTG CGACCAACCC TACCAGCGCC GGTGTGTGGT CGCGGAGCAC GGGATATATG
CCGGTACGGA TTAGCACCAA GCAGACGCCG GAGATGATCG AGTTCTTCAA ACAAAACCCC
AACTTCAAGA CGGCGGTTGA TCAATTGCCT AAGACTCGTG CGCAAGATGC GGCACGTGTG
TTTGTGCGCA ACGGTGACCA AATTATCGGT AAGGGACTCG AGCGGATCAT CGTCAACGGT
GAAGCACCGA GTGCTGTGTT TGCCGAGGTT AATAACGAGC TGACCGAGGG CGCCAAGCCG
ATCCTGGAGG ATCTCAAAGC ACGCGAAGGC TGA
 
Protein sequence
MSMKRTFTRR ELLRLMVAGS GAAVLAACGT QGGQTGTQAT QAPAVVSQPG SKVKITYWGS 
FSGNLGEAEQ AMVKAFNEEQ DEVEVEYQFQ GSYEETAQKF TAALQANTTP DVILLSDVWW
FGFYLAGAIT ALDDLARQVN LDFNDYEPVL LNEGVRKGVH YWIPFARSTP LFYYNKDIWA
EAGLPDRAPE TWAEFSEWAP KLVKSDGSRS AFGHPNGASY IAWLFQGVVW QFGGQYSQPD
FTMTMTDPNT LRAAQFYQDT VVKNKWAILS PNLNQDFIGG AIASMMASTG SLAGIQANAT
FPVGVGFLPR ETNFGCPTGG AGLAIVSRAP AEKQLAAMKY IAFATNPTSA GVWSRSTGYM
PVRISTKQTP EMIEFFKQNP NFKTAVDQLP KTRAQDAARV FVRNGDQIIG KGLERIIVNG
EAPSAVFAEV NNELTEGAKP ILEDLKAREG
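
As a rough consistency check between the two sequences, the following sketch translates the first and last rows of the gene sequence and compares them with the corresponding ends of the protein sequence. It is a minimal illustration, not the database's own method. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced here. The codon assignments are those of the standard genetic code, which translation table 11 shares for amino acids and stop codons.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First and last rows of the gene sequence, copied from the record above.
first_row = "ATGAGTATGA AACGTACATT CACCCGGCGC GAGTTGTTGC GCCTGATGGT GGCCGGGAGC"
last_row = "ATCCTGGAGG ATCTCAAAGC ACGCGAAGGC TGA"

print(translate(first_row))  # MSMKRTFTRRELLRLMVAGS
print(translate(last_row))   # ILEDLKAREG*
```

The first row translates to the first 20 residues of the protein sequence, and the last row translates to its final 10 residues followed by the stop codon TGA.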