Gene Cagg_3716 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3716 
Symbol 
ID7268252 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp4515824 
End bp4517146 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content57% 
IMG OID643568523 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002464988 
Protein GI219850555 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000472149 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCACGTT CGCGGATGAC TTTGTTGGTA ATTCTGATGA CCGTCTTCAG CTTTGTGCTG 
GCGGCGTGTG GTGGTGGATC GACGTCTCAA CCTTCTGGTG GTGAGCAGAC GGGACAGACG
ACCGGCGAAA AGAAGCGGGT AACGATCCTC GGTGCGTTTG GTGGCGGTGA AGCCGATGCT
TTCGAGGAGG TGATCAAGGT TTTCGAGGCA GCTAACCCAG ATATCGATGT GGTCTACACC
GGCGTTAATG ACTTCGACAC CCAGATTGTC GTGCGGGTGC AAGCCGGTGA TCCGCCGGAT
ATTGCCGGTT TCCCGCAGCC GGGTGGTGCC GCACGGTTGG CTGCTGAAGG TAATCTGGTA
CCGCTGTGGC CGGAAGTTAT CTCGTTGATC GACAAGAACT ATGCCCCGTT CTGGAAGGAA
TTGGGTACGT TCGACGGCAC GCCCTACGGA GTCTTCCATC GTGTCAATGC CAAGGGCTTT
ATCTGGTATA ACAAACCGGC GTTTGAGGCT GCCGGCTATA AGGTTCCCAC CACGTGGGAA
GAGTTGAAGG CCCTGACCGA GCAGATGAAA GCCAACGGTC ATACACCGTG GTGCGACGGG
ATCGAATCGG GTGCAGCGAC CGGTTGGAAG GGCACCGACT GGATCGAAAA CATTATGCTG
CGTACCCAAA CCACCGCTGT TTACGACAAA TGGATTTCGG GTGAAGTGCC CTTCAGCTCA
CCAGAAGTTA AGCGCGCTTT CGAGATTTTG GGTGAGGTCT GGTTCACCGA TGGTAATGTC
TTCGGTGGTC GCCAGTCGAT TGTGCTCACG AACTTCGGTG ATGCGGCGAC CTTCCTCTTC
ACCGAGCCAC CGAACTGCTG GTTGCACTTG CAAGGTAGCT TCGTTACCAA CTTCTTCCAA
GACTCGGTCA AGGCCGATCT TGATAACAAG GTTGGTTTGT TCGTGATGCC GCCGATTGAT
CCTAACGTCA CCCCGGCGCT GGAGGTTGGT GGTGATGTGT TCGTGATGCT CAAGGGACGT
GATCGGCCCG AAGTGCGGAA GTTCATGGAG TTTATGGCGA CCGGTGCATC GGCAACACCG
TGGGCACGGC TCGGTGGTGG TATCTTCCCG CACAAGGATC AAGACCTGAC GGTCTATCCG
ACCTCGATCG AGCGGCAGGT CGCCGAAGCG ATCCTCGCTG CGCAAGCCGC TCGCTTCGAT
GCTTCGGATG CGATGCCGGC GGCGGTGAAT GCAGCGTTCT GGAAGGGCGT GACCGACTGG
GCTAGTGGTA CGCGCGATCT TGATACCGTG TTGGCCGAGA TCGACGCGGC GCGTAATCAG
TAG
 
Protein sequence
MSRSRMTLLV ILMTVFSFVL AACGGGSTSQ PSGGEQTGQT TGEKKRVTIL GAFGGGEADA 
FEEVIKVFEA ANPDIDVVYT GVNDFDTQIV VRVQAGDPPD IAGFPQPGGA ARLAAEGNLV
PLWPEVISLI DKNYAPFWKE LGTFDGTPYG VFHRVNAKGF IWYNKPAFEA AGYKVPTTWE
ELKALTEQMK ANGHTPWCDG IESGAATGWK GTDWIENIML RTQTTAVYDK WISGEVPFSS
PEVKRAFEIL GEVWFTDGNV FGGRQSIVLT NFGDAATFLF TEPPNCWLHL QGSFVTNFFQ
DSVKADLDNK VGLFVMPPID PNVTPALEVG GDVFVMLKGR DRPEVRKFME FMATGASATP
WARLGGGIFP HKDQDLTVYP TSIERQVAEA ILAAQAARFD ASDAMPAAVN AAFWKGVTDW
ASGTRDLDTV LAEIDAARNQ