Gene Cagg_3719 details

Gene Information

Locus tag: Cagg_3719
Symbol:
ID: 7268255
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 4519448
End bp: 4520896
Gene Length: 1449 bp
Protein Length: 482 aa
Translation table: 11
GC content: 60%
IMG OID: 643568526
Product: Glycosyl hydrolase family 32 domain protein
Protein accession: YP_002464991
Protein GI: 219850558
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1621] Beta-fructosidases (levanase/invertase)
TIGRFAM ID:
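
The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only and is not part of the database record. It assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; both assumptions agree with the values listed here. The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a gene length that
    includes the stop codon (assumptions, not stated by the record).
    """
    gene_len = end_bp - start_bp + 1  # inclusive coordinate range
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # last codon is the stop codon
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
print(cds_lengths(4519448, 4520896))  # (1449, 482)
```

Under these assumptions the result matches the listed Gene Length (1449 bp) and Protein Length (482 aa).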


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000148821
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACACTAT TTCTGATGAG ATCGAAAGAG CAGGGTATGA TTAGCGACCC GCATCGCCCC 
CGCTACCACT TTTTACCGCT GGCCAACTGG ATGAATGACC CCAACGGGCT GATCCAGTGG
GGCGAGACGT TTCACCTGTT TTACCAGTAC AATCCCGCTG GAGCGTACCA TCGCAATATC
CATTGGGGGC ATGCGACGAG CGCCGATTTA CTATATTGGC AACATCAGCC CATCGCCCTT
GCTCCGACAC CGGGCGGCCC CGATGCCGAT GGTTGCTGGT CGGGTTGCGC AGTCAATGAT
TATGGCACGC CAACGTTGAT CTATACCGGT TTTCGCTTGC CTGAAGAACA AACTCCTTGT
CTGGCGGTGA GTCGCGATGG GTTGCTGACG TGGCAGAAGT GGCCGGAACC GATCATTCCC
GCTCCTCCAG CCGATCTCGA TCTGCTCGGT TTTCGCGATC ATACGGTCTG GCGTGAGAAT
GGCCGGTGGG CGATGCTGAT TGGCGCCGGT ATTCGCGGTC AAGGCGGCAC GGTGCTGTTG
TACCGGTCGG ATGATCTGCG CCGCTGGGAA TACGGCGGGC CGCTGGTGAT CGGTGATGCT
GGCCAGTTCG ATCCAGTCTG GACAGGCACG CTCTGGGAGT GTCCAGACTT TTTTTCGTTA
AACGGTGATC ACGCACTGAT CTGTTCGGTG TGGGATCGGT GCCCGTATTA CACCATCGCG
ATGCGCGGTG CGTACCGTGA TGGCCGGTTT ACGCCATCCC TGACTCACAA GCTCGATTAC
GGCGATGCCC ATTTTTACGC ACCGCAGACG ATGCCGTTGC GCGATGGACG CCGGATCATG
TTCGGTTGGG TGATGGAGGG ACGGAGCGAG GCGGCGGTGC TGGCCGCCGG TTGGGCGGGG
GTGATGTCGT TGCCGCGTGA GGTGCAGGTA AGCAGCGATG GGCAGGTAGT GGCGTTACCA
ATTGCAGAAG TGACGCAATT GCGTGGTATG GAACGGCGAA TGTCGCCTGC CCGGATCATG
CCCGGTGCGC TACAGTGGAC ACCGATCTGT GGCGCGCATC TTGAGCTAGA GGTGGTATTG
CTGCCCCCGT CGCAAGGCAC GTGTAGTGTG TGGCTACGGG CCAGCCCCGA TGGGGCTGAA
GCGACTATTC TGCGCTACAA TCGTGCCACT GCTACTCTCA CCCTCGACCG TAGCCGTTCG
AGCCTGAGCA GTGATGTCTG GCACGACTCT CACCATGCCC CCTTGCCGTT GGCTCCCGAC
GAACCGCTTC GCCTCCGTAT CTTTCTCGAC GGCTCGCTGA TCGAAGTCTT TGCCAACGAC
CGCCGCTCAA TCACCAGTCG TATCTATCCC AGCCGGCCCG ATAGTGACGG GGTTGCTTTG
CAGGTCGAAG GCAACCCCGC CGAGCTGGTG ATGATGCGGG CGTGGGAAAT GGCCGATATT
TGGGCGTGA
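
The gene sequence above is laid out in space-separated blocks of ten bases, so it needs to be cleaned before its length or GC content can be compared with the Gene Length and GC content fields. The following is a minimal sketch of that cleanup, not the method used by the database; the function names `clean_sequence` and `gc_percent` are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Example with the first three blocks of the gene sequence above;
# paste the full block text to check the whole gene.
fragment = clean_sequence("ATGACACTAT TTCTGATGAG ATCGAAAGAG")
print(len(fragment), round(gc_percent(fragment), 1))
```

Running `len(clean_sequence(...))` on the full gene sequence should give a value comparable with the listed Gene Length, and `gc_percent` a value comparable with the listed GC content, assuming the database rounds to a whole percent.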
 
Protein sequence
MTLFLMRSKE QGMISDPHRP RYHFLPLANW MNDPNGLIQW GETFHLFYQY NPAGAYHRNI 
HWGHATSADL LYWQHQPIAL APTPGGPDAD GCWSGCAVND YGTPTLIYTG FRLPEEQTPC
LAVSRDGLLT WQKWPEPIIP APPADLDLLG FRDHTVWREN GRWAMLIGAG IRGQGGTVLL
YRSDDLRRWE YGGPLVIGDA GQFDPVWTGT LWECPDFFSL NGDHALICSV WDRCPYYTIA
MRGAYRDGRF TPSLTHKLDY GDAHFYAPQT MPLRDGRRIM FGWVMEGRSE AAVLAAGWAG
VMSLPREVQV SSDGQVVALP IAEVTQLRGM ERRMSPARIM PGALQWTPIC GAHLELEVVL
LPPSQGTCSV WLRASPDGAE ATILRYNRAT ATLTLDRSRS SLSSDVWHDS HHAPLPLAPD
EPLRLRIFLD GSLIEVFAND RRSITSRIYP SRPDSDGVAL QVEGNPAELV MMRAWEMADI
WA