Gene Cagg_0717 details


Gene Information

Locus tag: Cagg_0717
Symbol:
ID: 7266969
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 888190
End bp: 889794
Gene Length: 1605 bp
Protein Length: 534 aa
Translation table: 11
GC content: 58%
IMG OID: 643565568
Product: glycosyl hydrolase 53 protein
Protein accession: YP_002462077
Protein GI: 219847644
COG category:
COG ID:
TIGRFAM ID:
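
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the reported gene length includes the stop codon. Both assumptions agree with the numbers in this record, but the record itself does not state them.

```python
# Values copied from the Gene Information fields above.
START_BP = 888190
END_BP = 889794
GENE_LENGTH_BP = 1605
PROTEIN_LENGTH_AA = 534


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the length
    includes one terminal stop codon that is not translated."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# 889794 - 888190 + 1 = 1605 bp; 1605 / 3 - 1 = 534 aa
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```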


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGAGCAC TACTCCATCT GACATTCCTG CTCACCGCTG CTGTGATGTT CGTAGCCGAT 
GCGCCGACCC TACACCGGCC CATCACCCAA CCAATTGCGA TTCGTACCCC AACCGGGCCA
CTGGCTATCG GAATCAACAG CCATCTGGCA ACTCGTTATC CCGATGCTGC GACAATGGCA
ATCCCGGCAG CAATCGTCGC CGATCTCGGC GTGCAGTGGG TGCGTGAAGA CCTGCATTGG
CATCGCATCC AACCCCAACC CGATGTCTGG GATTGGGTAT TTACCGATGC CGCTCTCTAT
GCGCTCAGCC GTCAGGATGT ACGGATTCTG GGCGTCTTGG GACCGTCCGT CGGCTGGGCC
ACCGCCGACC CGACCGACCG GCCCAATCTC ATTTCGTTTG CCCCACCCGA TGAAGATGCG
TTTGTCACGT ATGCCGGTGC CGTCGTCCAA CGCTACAAAC ACCTCATTAA ACACTGGCAA
ATCTGGAACG AACCCGACCA AACCCTTTTC TGGCGGCCAT CACCCGATCC GGCACGCTAT
ACCCGCCTCC TGATTGCTAC TGCCCAAACG ATTCGCACGA TTGATCCGAC GGCCACCATC
GTTCTTGGCG GCATTAACCC TTTCGACACC GGTTTTCTGC GTGCGATTGC TGCTTATGGT
GGATGGAACG CCTTTGATGT GATCGCCATC CACCCCTACG TCGATCCACT CAATCCAGAA
GAGGGGAACC TCATTGCTGC TGCGGATGGT GTGCGTGCCG TGGCTGCGCG ATACGGCATG
AAACCGATCT GGGCCACCGA AGTGGGGTGG GCGAGCGGCC CCGGCGACCG CGATGCGCTT
GGGCTGACGA ATGCTACGCT TCAAGCTGCG TACCTCTCCC GTACCTACTG TGCGCTCTGG
TACGGCGGGG TGAGTGCTGT CTTCTGGTAC ATGCTCAAAG ACGATCCGCA CAATCCGTAT
GGGCTGTTTG CGTATGGGAG CGGACGGGCC GATTTTAGCA CACCCAAACC GGCAGTAACC
GCGATGCGTG AACTGCCCGA CACACTTGCC GCGTGTCACC TAGAGCCACC CACTACTACC
ATTCCACTCC TTACCGGCAG CCAACCTGTG CAATGGCGAC GGCCCAGCCA GCCCAACGGT
TCACTCCGGT TGATCGAACA CGATCGCGTC TTCCACATCA GCTATCGCTT TACAACCCGT
ATGAATGATT ACGTGGCTTT TGCGCTGAGC AACCCTATCC CCCTACCCGA CGATACAACC
GCCATCACCG TTCAGCTTTT CGGTGATGGC AATGGGCACC GACTACGGCT CTGGTTACGC
GACAGCGAAG GCGAAACCTT CTCGCTGACT GCCGGTATTA TCGGCCCACC TGCCTGGCAA
ACCATCAACA CTCCACTGAG CCGTCGACCG ATGCAGTATG AACTGATTGC CGGGAATGGC
AACAGGCAAC CTGACGCTCC ACTCGCGTTA GCGGCCATCG TTATCGATGA CGAAGATGAT
ACCTGGACCG GGATGGGTGA AGTGCTGATC GAGCGCATAG CCGCGGTACG CACCACACTT
GACGCAGCCT CACTCCCGAC GTATACTGTC ACTCAACGGC GATGA
 
Protein sequence
MRALLHLTFL LTAAVMFVAD APTLHRPITQ PIAIRTPTGP LAIGINSHLA TRYPDAATMA 
IPAAIVADLG VQWVREDLHW HRIQPQPDVW DWVFTDAALY ALSRQDVRIL GVLGPSVGWA
TADPTDRPNL ISFAPPDEDA FVTYAGAVVQ RYKHLIKHWQ IWNEPDQTLF WRPSPDPARY
TRLLIATAQT IRTIDPTATI VLGGINPFDT GFLRAIAAYG GWNAFDVIAI HPYVDPLNPE
EGNLIAAADG VRAVAARYGM KPIWATEVGW ASGPGDRDAL GLTNATLQAA YLSRTYCALW
YGGVSAVFWY MLKDDPHNPY GLFAYGSGRA DFSTPKPAVT AMRELPDTLA ACHLEPPTTT
IPLLTGSQPV QWRRPSQPNG SLRLIEHDRV FHISYRFTTR MNDYVAFALS NPIPLPDDTT
AITVQLFGDG NGHRLRLWLR DSEGETFSLT AGIIGPPAWQ TINTPLSRRP MQYELIAGNG
NRQPDAPLAL AAIVIDDEDD TWTGMGEVLI ERIAAVRTTL DAASLPTYTV TQRR
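
The gene and protein sequences above can be cross-checked against each other and against the reported GC content. The following is a minimal sketch, not the method used to produce this record: `gc_content` and `translate` are hypothetical helper names, and the codon table is the standard NCBI amino-acid assignment, which translation table 11 shares for all codons (table 11 differs from the standard table only in its permitted start codons). Whitespace in the 10-base display blocks is stripped before processing.

```python
from itertools import product

# Codon order follows the NCBI convention: bases in T, C, A, G order,
# first codon position varying slowest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the
    first stop codon (which is not included in the result)."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGAGAGCAC TACTCCATCT GACATTCCTG"))  # MRALLHLTFL
# Last 15 bases end in the stop codon TGA.
print(translate("ACTCAACGGC GATGA"))  # TQRR
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_content` on the same input can be compared with the 58% reported in the Gene Information fields.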