Gene Cagg_2089 details


Gene Information

Locus tag: Cagg_2089
Symbol:
ID: 7267596
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 2553834
End bp: 2555873
Gene Length: 2040 bp
Protein Length: 679 aa
Translation table: 11
GC content: 56%
IMG OID: 643566923
Product: alpha amylase catalytic region
Protein accession: YP_002463412
Protein GI: 219848979
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
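
The coordinates and lengths listed above are mutually consistent, which can be checked with a short Python sketch. It assumes the start and end positions are 1-based and inclusive (an assumption, although the listed 2040 bp gene length agrees with it) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2553834
end_bp = 2555873

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases.
codon_count, leftover_bases = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and adds no amino acid.
protein_length = codon_count - 1

print(gene_length, "bp")      # 2040 bp
print(protein_length, "aa")   # 679 aa
```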


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGTTA AGGCGCCACG CGCTCGCTCG CGCCGCACGG CGACCCCGTC TGTCCCCGAC 
CTGGCTGCCG TTGGCGAGGG TCGGCGGCGG GTAATCATCG AAGCGGTTGA ACCGATCATT
GATGGCGGAC GTTATCCGGT GAAGCGGATC GTCGGCGACA CCATCACTGT GCGCTGTGAT
CTGTTTGCCG ACGGTCATGA TGAGCTGGCA GCAGTTGTGC GCTACCGTCC ACTTGGCGCT
AAGGCATGGC ATGAAGCAGC GCTACGCCAC TTAGTCAATG ATCGTTGGGA AGGACAATTT
CCGCTCACGA GCATCGGTCG TCACGAGTTT CAGGTTGTGG CATGGATCGA CCGCTTTGCG
ACGTGGGTGC ATCAGCTCGA AAAGCGGTTA GCAGCCGGTC AAGATGTGCA GGTTGATTTG
CAGATTGGGG CCGCGCTGGT CGGTGAAGCA GCACGGAATG CCGATGGCGC TGACGCGGCC
GTCCTGCATG TTGCCGAGGC GGCACTGATT GCCGGCGATG TCGATACGGC CTTTGCCGGC
GAGATTTATG CGCTGATGGC CATCCATCTC CCACGCCGTT TTGCCACCGA ATCGGAGATC
TACCCGATCA CGGTCGAACG GGAGCGCGCA CGGTTTTCGG CGTGGTACGA ACTCTTCCCC
CGCTCGACGG CGACCGAGCC GGGACGACAC GGTACCTTCC GTGATGTGAT CAAGCGGCTC
CCCTACGTTG CCGAACTCGG ATTTGATGTG CTCTACTTGC CACCGATCCA TCCGATTGGG
CACACCTTCC GCAAAGGCAA GAATAATAGC CTCACCGCCG GCGCCGACGA TCCGGGCAGC
CCGTGGGCGA TTGGCAACGC CGATGGCGGA CATATGAGCG TTCACCCTCA GTTGGGTACG
CTCGACGATT TTCGTGCGTT GGTGGATGAA GCCCGTAAGT ATGGGATTGA GGTCGCCCTC
GATATTGCGT TTCAATGCTC ACCCGACCAT CCATACGTGC GTGAACATCC AGAATGGTTC
CGCGCACGCC CCGACGGTAC CATTCAGTAT GCCGAGAATC CACCCAAGAA GTACCAAGAT
ATTTATCCGT TCGATTTTGA GACGGATGCG TGGCAGGAGT TGTGGCAAGA ACTGCGTCAA
ATTTTCGTCT TCTGGATCGA GCAGGGAGTA CGCATTTTTC GCGTTGATAA TCCGCACACC
AAAAACTTTC GATTTTGGGA GTGGTGCCTC AACTCGCTGA AAGCTGAATA TCCTGATCTG
ATCTTTCTCT CGGAAGCGTT TACACGCCCG AAGGTGATGT ATCATCTGGC GAAAATTGGT
TTTACCCAGA GTTATACGTA CTATACATGG CGTACCACCA AGGCCGAAAT CACGCAGTAC
ATGCGTGAAC TGACGACGCC ACCGGTCAGT GATCTGTTCA TTCCCAACTT TTGGCCTAAT
ACACCCGATA TCCTTACCCC TCAGTTCTAC CGAGGGCAAC GCGCGGTATT CATCACCCGC
GCTGCAATGG CAGCTACGCT CACGGCCAGT TGGGGCATGT ACGGCCCGGC GTATGAGTTG
ATGGAACACA CGCCGGTACC CGGCCGTGAA GAGTACATCG ACAACGAGAA GTACGAAATT
CGGTACTGGA ATCTTGATGC GCCGCACAGT CTGCGTGGCT TCATCGCGCA ACTCAATCGA
ATTCGGCGCA ACCACCCGGC CCTCCACCGC AACGACACCC TACGCTTTCA TCGCGTTGAT
ATTGATTTTC ACGAGCACGA GTGGTTGCTG GCCTACAGTA AGACCTCGCT CGACGGGCAA
GATATTATTC TGGTAGTGGT TAATCTCGAC CCGGATCACA CCCATCGAGG TTGGGTACAG
GTACCGGTAG CCGATTGGAA CCTTACCAAC ATCTATCAGG CCCATGACCT GTTAACCGAC
GCTCGCTATC AGTGGTCGGG CGAATTTAAC TATGTCGAAC TTAGTCCGGC AGCTCCGGTG
CATATTTTCC GCATTCGCCG CCTCCGCCGC GACGAGCGAG GGTTTGAGTA CTTTGCCTGA
 
Protein sequence
MTVKAPRARS RRTATPSVPD LAAVGEGRRR VIIEAVEPII DGGRYPVKRI VGDTITVRCD 
LFADGHDELA AVVRYRPLGA KAWHEAALRH LVNDRWEGQF PLTSIGRHEF QVVAWIDRFA
TWVHQLEKRL AAGQDVQVDL QIGAALVGEA ARNADGADAA VLHVAEAALI AGDVDTAFAG
EIYALMAIHL PRRFATESEI YPITVERERA RFSAWYELFP RSTATEPGRH GTFRDVIKRL
PYVAELGFDV LYLPPIHPIG HTFRKGKNNS LTAGADDPGS PWAIGNADGG HMSVHPQLGT
LDDFRALVDE ARKYGIEVAL DIAFQCSPDH PYVREHPEWF RARPDGTIQY AENPPKKYQD
IYPFDFETDA WQELWQELRQ IFVFWIEQGV RIFRVDNPHT KNFRFWEWCL NSLKAEYPDL
IFLSEAFTRP KVMYHLAKIG FTQSYTYYTW RTTKAEITQY MRELTTPPVS DLFIPNFWPN
TPDILTPQFY RGQRAVFITR AAMAATLTAS WGMYGPAYEL MEHTPVPGRE EYIDNEKYEI
RYWNLDAPHS LRGFIAQLNR IRRNHPALHR NDTLRFHRVD IDFHEHEWLL AYSKTSLDGQ
DIILVVVNLD PDHTHRGWVQ VPVADWNLTN IYQAHDLLTD ARYQWSGEFN YVELSPAAPV
HIFRIRRLRR DERGFEYFA
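
As a cross-check of the two sequences above, the following Python sketch translates a gene sequence and computes its GC content. The names clean, translate and gc_content are illustrative and do not belong to any database tool. The codon table is built from the NCBI amino acid string for translation table 11, which assigns codons the same way as the standard code. Reading the first codon as methionine is a simplification that covers alternative start codons. The sketch assumes the sequence is non-empty and contains only A, C, G and T.

```python
# Amino acids for translation table 11 in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding sequence up to, but not including, the first stop."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        # Simplification: the start codon is always read as methionine.
        protein.append("M" if pos == 0 else amino_acid)
    return "".join(protein)


def gc_content(seq):
    """Return the percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence shown above.
first_line = "ATGACCGTTA AGGCGCCACG CGCTCGCTCG CGCCGCACGG CGACCCCGTC TGTCCCCGAC"
print(translate(first_line))  # MTVKAPRARSRRTATPSVPD
```

Passing the full gene sequence to translate and gc_content allows the result to be compared with the protein sequence and the GC content listed in the record.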