Gene Cagg_2274 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2274 
Symbol 
ID7266687 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2778107 
End bp2779951 
Gene Length1845 bp 
Protein Length614 aa 
Translation table11 
GC content61% 
IMG OID643567105 
Productalpha amylase catalytic region 
Protein accessionYP_002463590 
Protein GI219849157 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0525857 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAATCC GGTCTTGGGA GGCGAGCATT CACCACGATG GATCGCCTCG CTACGTGCAG 
TTTGGTGGGC GCGTGGGTGA TACCGCACGC TTGCGCTTGC GCATCGCCGC CGATGCTCCG
GTCGGTGGCG TGTTTGTACG TACTTGCCCC GATGGCGAAC AGCACTTCAC GCCAATGCGT
GATCTGGGTG TGCAGGGGGT GTGTCGTTGG TGGGAAGGCG AGTTGCCGAT CCGGATGCTG
CGTACCAACT ATCGCTTCCT GTTGCGAGCC GATGATGGCG TGTACTGGTA CAGCGCCGGT
GGCGTAACGC GCTATTACCC GACCGACGCC AATGATTTTG TCCTGTTGGC GAACTACCAT
GCCCCGACGT GGGTGCGTGA CGCGGTCTTT TACCAAATCT TTCCCGACCG CTTCGCCGAC
GGTGATCCCG GCAATAATGT CCGCGACGGT GAGTATCTGT ACCAGGGGCG TCCGGTTGTT
GCCCGGCGCT GGCACGAGTT GCCACAGCGG GCGACCGGTT CAATCGAGTT TTACGGTGGT
GATCTGCAAG GGATCGCCCA ACGGCTCGAT TATCTAACCG ATCTCGGCGT TTCAGCCCTT
TATCTCAATC CGATATTCCG CGCACCGTCG AACCACAAAT ACGACGTTGA AGACTACACG
CACATCGATC CGCATCTCGG TGGTGAAGCC GGTTTGCTAA CGCTGCGTCA GGCCCTCGAC
GAACGAACCA TGCGATTGGT GCTCGACATC GTACCAAACC ACTGCGGCGT TACGCATCCC
TGGTTTGTGG CCGCCCAGGC TGATCGACAC GCACCAACGG CCGAGTTCTT CACATTTCGT
CGTCACCCGG ACGAATATGA ATGCTGGCTG GGTGTGAAGA CGCTGCCCAA ACTCAATTAT
CGCAGCGTGC GCCTGCGCGA AGTGATGTAC GCCGGACCCG ACGCGATTAT GCGGCGCTGG
TTGCGCCCAC CCTACCGCAT TGATGGCTGG CGGATCGATG TCGCCAATAT GCTGGGCCGG
CTCGGTCCCG ACAACCTCGG CCACAAGATC GGGCGCGGCA TCCGGCGGGC GGTCAAGGCC
GAACAACCTA ACGCCTACCT GCTCGGCGAG CACTTCTTCG ATGGGACGCC CCATCTGCAA
GGGGAAGAGC TGGACGCTAC GATGAACTAT CAGGGCTTCA CCTTTCCCGT GTGGCGTTGG
TTAGCCGGCT TTGAATTCAA TCCACAGCGG CCCGAAGCCG ATCCGCGACC GATTGCAACT
GAAACAATGG CCGCTCAATG GACGGTCTTT CGGGCGGCAA TTCCGTGGCA GATCGCGACG
CAGCAGTTCA ATCTCCTCGG TAGCCACGAT ACCCCCCGGT TGCGCACCAT TGTAGGAGAT
GATCTGGCCC GTGTGCGCGT GGCAATGACA CTGCTCTTCA CCTATCCCGG TGTGCCGTGT
ATCTACTATG GCGACGAGAT TGGCCTGGCC GGCGGCGGCG ATCCCGATTG CCGGCGGACG
ATGCCGTGGG ATGAAGCAGA GTGGGATCAC GATCTGCGCG CGTTTGTGCG GCGGTTAGCG
CATTTGCGGC GTAGCGCGCC GGCGTTGCGT TGGGGCGGAT TCCAGCAGCT CTACGCCCAA
GGCGAGACGA TCGCCTTTCA GCGTGAAGCG CCGGAAGAAC GCCTGATTGT AGTCGCGCGC
CGCAGCGACG ATGGCTTGCG GGCATTGCCG GTGCGCCACG CCGGTCTGGC CGATGGCGTG
ACCTTGCGCG AACTATTCAC CAGCGCCGAG ACGGTCGCGC GCAATGGTAT GTTGGACATC
AGCAGTCTGC CGGCGACCGG CGCGCAGGTG TGGCGGGCAT TGTAG
 
Protein sequence
MTIRSWEASI HHDGSPRYVQ FGGRVGDTAR LRLRIAADAP VGGVFVRTCP DGEQHFTPMR 
DLGVQGVCRW WEGELPIRML RTNYRFLLRA DDGVYWYSAG GVTRYYPTDA NDFVLLANYH
APTWVRDAVF YQIFPDRFAD GDPGNNVRDG EYLYQGRPVV ARRWHELPQR ATGSIEFYGG
DLQGIAQRLD YLTDLGVSAL YLNPIFRAPS NHKYDVEDYT HIDPHLGGEA GLLTLRQALD
ERTMRLVLDI VPNHCGVTHP WFVAAQADRH APTAEFFTFR RHPDEYECWL GVKTLPKLNY
RSVRLREVMY AGPDAIMRRW LRPPYRIDGW RIDVANMLGR LGPDNLGHKI GRGIRRAVKA
EQPNAYLLGE HFFDGTPHLQ GEELDATMNY QGFTFPVWRW LAGFEFNPQR PEADPRPIAT
ETMAAQWTVF RAAIPWQIAT QQFNLLGSHD TPRLRTIVGD DLARVRVAMT LLFTYPGVPC
IYYGDEIGLA GGGDPDCRRT MPWDEAEWDH DLRAFVRRLA HLRRSAPALR WGGFQQLYAQ
GETIAFQREA PEERLIVVAR RSDDGLRALP VRHAGLADGV TLRELFTSAE TVARNGMLDI
SSLPATGAQV WRAL