Gene Cagg_2071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2071 
Symbol 
ID7269230 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2534561 
End bp2536168 
Gene Length1608 bp 
Protein Length535 aa 
Translation table11 
GC content57% 
IMG OID643566906 
Productalpha-L-arabinofuranosidase-like protein 
Protein accessionYP_002463395 
Protein GI219848962 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3534] Alpha-L-arabinofuranosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATATGCC GTCATCGCTC CCTCATAGTC TGGTTAGTCT TGTGGCTCAC CGCTTGTGGC 
GCACCGGTCA CCACACCCAC TGCTACGACC GTCCCCGCCG CTTCGACGCC AACATCGGTT
ATTGCTGAAC CATCAACGCT GCCTACCTCA ACGGTACTTC CCTCCGGCCC GGTGATTGTG
ATCGATTCGA CCCAGACCCG GCCTTTTCCA CGCCAACTCC TCGGGACAAA TGTTCCGGCG
TGGCTTAACC CTACGCGACT CGGTGATGAA ACATTTATCA GCCGCACTGC CGATCTGGGC
CTGAGCCTAC TCCGCATGCC CGGTGGTAGC TGGAGCAATG CGTATGCGTG GGCCGATTGC
GAAACCGGAG GCGAAGGTTG CTACTGGCCG TGGGCCGCCA AACCTTCAGA TTTTCTGCGC
TTCGCCCGTG CGGTGAATGC CGAGATTATC TGGACGGTCT CGATCAACAG CTCGGCCCAA
GAGGCAGCGG CGCTGGTCGC CTTCTTCAAC GGTGCAACCG ATGATGAGCG CCCGCTCGGC
GTTGATGCAC GCGGGCGCGA TTGGCTCACG GTTGGTCATT GGGCGCGTCT GCGGGCCGAA
ACCGGTAATT CCACACCTTT CCCGGTACGT TACTGGGAAA TCGGCAACGA GGTGTATGGT
GCCAAACGGG ATGTGGGGCC TAATTGTGCA GAATGGGGCT GGGAAGATGT TTGGACATGC
GATCCGGATG AGTACCTCCA CGGGGCAACT GTCAATGGAA TTGCCTACGA TGGCTATCTG
GCTTTCTACG ATGCGATGAA AGCCGTTGAT CCTTCGATCC AGATCGGGGC CGTGGGGGTA
GAAAAACCCG ATGAATGGAG TAATTGGGGG AACCGGGTAA TTGCCGGCGC CGGCGAGAAA
CTCGATTTTT ATGTTGTTCA CTATTACCCC TACTTCCAAC CCCCAGAAAA CCCAGCCGGC
GCATTGCAGC AACCTCAGCG CAGTTGGTCA ACGATCATGG CCGACCTCCA GGCCGCGTTT
AGACGCTACA GCGGGCGGCA AATACCGGTG GCCGTCACCG AATATAACCT TGTGGCCTTT
CAAGATGCCG ACAACGGCCA ATTGATGCGT CGGGCGGTAA ACATGCTCTT CATCGCCGAT
ACCATCGGCC AGATGGCAAC CAACGGTGTT ACCATCGCTA ATCAGTGGGA TCTGGCAAAT
GGCCGAGCGT ATAACGGTAC CGATTACGGG TTGCTCGATG CCGATACGTT TGAGCCAAAC
CCGGCTTACT ACGCTCTCCA ACTCTGGAGT CGGTTTGGGG ATGAATTGCT AACGACCCAA
ACACCGTTCG ATCCGGCGCA GACCCTGAGT GTCTATGCGG GTCGCCATAC CGATGGCACA
CTGACGCTGC TGGCAATTAA CAAAACCGCA CAACCGCAGA CGGCAACGAT CATCGTACCA
ACGGGTCCAT GGCGCGTTGC CACCACGGTA GTACAAGCCT CTGATCTACT GGCCGAATCT
GTCGCCCTCG ACATCCGTGA TAACGGCATA ACAAGCGATG AAAATGCTAA CTACGAACAC
ACCTTTGCGC CATATACGCT CACCCTCTTA ACGTTCACAG AACCGTAG
 
Protein sequence
MICRHRSLIV WLVLWLTACG APVTTPTATT VPAASTPTSV IAEPSTLPTS TVLPSGPVIV 
IDSTQTRPFP RQLLGTNVPA WLNPTRLGDE TFISRTADLG LSLLRMPGGS WSNAYAWADC
ETGGEGCYWP WAAKPSDFLR FARAVNAEII WTVSINSSAQ EAAALVAFFN GATDDERPLG
VDARGRDWLT VGHWARLRAE TGNSTPFPVR YWEIGNEVYG AKRDVGPNCA EWGWEDVWTC
DPDEYLHGAT VNGIAYDGYL AFYDAMKAVD PSIQIGAVGV EKPDEWSNWG NRVIAGAGEK
LDFYVVHYYP YFQPPENPAG ALQQPQRSWS TIMADLQAAF RRYSGRQIPV AVTEYNLVAF
QDADNGQLMR RAVNMLFIAD TIGQMATNGV TIANQWDLAN GRAYNGTDYG LLDADTFEPN
PAYYALQLWS RFGDELLTTQ TPFDPAQTLS VYAGRHTDGT LTLLAINKTA QPQTATIIVP
TGPWRVATTV VQASDLLAES VALDIRDNGI TSDENANYEH TFAPYTLTLL TFTEP