Gene Cagg_0415 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0415 
Symbol 
ID7266583 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp514360 
End bp515610 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content56% 
IMG OID643565282 
ProductNHL repeat-containing protein 
Protein accessionYP_002461796 
Protein GI219847363 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2133] Glucose/sorbosone dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.96309 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCGGA TCCCGACGAT CCTCATGGTG TTCATAGCAA TCGCACTCCT CACCATTGCC 
TGGTCTCAGT CTGCACAGAA TCTTTTTTTA CCGCTGATCA GCCGTCTACC GGTTATTCCA
CCGGCAGACG AAGTGGCAAA CCGCCTCCAA GTGGCCGAAG GTTATGCCGT CCGTTTATAT
GTAAGTGGGC TTAATCGACC GCGCCTGATG GCAATAGGAC CTGATGGCGC ACTGTATGTT
GCCGAGCGGG GTACGAACCG GATAGTACGA CTGGTTGATG GACATAGTGA TGGATATGCC
GACACGCCAC AACCGGTTGC GATCAATCTG ACAGGGGTAC ATAGTCTCGA ATGGTACGCG
GGTGATTTGT ACGCTGCGGG TAATGCAACG GTTTGGCGGT TACGTGATGT GAACGGTGAT
AGACAATGGA GTACCGATGA AATCGTCGCA TTAGTGATGG ATTTACCCAG CGATGGGGGA
CACTCAACCC GCACTGCACG GATCGGGCCG GATGGGATGC TCTACGTATC GGTCGGCTCA
AAGTGTAATA TCACCGTCAA CTGTAGCGAA GGCGACCCTC GCCGGGCCGC TATCCTTCGT
TATACACTCG ATGGAGATAT TCCCGCCGAC AATCCCTTCG CCGATGACCC TGATCCGCGT
CGCCGCGCAG TGTGGGCTGA AGGGTTGCGC AACAGTGTTG ATTTTATCTT CTTGCCAGAC
GGTCGGCTAT GGGCTACCCA CAACGGTAGC GATGGTTTGG GCAATGATCT GCCACCGGAA
GAGGTGGTAA TCGAGGTTGA ACGTGGCAAG CACTACGGCT GGCCCTACTG CTACACCGCC
GAGCTGGGGC CGGTGCCGCG CAACACACAA GAGGTACGTG ACACCCGGAT TCCGCTCGAT
ACAACGTTTA CCGGCTGCGA ACAAGCAACG CCGGCACTTT TCACCGATGT AGCTCACTCT
GCGCCACTGG GAATTGATCG GTTGGCGAAT GGCGATGTGT TGATCGCCTA CCATGGCTCG
TGGAATGCTG ATGAAACACC GCGCGACTGC CGCGTGCAAC GGATTCGCGT CACCGATGGA
CAGCCAGTCT CGGCAGAGCC GTTCTTGACC GGCTTCCGCA ATAATCCCCA ACAAGAATGT
GGCGGTGCAT GGGGCCGACC GGCAGGAGTC ACGATTGCAC CAGACGGATC GATCTTTGTT
TCCGATGATA AAAACGGGAA TATTTATCGG ATCGTACCGG TCGGTAGTTA G
 
Protein sequence
MKRIPTILMV FIAIALLTIA WSQSAQNLFL PLISRLPVIP PADEVANRLQ VAEGYAVRLY 
VSGLNRPRLM AIGPDGALYV AERGTNRIVR LVDGHSDGYA DTPQPVAINL TGVHSLEWYA
GDLYAAGNAT VWRLRDVNGD RQWSTDEIVA LVMDLPSDGG HSTRTARIGP DGMLYVSVGS
KCNITVNCSE GDPRRAAILR YTLDGDIPAD NPFADDPDPR RRAVWAEGLR NSVDFIFLPD
GRLWATHNGS DGLGNDLPPE EVVIEVERGK HYGWPYCYTA ELGPVPRNTQ EVRDTRIPLD
TTFTGCEQAT PALFTDVAHS APLGIDRLAN GDVLIAYHGS WNADETPRDC RVQRIRVTDG
QPVSAEPFLT GFRNNPQQEC GGAWGRPAGV TIAPDGSIFV SDDKNGNIYR IVPVGS