Gene Cagg_2150 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2150 
Symbol 
ID7267658 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2640629 
End bp2641642 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content52% 
IMG OID643566982 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_002463470 
Protein GI219849037 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000169098 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATCGATC TGCAAACCCT ATCCGGCGCA CGAGTCTTGA TTACCGGTGG GCTTGGCTTT 
ATTGGCTCCA ATCTGGCGCA CAGGCTGGTT GAACTTGGTG CACAGGTCAC TCTCGTCGAC
TCGTTGATCC CTGAATACGG TGGTAATCTC TACAACATTG CCGGTATCGA AGATCGAGTG
CGCGTCAATA TTGCCGACGT GCGCGATGAG TATTCCATGA ACTATCTGGT ACAAGGGCAC
GATATCCTGT TTAACCTCGC CGGACAGACC AGCCATCTCG ACTCGATGCG TAACCCCTAC
ACCGACCTCG ATATTAACTG TCGTGCCCAA TTATCAATCC TCGAAGCCTG TCGTAAGCAC
AATCCGCGAA TCACGGTAGT CTACGCTTCA ACCCGCCAAA TTTATGGCAA GCCCGATTAT
CTGCCGGTCG ATGAACGCCA TCTGTTGCAT CCGGTTGATG TCAACGGTAT CAACAAAATG
GCCGGTGAAT GGTACCACAT TCTCTACAAC AATGTGTACG GCATTCGGGC ATGTGCTTTA
CGCCTAACGA ACACCTACGG CCCACGCATG CGGGTGAAGG ACGCGCGCCA AACCTTTCTC
GGCGTCTGGA TCAGAAATGT GATCGAGGGC AAACCGATCC AAGTGTGGGG TGACGGCAAA
CAACTGCGTG ACTTCACCTA TATCGACGAT TGTGTGGATG CACTGTTGTT AGCAGCTCTG
CATCCGGCTG CAACCGGACA AATTTTTAAT CTGGGCGGTT TAGAGGTGAT TAATCTGCGT
GATCTGGCAG CCTTAACGGT AGAAGTGGCC GGTGGCGGCA GTTTCGAGAT TATTCCCTAC
CCACCCGACC GTAAGCCGAT CGACATTGGT GATTACTACG CCGATGATCG TCGTATTCGG
CAGATGTTGG GCTGGCAACC ACGTATCGAT CTCCGTACCG GCTTAGCCCG CACGATTGCC
TTCTACCGCG AACATCACCA ACACTATTGG GATTCGGTCG TGGAAGGAGT TTAA
 
Protein sequence
MIDLQTLSGA RVLITGGLGF IGSNLAHRLV ELGAQVTLVD SLIPEYGGNL YNIAGIEDRV 
RVNIADVRDE YSMNYLVQGH DILFNLAGQT SHLDSMRNPY TDLDINCRAQ LSILEACRKH
NPRITVVYAS TRQIYGKPDY LPVDERHLLH PVDVNGINKM AGEWYHILYN NVYGIRACAL
RLTNTYGPRM RVKDARQTFL GVWIRNVIEG KPIQVWGDGK QLRDFTYIDD CVDALLLAAL
HPAATGQIFN LGGLEVINLR DLAALTVEVA GGGSFEIIPY PPDRKPIDIG DYYADDRRIR
QMLGWQPRID LRTGLARTIA FYREHHQHYW DSVVEGV