Gene Cfla_0618 details

Gene Information

Locus tag: Cfla_0618
Symbol:
ID: 9144488
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 665315
End bp: 667051
Gene Length: 1737 bp
Protein Length: 578 aa
Translation table: 11
GC content: 58%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003635729
Protein GI: 296128479
COG category:
COG ID:
TIGRFAM ID:
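
The coordinates and lengths listed above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of this record or of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(665315, 667051)
print(length)                     # 1737
print(protein_length_aa(length))  # 578
```

Both values agree with the Gene Length and Protein Length fields of this record.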


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.508373
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.13002
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGAGCGC GCGCAGTCGT CCGGAGGCTA ACGATCAATG ACGGCACTAC GGTTGACGTA 
CCTGAGTCAG GCGTTGTCCT GATTGTCGGA CCAAACAACA CAGGCAAGAG CCAGGCACTC
AGGGACGTCA TCAAACTCAT GACGTCGTCC GGCGAACCAG GGATCGTGAT TCGAGAGGCC
GAGATAGAGC ACTTTGGTTC CGAAGACGAT CTCATTGAAA CGTTTGCGTC CGACCGAGCT
ATTCTTAGGA CGGCGACGGG AGCCGATCAA GCGCACCTAG GAGTTCACGG AGTCCAAGCG
ATTTCCTCTA TCCGCCAGTG GTGGTCGTCC CCGCACGCCC GTCACCTGGT CGGCGGCTAT
TTCGCGATTC ATGCCGACAC GGAGAGCCGC CTAGAAGCGA GCAAACCGGC GCCTTCGATG
GATCTATATG AAAATTCGCC CTCGCATCCA CTTCATCACG TGTATGCAAA TCCTGAGCTA
GAGACGCGCC TAAACGACAT TAGCCGTCGA GCGTTCAATT CAGGGTTAAT TCTGGATGCG
TGGTCCGGCG GCAATCAATG GGCGTTTCGC GTGGGAAATA TCGACCCTCC AGACTCGCCC
CGGCCATCGG TCGCTTACCT CGACGAACTC CGAAAGGCTC CCTTGCTCCA TCAAGTTGGA
GATGGAGTCC GGAGCATGCT CGGGTTGCTG CTCCGCTTTT ACACCGGCCA CCAGAATATT
TCGTTGATTG ACGAGCCCGA GGCCTTCTTG CACCCACCTC AGGCAAGGTA TATCGCGCGG
CTTCTCGCCG ATGAGGCCGC AACCACCGAG AGATCCATCC TCGTCGGCAC GCACAGTACG
GAAATCGTGC ATGGGGTGCT GGAGAGCTCT GCGTCAGCGA CGCTTGTCAG ACTCTCGCGG
AATCGAACCA TCAACAACGC GGCCGTTCTC GACAACGATG CAGTTCGAAA ACTCTGGTCA
GACCCATTGC TGCGATACTC CAACCTGTTG GATGGCCTTT TCACCGACGC AGTAATAGTC
TGCGAAGCAG ATGCAGATTG CAAGTATTTC GCGGCCGTCA GGGACACTTT TGAAGACGAA
GCGGTGGAGT CACGACGGCC CGACATCCTA TTCACCAGTT GCGGGGGTAA GCACAAAATG
CATGCTGCCG TGGAGGCGCT TGTTGCCGCG AGCGTCCCCG TAGCGGTAAT TTGCGACTTT
GACACCCTCA ACGAGTGGGC GACGCTACGT CGACTATTTG TGTCGGCGGG CGGCGATCCA
GGTCTTATCG AGACCGACTG GAAGATTCTG AATGCTGCAT TGACTTCAGG TGACCGAAAC
CCAAGCAAGA TGGGTGTCAA GGAGTCCTTA GATCGATCTT TCGATGCAAT CGAAGAACCC
GAACTCACGC GAAAGAACAT CGAGTCCCTA CGCCGCGTTC TTCGAATTGA GAACGGCTGG
GATCGCGTGA AGAACTCCGG AAAATCCGCG GTGCCAGCGG GCGACCCTTA CCGCGCATGC
GAGCGCATTA TCGCAGCACT CGCTGACCGG CGTATACACC TTGTGCCCGT CGGCGAAATG
GAAGACCTCG TTCCGGCGGT CGGTGGCCAC GGCGCGGCGT GGGTAGCAGA GGTTCTCGAG
CAAGGGTTGC ACAACTCGCC CGACAGCGAC GGCGCCCGAG TGCTAATGCG CGCAGTCCTC
GACTCGCTCG ATCGTGGCGA CGCCGACGCC GTTGTCGAAG AGGGGGCCGA TGCTTGA
 
Protein sequence
MRARAVVRRL TINDGTTVDV PESGVVLIVG PNNTGKSQAL RDVIKLMTSS GEPGIVIREA 
EIEHFGSEDD LIETFASDRA ILRTATGADQ AHLGVHGVQA ISSIRQWWSS PHARHLVGGY
FAIHADTESR LEASKPAPSM DLYENSPSHP LHHVYANPEL ETRLNDISRR AFNSGLILDA
WSGGNQWAFR VGNIDPPDSP RPSVAYLDEL RKAPLLHQVG DGVRSMLGLL LRFYTGHQNI
SLIDEPEAFL HPPQARYIAR LLADEAATTE RSILVGTHST EIVHGVLESS ASATLVRLSR
NRTINNAAVL DNDAVRKLWS DPLLRYSNLL DGLFTDAVIV CEADADCKYF AAVRDTFEDE
AVESRRPDIL FTSCGGKHKM HAAVEALVAA SVPVAVICDF DTLNEWATLR RLFVSAGGDP
GLIETDWKIL NAALTSGDRN PSKMGVKESL DRSFDAIEEP ELTRKNIESL RRVLRIENGW
DRVKNSGKSA VPAGDPYRAC ERIIAALADR RIHLVPVGEM EDLVPAVGGH GAAWVAEVLE
QGLHNSPDSD GARVLMRAVL DSLDRGDADA VVEEGADA