Gene Cfla_2039 details


Gene Information

Locus tag: Cfla_2039
Symbol:
ID: 9145935
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 2275540
End bp: 2277285
Gene Length: 1746 bp
Protein Length: 581 aa
Translation table: 11
GC content: 78%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003637133
Protein GI: 296129883
COG category:
COG ID:
TIGRFAM ID:
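
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; both assumptions are consistent with the listed values but are not stated by the record itself. The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information fields above.
length_bp = gene_length(2275540, 2277285)
print(length_bp, protein_length(length_bp))  # prints: 1746 581
```

Both results match the Gene Length and Protein Length fields of the record.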


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.566112
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.522751
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCACGC GGACGACGGT CGTGTGGGTG CCCGACTGGC CCGTGGTCGC GGCGATGACC 
GCCGACGAGG TCCCGGTCGA CGTGCCTGCC GCGGTGCACG ACGGGCGTCG GATCACCGCG
GTGTCCGCGC TCGCGCGTGC CGACGGGGTG CGCCGCGGGA TGCGGCGGCG GCAGGCGCAG
GGCGTGTGCC CCGAGCTCGT GCTGCTGCCG GTCGACGACG CGCGCGACGT GCGGCTGTTC
GAGCCCGTCG CCGCGGCGAC CGAGACGGTC GTCGCCGGTG TCGAGGTCGC CCGGCCGGGC
ATGCTCCTGC TGCCTGCCGG CGGAGCCGCG CGCTACCACG GGTCGGAGGA GGCGCTCGCC
GAGCGCGTCG TCGACGCGGT CGCGCGCGCC ACCGGTCACG AGTGCGGCGT GGGCACGGCC
GACGGGCTGC TCGCCGCCGT GCTGGCCGCG CGCACCGGCG CCGTCGTCGA GCCCGGGCTG
TCGCCCGTCT TCCTCGCGCC GCACGGTGTC ACCGAGCTCG TGCACGCCAC CACCACGCCC
GAGCAGGCCG CCGAGGTCAT GCGCCTGGTC GACCTGCTGC ACCGCCTGGG GCTGCGCACG
CTCGGGGCGT TCGCGGCGCT GCCGGCCCCC GACGTGCACG CGCGGTTCGG GCGGCTGGGC
TCGTGGGCGC GCACGCTCGC GCGCGGCCTC GACGAGCGGC CGCCCGCGCG TCGTCGTCCC
GAGGCCGACC TCGAGGTGGA CGTCGAGCTC GACCCGCCCG TCGACCGCGT GGACACCGCG
ACGTTCGCCG GACGGCGCCT GGCCGAGGAG CTGCACGCCG AGCTCGTCGC GCGCTCCGTG
ACGTGCGGGC GCCTGCAGAT CACCGCGCGG ACCGACGACG GGACCGAGCT GGTGCGCACG
TGGCGTACCG ACCTGGGCGG CTGGGGCGGG CTCGCCGCCG CGCGCATCAC CGACCGGATC
CGCTGGCAGC TCGACGGGTG GCTCACCGCG GCGGCCGTCG CCACGGCTCG GGACCGGCGC
CGTGAGGTGC GCGAGCGTGA ACGCGGGGCG CGGGGACGCG GCGAGGGCGG GGGACCGGCG
CACGGCGAGC ACGGACCGGT GCGCGGGACG CACGGGGCGG TCCTCGATGT CCGGGTGCCG
GACGACGAGG ACGACACGGC GCCCGTCGCG CTGGTGCGTC TGACCCTCAC GGCGCTCGAC
GTCGCGCCCG CGGGGTCCGA GGCGACGCAG CTGTGGGGCG GACCGTCGGG TGGGGACCTG
CGGGCGCACC GCGCGCTCGA GCGCGCGCAG AGCATCGTCG GCGGGCCGGG CGTGCTCACC
GCGACCCTGC AGGGCGGGCG TGACGTGCGT GACCAGGTGC ACGTGCGGCC GTGGGGCGAG
CAGAGCGACC CGCCGCGCCC GCTCGACCGT CCCTGGCCGG GCCGGCTGCC GGACCCGGCA
CCCGCGACCG TGCTGGTCGA CCCCGTGCAC GTCGAGGTGC GGGACGTGCA CGGCTCGCCC
GTGCGCGTCG ACCGGCGGGG GCGGCTGAGC GGACCACCCG GCAGCGTGCT CGCCGGGAGC
GGCCCCGACC GGGTGCGCGT CGTCGCCGGG TGGGCGGGGC CGTGGCTGCT GACCGACCGC
TGGTGGACGC ACCCCGGCGC CGGGCCGCAG GTGCGCGCCC ACCTGCAGGT CGCGTTCGAC
GACGGCGGCG CGGTGCTGCT CACGCACACC GACGGGGCGT GGACGTACGA GGCGGACTAT
GACTGA
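
A sequence printed in blocks of ten bases, as above, needs its whitespace removed before it can be analysed. The sketch below shows one way to do that, to recompute a GC fraction, and to check that a sequence has the shape of a complete coding sequence. It assumes the stop codons of translation table 11 (TAA, TAG, TGA); the helper names are hypothetical, and the demo input is a toy sequence, not the gene above.

```python
# Stop codons of NCBI translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    bases = clean(seq)
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop codon."""
    bases = clean(seq)
    return len(bases) >= 6 and len(bases) % 3 == 0 and bases[-3:] in STOP_CODONS


# Toy input in the same block-grouped layout (hypothetical, for illustration only).
demo = "ATGGCC GGCTGA"
print(f"{gc_content(demo):.0%}", looks_like_complete_cds(demo))
```

Pasting the full gene sequence into `demo` would allow the GC content field of the record to be compared against a recomputed value.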
 
Protein sequence
MSTRTTVVWV PDWPVVAAMT ADEVPVDVPA AVHDGRRITA VSALARADGV RRGMRRRQAQ 
GVCPELVLLP VDDARDVRLF EPVAAATETV VAGVEVARPG MLLLPAGGAA RYHGSEEALA
ERVVDAVARA TGHECGVGTA DGLLAAVLAA RTGAVVEPGL SPVFLAPHGV TELVHATTTP
EQAAEVMRLV DLLHRLGLRT LGAFAALPAP DVHARFGRLG SWARTLARGL DERPPARRRP
EADLEVDVEL DPPVDRVDTA TFAGRRLAEE LHAELVARSV TCGRLQITAR TDDGTELVRT
WRTDLGGWGG LAAARITDRI RWQLDGWLTA AAVATARDRR REVRERERGA RGRGEGGGPA
HGEHGPVRGT HGAVLDVRVP DDEDDTAPVA LVRLTLTALD VAPAGSEATQ LWGGPSGGDL
RAHRALERAQ SIVGGPGVLT ATLQGGRDVR DQVHVRPWGE QSDPPRPLDR PWPGRLPDPA
PATVLVDPVH VEVRDVHGSP VRVDRRGRLS GPPGSVLAGS GPDRVRVVAG WAGPWLLTDR
WWTHPGAGPQ VRAHLQVAFD DGGAVLLTHT DGAWTYEADY D