Gene Cfla_0804 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_0804 
Symbol 
ID9144676 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp869462 
End bp871351 
Gene Length1890 bp 
Protein Length629 aa 
Translation table11 
GC content64% 
IMG OID 
Productprotein of unknown function DUF87 
Protein accessionYP_003635913 
Protein GI296128663 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCTCGCT CAACGGATCG CCCAAGCGAG CGACCCGCCT GGATCCGGCG CGTCCGGGTC 
TTCCAGGCGC TGGCTGCAAC CGCAATCCTG CTCGTCGCTG CCTCGAACGT GTCCTCCGAA
CCACTGGCCC AGTGGTGGTT CTGGGTCGCT GCGACTACCG CGCTCACACT CGCCCTTGTC
GAGCCGTACT ACACCGGTGT GCAAGCGGCG ATGCTCTTTG GTGCCGCAGG CCTCGCGGCC
GGCCTCACTG CTGATCGGGC GGGCGTCGAG CCGCTCTGGA TCGGCCACGT CGTCCTTGCC
GGTGCCGTCT TCGTCGCCGC CCTTACCGCC CTCGCAAGTC AGCCAGGGAG ACTACGCGAC
GGATCACGGT GGGTCGCGAC CCGTTTCGGA CGACCTCTCT GGCTTGGTCT ATCAGCCGTG
ACGATCGAAG CATTACGACA GGCAGCGTCT GGCGCACCGA CGATCGCGAT GACACTGGCT
GGCGGCACGC TCGCCGCGGT TCTGGTAGCC GCCCCAGACT GGTATCGGCT CGTGGGCGTG
GCGCAGCCAG CGCCGGATGG CATCGCAATT TTTGAAACCG CGGTGGAGCC AAATCTGATG
CTACTCGCGA CCGACAGGCG TTACACCCCT GGCGCCTACG TTGAAGTTCA TGGCGTTACG
GCTTCACGAG GGGTCGTAGT CGGAAATTTG GCGCACAAGG GCGGGAACCG GATCCAGGTC
GCCTTGGAAG AACCATGGCA CGAGGTCGCC GATTCCAGTG GTCAGCAATG TGAGGTAGTG
ACGCTCTCCC ACCCTCCTGC TCGGGCGGTC GCGTTTGTCT CAGAGGGCTC TACGGACCGG
GTACTCAGCC TCCGCCCGTT CGGCGGCCTT GTTCGGGGGG ACACCGTCTA CTGGGAGGAG
GCGACCTCCG GCGCGCGATA CCTCTACCAG GTAGTGGCAC GCGAGCTGGC AAGGGAAATG
TGGGACGCCT CATCCGTGGT TACCGAGAGA GCGACCGCGG TGCTGCTGGG TGCTGCGGGT
CCAGGTGGTC TAACCCCCGG CACCGCGCTT CCTGCGCCGT ATGTTCCGGT GCTCTCGGCG
GATGAGGTGA CCGGACCGCT GGCTCCGGGA TTTGAGCGAA TCGGCACCAT CGCTGGAACG
GCACTGCCCT TTGGAGTCTC CGTCGCACAG CTGCGCGGCC ACCACCTTGC GATCCTCGGC
ATGTCAGGCA TGGGTAAGAG CACGGTTGCC CGGCGACTCA TTGACCTCAT GTCGTCAGCA
TCGGTCGTCG TTTCTCTTGA CGGGACAGGT GAGTACCGGG CGCGCTTTGG ACTGCCCGCC
TGGAACGACG CGGTGGGACT CACCACTCCT GGGGCATGGG TATACGAACC CGCGGGCGTT
CCCGCGCTGC GCGTATCTGA GTTCATCAAG ATGGCGATGA CGCAAGCGGC AGCGGAATAC
GCGGTAGGGG ACCCACTACG GCGCACTGTC CTCCTCGAGG AGGCTCACTC TTACCTGCCG
GAGTGGAACT TCGTTGCCGA CCGCAACGAA TCGAGCTACG TAGCCCAAAG CTGCCGCTAC
ATTCTCCAGG CAAGAAAGTT CGGCTTGAGC TTCATTCTCG TGTCGCAGCG CACCGCAGTG
ATCAGTAAGT CGGCACTCTC CCAGTGCGAG AGCTATATAG CGCTGCGAAC GCTCGACGCG
ACAAGCCTTG AGTATCTGGA AGGCGTGCTC GGCAGCCAGT TTCGCGAGAC CGTCTCGGGC
CTCCAGAGGT ACCAGGCCGT GTGTGCCGGC CCAGCGTTCA GTACATCAAC ACCGGTTGTA
GTGAACCTCG ATCCCTATCC AGCACCGCCC CCGGCCGGTC CACCAACGTC AACCGGCGCT
CCCACCACTG CATCTCACAC AGGGGTCTGA
 
Protein sequence
MPRSTDRPSE RPAWIRRVRV FQALAATAIL LVAASNVSSE PLAQWWFWVA ATTALTLALV 
EPYYTGVQAA MLFGAAGLAA GLTADRAGVE PLWIGHVVLA GAVFVAALTA LASQPGRLRD
GSRWVATRFG RPLWLGLSAV TIEALRQAAS GAPTIAMTLA GGTLAAVLVA APDWYRLVGV
AQPAPDGIAI FETAVEPNLM LLATDRRYTP GAYVEVHGVT ASRGVVVGNL AHKGGNRIQV
ALEEPWHEVA DSSGQQCEVV TLSHPPARAV AFVSEGSTDR VLSLRPFGGL VRGDTVYWEE
ATSGARYLYQ VVARELAREM WDASSVVTER ATAVLLGAAG PGGLTPGTAL PAPYVPVLSA
DEVTGPLAPG FERIGTIAGT ALPFGVSVAQ LRGHHLAILG MSGMGKSTVA RRLIDLMSSA
SVVVSLDGTG EYRARFGLPA WNDAVGLTTP GAWVYEPAGV PALRVSEFIK MAMTQAAAEY
AVGDPLRRTV LLEEAHSYLP EWNFVADRNE SSYVAQSCRY ILQARKFGLS FILVSQRTAV
ISKSALSQCE SYIALRTLDA TSLEYLEGVL GSQFRETVSG LQRYQAVCAG PAFSTSTPVV
VNLDPYPAPP PAGPPTSTGA PTTASHTGV