Gene Cfla_0489 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_0489 
Symbol 
ID9144356 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp521984 
End bp524062 
Gene Length2079 bp 
Protein Length692 aa 
Translation table11 
GC content75% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003635603 
Protein GI296128353 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0388236 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGCTG CACCGCCCCG GTCCTCGGGG AGCACGACGT CCGACATCCC CGGCGAGCTC 
GCGGAGGAGC TCGCCGCCGA GCGGCGGTAC CTCGCGTCGG CCCGGGACGC CCTGCGTCGC
ATGCGCGAGC GTGCGGAGAA GATGCTGGAC GTCGGCGCAG GGGTGGGCGG TGACGCGTAC
GCCTCCGAGC GGCTGGGCTT CACGCTGACC CGGCGGGTCG CCCAGCTGTC CGACCAGTCG
GACGTGCCGC TGTTCTTCGG GCGCCTCGAG CTCGCGGACG CGATCGACGA CGGTTCGACC
CGCTACTACG TCGGGCGTCG GCACGTGACC GACGAGGAGA GTCACCCCCT GGTGCTCGAC
TGGCGGGCCC CCGTGTCCCG GGCGTTCTAC CGGGCGAGCC CGCGCGAGCC GCTCGGGGTC
GCGGTGCGAC GCCGGTTCGG CTCCTCCGGC GGCGCCCTGA CGAGCCTCGA GGACGAGCAC
CTCGACCGCG GTGAGGAGAC CGGCGCGGCC AGCCGCATCC TGACCGACGA GATCGAGCGA
CCCCGCGTGG GCCCCATGCG TGACATCGTC GCGACGATCC AACCGGAGCA GGACGAGCTG
GTCCGTGCCG ACCTGTCCGT GTCGCTGTGC GTCCAGGGCG GGCCGGGCAC CGGCAAGACG
GCGGTCGGCC TGCACCGCGC GGCCTACCTC TTCTACACCC ACCGGCAGCG GCTCGAACGC
GCCGGTGTGC TCGTGGTCGG CCCGAACCGC GCCCTCATCC AGTACGTCAG CAACGTCCTG
CCGGCCCTCG GCGAGCTCGA CGCCGAGCAG ATCAGCATCG ACGAGCTGCC CGCCGTCGCC
GTGCAGGGCG TCGACGAGCC CGACGTCGCC GCGCTGAAGC ACGACGCCCG CATGGCGAGC
GTCCTGCACC GCGCGCTCTG GAGCCTGGTC GTCCCGGCGC GCGAGCCCCT GACCGTCCCC
TACGGGTCCG CCCGCCACAC GCTCGGTCCC GCAGCGCTCG ACCGCGTCGT CGAGGAGGTC
CTGGAGTCGG CGCCCACGTA CGCCGCCGGT CGGGACCGGC TCCGGGCACG GGTCGTGGCC
CGGCTGCAGC GGCAGGTCGA GTCGCGGCAC GCCGAGGCAC CGAGCGAGAC GTGGTTCCGC
TCGACGAGCC GCAGCCGCCC GGTGACGCGC TTCCTGGACG CGGTGTGGCC GACGGTCGCA
CCGGAGCAGC TGCTGCACCG GCTCCTGTCC GACCCCGCGG TGCTCGCCGC AGCGGCGGAG
GGCGTCCTCA CGGACGAGGA GCAGCGCCTC CTGCAGTCGC GGGTACCCGC ACGCACGTAC
CGCAGAGAGC GGTGGAGCAG CGCCGACGCC TTCCTGATCG ACGAGGTCGC GGGGCTCGTC
GAGCGTCCGC GCGGCTACAG CCACATCGTC GCCGACGAGG CGCAGGACCT CTCCGCGATG
CAGTGCCGGG CGCTCGCCCG CCGCAGCGTC CACGGGTCCC TCACCGTCCT CGGCGACCTC
GCGCAGGGCA CCACGCCGTG GGCCGCGCGG AGCTGGCACG ACACCATGAC GCACCTGGGA
CGGCCCGACG CCGCGCTCGT GCCCCTGACG ACCGGGTTCC GCGTGCCGTC CGTCGTCATG
GACCTCGCCA ACCGGCTCGT CCCGGAGCTC GGGCTCGACG TCCCCCTCGC GACCTCGCTG
CGGCACGACG GTTCCCTGCG GGTGTGCCGC GTCCACGACG TCGTGGCAGC CGCCGTGGAC
GAGTGCCGCA GCGCCCTCGC GCGCGAGGGG TCGATCGGGC TCGTCGCGCC CGACTCGCTC
GTGGACGACG TCGCTGCGGC GCTGGCGGCC GCGGGCCTGG CGTGGAACCA CGCGGACGAC
CTGGCCGGCG AGCACCCCCT GACGGTCGTC CCGGCGACCC TGGCGAAGGG GCTGGAGTTC
GACACGGTCG TCGCCCTCGA GCCGGCGCGT GTCGTGGCCG AGGAGCGCCG CGGGCTGAAC
CGGCTGTACG TCGTGCTCAC CCGCGCGGTG TCCGACCTCG TCGTCCTGCA CCGGGACCCT
CTGCCACCGG CGCTCGAGGA GCCGGCGGCG GCGCGGTAG
 
Protein sequence
MDAAPPRSSG STTSDIPGEL AEELAAERRY LASARDALRR MRERAEKMLD VGAGVGGDAY 
ASERLGFTLT RRVAQLSDQS DVPLFFGRLE LADAIDDGST RYYVGRRHVT DEESHPLVLD
WRAPVSRAFY RASPREPLGV AVRRRFGSSG GALTSLEDEH LDRGEETGAA SRILTDEIER
PRVGPMRDIV ATIQPEQDEL VRADLSVSLC VQGGPGTGKT AVGLHRAAYL FYTHRQRLER
AGVLVVGPNR ALIQYVSNVL PALGELDAEQ ISIDELPAVA VQGVDEPDVA ALKHDARMAS
VLHRALWSLV VPAREPLTVP YGSARHTLGP AALDRVVEEV LESAPTYAAG RDRLRARVVA
RLQRQVESRH AEAPSETWFR STSRSRPVTR FLDAVWPTVA PEQLLHRLLS DPAVLAAAAE
GVLTDEEQRL LQSRVPARTY RRERWSSADA FLIDEVAGLV ERPRGYSHIV ADEAQDLSAM
QCRALARRSV HGSLTVLGDL AQGTTPWAAR SWHDTMTHLG RPDAALVPLT TGFRVPSVVM
DLANRLVPEL GLDVPLATSL RHDGSLRVCR VHDVVAAAVD ECRSALAREG SIGLVAPDSL
VDDVAAALAA AGLAWNHADD LAGEHPLTVV PATLAKGLEF DTVVALEPAR VVAEERRGLN
RLYVVLTRAV SDLVVLHRDP LPPALEEPAA AR