Gene Cfla_2887 details

Gene Information

Locus tag: Cfla_2887
Symbol:
ID: 9146796
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 3198207
End bp: 3201200
Gene Length: 2994 bp
Protein Length: 997 aa
Translation table: 11
GC content: 80%
IMG OID:
Product: transcriptional regulator domain protein
Protein accession: YP_003637970
Protein GI: 296130720
COG category:
COG ID:
TIGRFAM ID:
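
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon. The function name expected_lengths is illustrative and not part of the source database.

```python
from typing import Tuple


def expected_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and a single terminal stop
    codon that is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the stop codon.
    return gene_len, gene_len // 3 - 1


# Coordinates taken from the record above.
gene_len, protein_len = expected_lengths(3198207, 3201200)
print(gene_len, "bp;", protein_len, "aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed in the record.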


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0904649
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.154883
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGAGCGG CCGACGACGT GCCCGCGAGC ACCGGAGCAG TCCCGGTGCT GCGCGCCCGC 
CTGGCGCGCC CGGAACCGAC CGGGCTGCGC CGGCCGCGGC TGCTGGCCCG GCTCGTCGGC
GCCGCGGCGC CGCCGCTCGT CGCCGTGGTC GCCGGCCCGG GGTGCGGCAA GACGACGCTG
CTGGCGCACG TCGCGGAGGC CGCGGGCCTG CCCGTCGCGT GGGTGACCCT CGACCCCGCG
CTCGACGACC CGCAGGTGCT GCTGCGGCAC CTGCACGTCG CGTGCGCGGC CCTGCCGTGG
CGCGGACCGC ACGAGCCGTG GCCCACCGCG GACGCCGCGC TCGCCGCGCT CGGTGACGGC
CTCGCGGGCC CGGCGCTGCT CGTGCTCGAC GACGTGCACA CGGCCGAGCG GCGCCGTGCC
GCGCAGGTCG TCGACCTGCT CGTGCGCCAC CAGCCCGCCG GTCTGCAGCT CGCGCTCGGC
AGCCGCGGCC TGCACCCCGG CGTGCCGAGC AGGGGGCTCG CGGGCACGGC GCGCGTCGTC
GACGGCGAGG CGCTGCGCTT CCGCACGTGG GAGGTCGACG AGTTGTTCCG CGGGCACCAC
GCGACGCCGC TGCCCGCGGC GGAGGTCGCC GAGCTCACGC AGCGCACGCA CGGCTGGGCG
GCAGGTCTCG AGCTGTTCCA CCTGGCCACG CGTGACCGGC CGGCGAGCTC GCGCACCCGC
CTGCTGGAGC GTGCGGGACG CGGTCCCGCG GTGGCCGACT ACCTCGCGGG CCACGTGCTC
GACCCGCTGC CGGTCTCGCT GCGCGAGTTC CTCGTGCGCA CGTCCGTCCT GGGGCGCCTG
TCCGCCGAGC GGTGCGACGC GCTGCTCGGC ACCGACGGCG CCGCCCGGAT GCTCGCCGCG
GCGCACCGCC ACGGGTTGCT CACGGCGGTC GACGACGCCG GCGGCACGCG CTACGCCTGC
CACGAGGTGG TGCGCGGGTA CCTGCTCGAC ACGCTCGCCG AGCGGGCGGG CGGGGCCGTC
GCGCGCGAGC TGCACCGACG GGCGGCAGCT CTCGCGCTCG CCGCCGGCGA GCACGACGAG
GCCGTGCGCT CGTCGTGCCG TGCGCAGGAC GAGGACGCCG TGCGCCGCGT CCTGCACCTG
GCCGGGGAGG ACCTCGCGCG CCGTCCCGGG CCGTGGCTCG ACCTCCTGCC GGACGCGGTC
CGCGACGACG ACCCGTGGGT CGCGGTCGCG GTCGCGCGCC GTCACCTCGC CGAGGGCGTG
CCGGAGCGGG CCCTGGCGGC CTACGGGGAC GCCGCGGAGC GCGCGGCGAC CGCTGCCGAC
CGGACGCTCG TGATCCGGGA GTGCCACCTG GTCCGCACGT GGGCGGAGCC GTCGCCGGGT
GCGCCGACGT ACTGGGTCGC GCTGGTGCGC CGCGCACTGG CCCGGCCGCG CGACTGCCTC
GCGGCGGACG TCGCCGCGCA CGACGTGGGT CGTGCACTCG CGCCCGCCGT CGCCGCGCTC
GTGGGCGGCG ACGTACGGGA CGCGGCGCGC CGGTTCGCCG TGGTGCACAG GCGTGCCGGG
ACCCCGCCGG TCGTGGAGGC GCTCGCGCTG CTGGGCGAGG CCGCGGCCCT CACGCTCGTG
CACGACCGTG GCGCGCGTGA CGCGCGGGAG CGCGCGCACA CCGCCGCCGG GCTGCTCGGT
GCGCCCGCGC TGGAGCGGCT GGCGGACGGC CTCGACCTCG CGGCCGACCC GGCGGTCGGC
GACGGGCTGC GTCACCTGCG CGAGCGTTGC ACGGGCGTCG GCGACACGTG GGGTGCGGCC
GTCCTCGGGC TGCTCGACGC CCTGCGGGCC GTCACCGCGG TCCCGTCGCG CCGCCGCGCC
GTGGTGGTCG CGCGGGACGC GGCCGACGAG CTCGAGGCGC TGGGAGCGCC CGCGCTGGCG
GCGTGGGCCC GCGCGGCGGG TGCCGTGGTC CGTGCCCGCT GCGGCGACGG TCCGCCGCCC
GACGGGCTCG AGGCCGACGC GGCGCGCAGC GGTCCGGTGC CGCTCGCGCT CGTCCTCGCC
GTGCTCGACG CGCCGCGGGA CCGCTGGGGT CGGGCGGCAC GCGCCGCGGG GGTCCCCGGG
TGGGTGCGGC TCGTCGCGCA CCGTGCGGCC ACCCGCACGG CACCTGCGCC CCCGATCCCC
GTCGCCGGTA CGCCCGCGGC CGCCGCCGCG ACCGCAGCGG GCGGTACGGC CGTGCTGCGG
GCGCTCGACG TGCAGTGCCT GGGGACGTTC CGGCTGACCG TCGCCGGGGT CGCTGTGCCG
ACCGCCGGAC TGCGTCCGCA GCACGCGGCA CTGCTGCGCG CGCTGGCGCT GGCCGCGCCC
GCTCCCGTGC ACCGCGACCG CCTCGTCGAG TGGTTCTGGC CCGGTCGGCT GCCCTCGCGG
GGGCAGCACT CCCTGCAGGT CGCCGTCAGC GACGTGCGGC GCGCGCTCGA CGCGGCGAGC
CCGGGTGCCG GGGCGGTGCT GCGTCGCGTC GACCAGGGGT ACGCGCTCGA GGGGGGCGGG
ACCACCGTGC ACGACGTGCG GGCGCTCGAG GACGCGCTGC GCGCGGCGGA CGTCGCGCTC
GACAGGGGTG ACGAGGCCGC GGCGCTCGTC GCCCTGCGTC GCGTCGTGGC GGACGGTGCG
GCTGAGCTGC TGCCGCAGGA CGGGACGGCC GAGTGGGTGC TCGGGCCGCG CGAGCGGCTG
CGCGCCGCGG TGGTGCGGGC GTGCGCGGCG CTCGCGGACC TGCTGGCGGC GCGGGGCGAC
GCGTCGGGTG CCCTCGCGGC CGTGCGCCGC GGCCTCGCGC TCGACCGGTA CCAGGACGAC
CTGTGGCGCC GCCTCGTCAG AGGGCTCGCC GCCGACGGCC GCCCGGCGGC GGCAGCTGCG
GCCCGCCGCG AGTACGCCGC GGTGCTCAGC GAGCTGGGTG TCGCGTCCGG CACGCCGCCC
GCGCTGCGGG CCGGTGCTGC GACGCACGTG CTGCCCACCC TCACGGGGCG CTGA
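
The GC content field of the record can be recomputed from the gene sequence. The sketch below is a hypothetical Python helper (gc_percent is not a name from the source) that ignores the spaces and line breaks used in the listing. For brevity it is applied only to the first line of the sequence, so its result need not match the whole-gene figure exactly. Passing the full sequence would give the gene-wide value.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence as listed above (60 bases).
first_line = "ATGCGAGCGG CCGACGACGT GCCCGCGAGC ACCGGAGCAG TCCCGGTGCT GCGCGCCCGC"
print(round(gc_percent(first_line), 1))
```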
 
Protein sequence
MRAADDVPAS TGAVPVLRAR LARPEPTGLR RPRLLARLVG AAAPPLVAVV AGPGCGKTTL 
LAHVAEAAGL PVAWVTLDPA LDDPQVLLRH LHVACAALPW RGPHEPWPTA DAALAALGDG
LAGPALLVLD DVHTAERRRA AQVVDLLVRH QPAGLQLALG SRGLHPGVPS RGLAGTARVV
DGEALRFRTW EVDELFRGHH ATPLPAAEVA ELTQRTHGWA AGLELFHLAT RDRPASSRTR
LLERAGRGPA VADYLAGHVL DPLPVSLREF LVRTSVLGRL SAERCDALLG TDGAARMLAA
AHRHGLLTAV DDAGGTRYAC HEVVRGYLLD TLAERAGGAV ARELHRRAAA LALAAGEHDE
AVRSSCRAQD EDAVRRVLHL AGEDLARRPG PWLDLLPDAV RDDDPWVAVA VARRHLAEGV
PERALAAYGD AAERAATAAD RTLVIRECHL VRTWAEPSPG APTYWVALVR RALARPRDCL
AADVAAHDVG RALAPAVAAL VGGDVRDAAR RFAVVHRRAG TPPVVEALAL LGEAAALTLV
HDRGARDARE RAHTAAGLLG APALERLADG LDLAADPAVG DGLRHLRERC TGVGDTWGAA
VLGLLDALRA VTAVPSRRRA VVVARDAADE LEALGAPALA AWARAAGAVV RARCGDGPPP
DGLEADAARS GPVPLALVLA VLDAPRDRWG RAARAAGVPG WVRLVAHRAA TRTAPAPPIP
VAGTPAAAAA TAAGGTAVLR ALDVQCLGTF RLTVAGVAVP TAGLRPQHAA LLRALALAAP
APVHRDRLVE WFWPGRLPSR GQHSLQVAVS DVRRALDAAS PGAGAVLRRV DQGYALEGGG
TTVHDVRALE DALRAADVAL DRGDEAAALV ALRRVVADGA AELLPQDGTA EWVLGPRERL
RAAVVRACAA LADLLAARGD ASGALAAVRR GLALDRYQDD LWRRLVRGLA ADGRPAAAAA
ARREYAAVLS ELGVASGTPP ALRAGAATHV LPTLTGR
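
The protein sequence corresponds to the gene sequence translated with translation table 11. The following Python sketch illustrates this on the first and last lines of the gene sequence. It assumes the compact NCBI codon ordering (bases in TCAG order) and relies on the fact that table 11 assigns the same amino acids to codons as the standard code, differing only in the permitted start codons. The names CODON_TABLE and translate are illustrative. The sketch does not handle alternative start codons, which is sufficient here because the gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. The standard code and
# translation table 11 share these assignments (they differ in start codons).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence as listed above.
first_line = "ATGCGAGCGG CCGACGACGT GCCCGCGAGC ACCGGAGCAG TCCCGGTGCT GCGCGCCCGC"
last_line = "GCGCTGCGGG CCGGTGCTGC GACGCACGTG CTGCCCACCC TCACGGGGCG CTGA"
print(translate(first_line))
print(translate(last_line))
```

The two printed strings can be compared with the beginning and end of the protein sequence listed above.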