Gene Cfla_0502 details


Gene Information

Locus tag: Cfla_0502
Symbol:
ID: 9144369
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 534730
End bp: 536565
Gene Length: 1836 bp
Protein Length: 611 aa
Translation table: 11
GC content: 79%
IMG OID:
Product: transcriptional regulator, SARP family
Protein accession: YP_003635615
Protein GI: 296128365
COG category:
COG ID:
TIGRFAM ID:
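
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(534730, 536565)
print(length_bp)                  # 1836
print(protein_length(length_bp))  # 611
```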


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000614607
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000229515
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGCGCATCG GGGTGCTCGG TCCCGTCGTC GCGCACGCCG CCGGCGACGC GTTGACGCTG 
CCCCGGCCGC GCTCGCGCGA GGTGCTGGCG GTGCTCGTCG CGGCGGGCGG GCGCACGGTC
CGCACGGACG CGCTGGTCGA CGACCTGTGG GACGGCACCC CTCCGCCGGG TGCGGTCGGG
GCGGTGCGCA CGTTCGTCGC CGAGCTGCGC CGCGCGCTGG AGCCCGACCG GCCCCCGCGC
ACACCGCCGC GCGTCGTCGT CACCCGCGGC CCGGGCTACG CGCTCGACGT CCCGCCGGAC
GCCGTCGACG CGTGGCGCGT GGCGCGGCTG GCGGCCGAGG CCCGCACCGC TCCCCCGGAC
ACCGCGGTGC GTCTGCTCGC CCACGCCCTC GCCGCGTGGC GCGGTGAGCC GTACGAGGAG
CTCGCCGACC GCCCCTGGGT GCAGCCCGAG CGCACCCGCC TGGTGACGCT GCGCGCCGAC
GTCACCGAGC AGCTGGCGGA TGCGCTGCTG GCGACGGGCC GCGCGGTCGA CGTGGTCCCG
CTGCTCGACG CGCACGTCGG CGCGCACCCG TGGCGCGAGG ACGGCTGGCG GCTGCTGGCC
ACGGCGCTGC ACCGCTTGCA CCGGCCGGCG GACGCGCTCG ACGTGCTGCG CCGCGCCCGA
CGCACCCTCG CCGAGGACCT CGGCCTCGAC CCCGGCCCAG CCCTGCGGGA CCTGGAGCAG
CAGGTCCTGG AGCGTCGCGA CGACGACGCG TGGCGCGACG ACAGCCTGTC CGCGCTCGAC
CGCCGCGGCG GGCGCGCGCG GCTGGAGGCG TCGGGCGCGG TCCTCACGAG CCTGGCCGTG
TCGGGTGACC TCGCGACGGT CCGCGCGCAG CGGCTCGCGT CCATCGCGGC GGCCGAGAGG
CTGGGCGACC CGTTGCTGAC GGCGCGGGTG GTCGGCGGCC TGGAGGCGCC GGGGGTGTGG
ACGCGGTCGG ACGACGACGA GCTCGCGGCG GCGGTCGTCG CGGCGGCCGT CCGGACGCTG
CCGCGGGTGA CGGGCAGCCC GGTGACGCGC GGGCGACTGC TCGCGACGAT CGCGATCGAG
GACCGCGGCA CGGCGGCGCG CGAGACCGAG GCGCTCGAGG CCGAACGGAT CGCGCGCGAC
CTGGACGACC GGCACCTGCT GTGCCTGGCT CTCAGCGGGC GCGCGATGCA GCGGTTCGGG
AGCACGGGGC TGGCGGCGGA CCGCGAGCGG ATCGGCGCCG AGCTCGTCGC GACGGCGGTG
TGCGCGGAGT CCACGACGTT CCGGATCGCC GGGCGCATCG TGCGGATGCA GGCGCTGTGC
GCGCTGGGGC GGCTCGACGA GGCGGCGGCG GAGGCCGACG AGGTCGACGC GCTGGCCGCG
TCCGCCGAGC GCCCGCTGGC GACGACGTTC ACCGCGTGGT TCCGCCACAC CTTCGCCGAC
GGGCCGGAGC CGGCGGCGCC CGACGAGATG CCGGGGTTCT CGCACGGGAT CGTCGCGCTG
GCCCGGGTGA CGCGGCTGGT CCGCGACGGC GGCACGCTCC CCGGGCCGGA CGCCGCGGGC
GACCTGGGCC CGTACGCGCC GTGGGTCCGG CCGCTCCTGC TGGTGCGCGC GGGCGACGTC
GACGGCGCCC GCCGGGCCGT GCGTGGGGCA CCGGCGCCGC CGCACGACCT GCTGCAGGAG
GTCGCCTGGG GCCTGCTGCT GACGGCGGCC CGCGAGGCCG GCGCACCCGA CGTCGTCGAC
CGTGCCCGTG ACGCCCTGGC ACCGGCCGTC GACGAGCGTG CGGCGGGCAG CGGCGTCGTC
GACGCGGGTC CCGTCCGCGC GTTGCTGCGG GGCTGA
 
Protein sequence
MRIGVLGPVV AHAAGDALTL PRPRSREVLA VLVAAGGRTV RTDALVDDLW DGTPPPGAVG 
AVRTFVAELR RALEPDRPPR TPPRVVVTRG PGYALDVPPD AVDAWRVARL AAEARTAPPD
TAVRLLAHAL AAWRGEPYEE LADRPWVQPE RTRLVTLRAD VTEQLADALL ATGRAVDVVP
LLDAHVGAHP WREDGWRLLA TALHRLHRPA DALDVLRRAR RTLAEDLGLD PGPALRDLEQ
QVLERRDDDA WRDDSLSALD RRGGRARLEA SGAVLTSLAV SGDLATVRAQ RLASIAAAER
LGDPLLTARV VGGLEAPGVW TRSDDDELAA AVVAAAVRTL PRVTGSPVTR GRLLATIAIE
DRGTAARETE ALEAERIARD LDDRHLLCLA LSGRAMQRFG STGLAADRER IGAELVATAV
CAESTTFRIA GRIVRMQALC ALGRLDEAAA EADEVDALAA SAERPLATTF TAWFRHTFAD
GPEPAAPDEM PGFSHGIVAL ARVTRLVRDG GTLPGPDAAG DLGPYAPWVR PLLLVRAGDV
DGARRAVRGA PAPPHDLLQE VAWGLLLTAA REAGAPDVVD RARDALAPAV DERAAGSGVV
DAGPVRALLR G
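
The gene sequence begins with GTG, while the protein sequence begins with M. Under translation table 11 (bacterial, archaeal and plant plastid code), GTG is an accepted alternative initiation codon and is read as methionine at the start of a coding sequence. The sketch below illustrates this on the first 60 bases of the gene. The helper name `translate_cds` and the short list of start codons are assumptions made for illustration; table 11 also lists further, rarer initiation codons that are omitted here.

```python
# Codon table built from the conventional TCAG ordering. Table 11 uses the
# same codon-to-amino-acid assignments as the standard code and differs
# only in which codons may initiate translation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Common bacterial start codons (illustrative subset, not the full table 11 list).
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, reading an initial start codon as M
    and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


first_60 = "GTGCGCATCG GGGTGCTCGG TCCCGTCGTC GCGCACGCCG CCGGCGACGC GTTGACGCTG"
print(translate_cds(first_60))  # MRIGVLGPVVAHAAGDALTL
```

The output matches the first 20 residues of the protein sequence listed above.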