Gene Cfla_1775 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_1775 
Symbol 
ID9145664 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp1979035 
End bp1980144 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content77% 
IMG OID 
ProductUroporphyrinogen III synthase HEM4 
Protein accessionYP_003636871 
Protein GI296129621 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.488954 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.025641 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGACC AGACGCTGGC CGGGTGCGTC GTGCTCGTCA CGGCGGACCG GCGGGCTGCC 
GAGCTGCGCG CTGCGCTCGA GCGGCGTGGC GCGACGGTGC GTCACGCGCC CGCGCTCGGG
ATGGTGCCGC ACACCGACGA CGCCCTGCTG CTCGCCCGCA CGCGCGACCT GCTCGCGGAC
CCGCCGGACA CCGTGGTCGT CACCACGGGC ATCGGCTTCC GCGGGTGGGT CGAGGCAGCC
GACGCCGCCG GGATCGCGGA CCGTCTGCTC GAGGTGCTGG CCACGACCCG CATCGTGGCC
CGCGGCCCGA AGGCGCGCGG CGCCATCCAG GCCGCCGGGC TGAGCGCCGA CTGGGTGGCG
GAGTCCGAGA CGAGCGCCGA GATCGCCGAG GTGCTGCTCG ACGAGGGCGT CGCCGGCCGC
GACGTCGTCA TCCAGCACCA CGGCGCCGGC GCGGACGGCC TCGACGAGGC GTTCGCCGCG
GCCGGGGCGC GCGTGCGCAG CCTCGTCGTC TACCGGTGGG GTCCGCCGCC CGACCCGGAC
CTCGTCCGCG ACTCCGCCCG CGCGGTGGCC GACGGCGAGA TCGACACCGT CGTGTTCACG
TCGGCGCCCG GCGCCGCGGC GTGGGTCGCG GCGGCACGCG ACGCGGACGC GCTCGACGGC
GTGCTGCGCC GCCACACGTC CGGGGCGGTG GTGTTCGCGG CCGTCGGACC GGTCACCGCC
AAGCCGCTGG TCGACGTCGG CATCGACCCG CTCGTGCCGG ACCGGGGGCG GCTCGGCTCG
CTCGTGCGCG CCGTCGTGAC GCACTACGGC GGCCTCGAGG CACTCGACAC CGTCGCGGGC
CAGCTGCGCG TGCGCCGCGG CGCCGCCGTC CTCGACGGGC GCGTGCTGCC GCTGTCGCGC
ACCGGGCTCG AGGTGCTGCG GCTGCTCGCG CACGCGCGCG GCTCGGTCGT CCCGCGTGAC
CGCGTCCTCG ACGTGCTGCC GGGGGTATCG TCCGACCCGC ACGCGGCCGA GGTCGCGATC
GCGCGGCTGC GGGACGCCAC CGGCAGCCGT GCGCTCATCC GCACGGTGGT CAAGCGCGGC
TACCGGCTCG AGCTCCAGGA GCAGGCATGA
 
Protein sequence
MIDQTLAGCV VLVTADRRAA ELRAALERRG ATVRHAPALG MVPHTDDALL LARTRDLLAD 
PPDTVVVTTG IGFRGWVEAA DAAGIADRLL EVLATTRIVA RGPKARGAIQ AAGLSADWVA
ESETSAEIAE VLLDEGVAGR DVVIQHHGAG ADGLDEAFAA AGARVRSLVV YRWGPPPDPD
LVRDSARAVA DGEIDTVVFT SAPGAAAWVA AARDADALDG VLRRHTSGAV VFAAVGPVTA
KPLVDVGIDP LVPDRGRLGS LVRAVVTHYG GLEALDTVAG QLRVRRGAAV LDGRVLPLSR
TGLEVLRLLA HARGSVVPRD RVLDVLPGVS SDPHAAEVAI ARLRDATGSR ALIRTVVKRG
YRLELQEQA