Gene Cfla_2023 details

Gene Information

Locus tag: Cfla_2023
Symbol:
ID: 9145918
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 2251323
End bp: 2252588
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: DNA-directed DNA polymerase
Protein accession: YP_003637117
Protein GI: 296129867
COG category:
COG ID:
TIGRFAM ID:
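
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information record.
start_bp = 2251323
end_bp = 2252588
gene_length_bp = 1266
protein_length_aa = 421


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks print True for this record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions, 2252588 - 2251323 + 1 gives 1266 bp, and 1266 / 3 - 1 gives 421 residues, which agrees with the listed values.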


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.137176
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGCGCA TGCCCGGCGC GACGATCCTG CACGCCGACC TCGATGCGTT CTACGCGTCG 
GTCGAGCAGC TGCTGGACCC GCGCCTGCGT GGCCGACCCA TCGCGGTCGG GGGCAGTGCC
GCGGGCGGCG TCGTCCTCGC CGCGTCGTAC GAGGCCAAGC GCTACGGGGT CTCCGGGGGG
ATGCCCGGCT GGCGGGCCGC ACGACTGTGC CCGGGCCTGC AGTTCGTCCC CGGGCGCTTC
CGGGAGTACC AGCCGATCGC CGACAGGGTC ATGGACGTCC TGGGTGACGT GACCCCGGTG
GTGGAGCGGA TCTCGATCGA CGAGGCGTTC CTCGACGTCG CCGGCTCGAC CCACCTCTTC
GGCACGCCGG CGCAGATCGC GGTGCTGCTG CGGCGCCGCG TGCGCGACGA GATCGGCCTG
CCGATCTCGG TCGGGGTCGC CCGCACCAAG CACCTCGCCA AGATCGCGTC GCAGGTCGCC
AAGCCCGACG GCCTCGTGGT CGTCGAGCCC GAGCGCGAGC GGGAGTTCCT CGAGCCGCTG
CCGGTCGGTC TCATGTGGGG CGTGGGGCCC GTCGCGCGGG CGCGGCTCGC CGAGCGGGGC
ATCACGACCA TCGGCGAGCT GGCGCGCACA CCGACGGGCG CGGTCGAGAA GATCCTCGGG
CACGCCGTCG GGTCGCGGAT GTCCGCGCTG GCGCACAACG AGGACCCGCG TCGGGTCGCC
GGGGCGGGTC GTGCCCGGTC GGTGGGGGCG CAGTCGGCCC TCGGCCGGCA GCAGGCGACG
CCCGAGCTCG TGCGCGAGGT GCTCGCCCAG CTCGCCGACC GCGTCGCGGG CCGCATGCGT
GCCAAGGGGC GCGCGGGGCG CACGGTCACC GTGCGCGTGC GGTTCCCCGG CATGCGCTCG
GTCACGCGCT CGCACACGCT TCCCGGGCCG GTCGCCACGA CGCTCACCCT CACCGAGGTC
GCCGAGCAGC TCGTGTGGCA GGCGATCCGC GAGCAGCCGC ACCCCGAGCC CGACGTGACC
CTGCTCGCGA TCTCCGTGTC GGGCCTGGTC GAGCAGTCCT CGCTGCAGCT CGAGCTGCCG
CTGCTCACCG CCGACCCACG TCGCCCGGGA TCGGCGCCGG GTGCGGCCCG CTGGGCGGTC
GACCGCTCGG TCGACGCGGT GCGTGCGCGC TTCGGCAACG CCGCGGTGGG CTACCTGCCG
ACCGCCATGC CACGCGTGCG CACGGTGCCG GACGAGTTCC GCGAGCTCGC GGAGCACGAC
CTGTGA
 
Protein sequence
MRRMPGATIL HADLDAFYAS VEQLLDPRLR GRPIAVGGSA AGGVVLAASY EAKRYGVSGG 
MPGWRAARLC PGLQFVPGRF REYQPIADRV MDVLGDVTPV VERISIDEAF LDVAGSTHLF
GTPAQIAVLL RRRVRDEIGL PISVGVARTK HLAKIASQVA KPDGLVVVEP EREREFLEPL
PVGLMWGVGP VARARLAERG ITTIGELART PTGAVEKILG HAVGSRMSAL AHNEDPRRVA
GAGRARSVGA QSALGRQQAT PELVREVLAQ LADRVAGRMR AKGRAGRTVT VRVRFPGMRS
VTRSHTLPGP VATTLTLTEV AEQLVWQAIR EQPHPEPDVT LLAISVSGLV EQSSLQLELP
LLTADPRRPG SAPGAARWAV DRSVDAVRAR FGNAAVGYLP TAMPRVRTVP DEFRELAEHD
L
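
The gene and protein sequences can be related with a small translation routine. The following is a minimal sketch, not the method used to produce this record: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table is the standard layout, on the assumption that translation table 11 assigns the same amino acids to internal codons as the standard code and differs only in the start codons it permits. Only short excerpts of the gene sequence are used here; paste in the full sequence to check the whole record.

```python
# Standard codon table built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


first_30 = "ATGAGGCGCA TGCCCGGCGC GACGATCCTG"  # start of the gene sequence
last_6 = "CTGTGA"                              # end of the gene sequence

print(translate(first_30))  # MRRMPGATIL
print(translate(last_6))    # L
print(gc_fraction(first_30))
```

The first ten residues and the final residue produced this way match the start and end of the protein sequence listed above. The GC fraction printed here covers only the 30-base excerpt, so it is not expected to equal the whole-gene figure.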