Gene Cfla_2071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_2071 
Symbol 
ID9145967 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp2313921 
End bp2315816 
Gene Length1896 bp 
Protein Length631 aa 
Translation table11 
GC content77% 
IMG OID 
ProductDNA polymerase III, epsilon subunit 
Protein accessionYP_003637165 
Protein GI296129915 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCCGGCC GGACGCGCCC GGACGGTCGC GGTGCGGCCT GCACGACGCT GTCGGACGTC 
GTCGGTAGCG TCCGCGCCAT GTCCAGCCCT CGTCGTCTGC GGCCCTCCCA CCCCGAGGTG
CCGGCGCCGG GCGCGACGGG CGTCCAGGTG GCGCTCGACG AGCTCGGCAC ACCCCTGAGC
GACGTGACGT TCGTCGTCGT CGACCTCGAG ACCACCGGAG GCCGGGCGGC GGAGGACGCC
ATCACCGAGA TCGGCGCGGT GAAGGTGCGG GGCGGCGAGG TCCTCGGCGA GTTCCAGACG
CTCGTGGACC CGGGCGGCCC CGTGCCGCCC TTCATCCAGG TGCTGACCGG GATCACGACG
TCGATGCTCG TCGGCGCGCC GACGATCGGC GAGGTGCTGC CGAGCTTCCT CGAGTTCGCC
CGCGGTGCGG TGCTCGTGGC GCACAACGCG CCGTTCGACG TGGGGTTCCT GCGGGCCGCG
GCCGCCCGGA GCGAGCGCGC GTGGCCGGGT TTCCAGGTGG TCGACACGGT GCGCCTGGCG
CGGCGCGTCG TGCTGCGCGA CGAGGCGCCC AACCACAAGC TGTCGACGCT GGCCGCGCTG
TTCGGCGCGA CGGTGACGCC GAACCACCGG GCGCTGGCCG ACGCGCGCGC GACGGTCGAC
GTGCTGCACG CGCTCCTCGG GCGGCTCGCG CCGCTGGGCG TCACGCACCT CGAGGACCTC
GCGACGGCGA CCGACCCGGT GCCCGCCGAC GTCCGGCGCC GCAGCACGCT CGCGGACGGC
CTGCCGGACG CCCCGGGGGT CTATCTCTTC CGCGGTCCGC GCGACGAGGT GCTGTACGTC
GGGGTCTCCA CGACGTCGCT GCGCCGCCGC GTGCGCTCGT ACTTCACGTC GGCGGAGAAG
CGCGGCCGCA TGCAGGAGAT GGTGCGGCTC GCGGTGCGGG TCGACCCCGT GGTCTGTGCG
ACGCCCCTCG AGGCCCGCGT GCGCGAGCTG CGGCTGATCG CGGAGCACGC GCCGCGCTAC
AACCGGCGCT CGCGCGCACC GGAACGCATG CCCTGGGTCC GGCTGACCGA CGAGCCGTTC
CCGCGCCTGT CCGTGGTGCG CGAGGTGCGG GAGGGCCGCG CCCACATCGG CCCGTTCGCG
TCCCGCGCGC TCGCGCAGCA GGCCGTCGAC GCGCTGCACG CGACATTCCC CGTGCGGCAG
TGCACCGGGC GGCTCCCGGT CGTCCCGTCC GCCGACGCGC ACGCGTGCGT GCTCGCGGAG
GTCGGGCGGT GCGGCGCGCC GTGCACGGGC GGCCAGGACG TGGCCGCGTA CGCCCCCGTC
GCAGCGGCCG TCCGCGACGC GATGACCGGC GACCCGCGCG ATGTCGCGCA GGCTCACGCG
GTGCGCATCC GCACCCTCGC GGCACAGGAG CGGTTCGAGG AGGCTGCGAC CGTCCGCGAC
CGCCTCACGT CGTACGTCCG CGGCGCCGGT CGCGCCCAGC GTCACGCCCG GGCGGCCGCC
TGCCGCGAGC TCGTCGCTGC GCGCCGCACC GACGACGGAG GCTGGGAGCT CCTGCTCGTC
CGGCACGGCC GGTTCGCCGG GACGGCGGTG GTCGACCGCC GCACCGACCC GCGCCCGGCC
GTCGCCGCGC TGCGTGCCGG AGGCGAGCAC GTCACGGCGT CGGTGCCGCC TGCGACGGCG
GCACACCCGG AGGAGACCGA CCTGCTGCTG GCGTGGCTCG AGCAACCTGG CGTCCGGCTC
GTGGAGGTCG ACGGCGAGTG GTCCTCCCCC GCCCGGTCCG CGCAGGCCGT GCGGGACGCG
GCGGCAGCCG TCACGCTCGA CCTCGTCGTC CCGCGGCCCG CACTGGTCGA CGACGCGCCG
ACCGCTCCCG GCGCCGCGCC GCAGCGCACG GCATGA
 
Protein sequence
MPGRTRPDGR GAACTTLSDV VGSVRAMSSP RRLRPSHPEV PAPGATGVQV ALDELGTPLS 
DVTFVVVDLE TTGGRAAEDA ITEIGAVKVR GGEVLGEFQT LVDPGGPVPP FIQVLTGITT
SMLVGAPTIG EVLPSFLEFA RGAVLVAHNA PFDVGFLRAA AARSERAWPG FQVVDTVRLA
RRVVLRDEAP NHKLSTLAAL FGATVTPNHR ALADARATVD VLHALLGRLA PLGVTHLEDL
ATATDPVPAD VRRRSTLADG LPDAPGVYLF RGPRDEVLYV GVSTTSLRRR VRSYFTSAEK
RGRMQEMVRL AVRVDPVVCA TPLEARVREL RLIAEHAPRY NRRSRAPERM PWVRLTDEPF
PRLSVVREVR EGRAHIGPFA SRALAQQAVD ALHATFPVRQ CTGRLPVVPS ADAHACVLAE
VGRCGAPCTG GQDVAAYAPV AAAVRDAMTG DPRDVAQAHA VRIRTLAAQE RFEEAATVRD
RLTSYVRGAG RAQRHARAAA CRELVAARRT DDGGWELLLV RHGRFAGTAV VDRRTDPRPA
VAALRAGGEH VTASVPPATA AHPEETDLLL AWLEQPGVRL VEVDGEWSSP ARSAQAVRDA
AAAVTLDLVV PRPALVDDAP TAPGAAPQRT A