Gene Cfla_2038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_2038 
Symbol 
ID9145934 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp2272026 
End bp2275547 
Gene Length3522 bp 
Protein Length1173 aa 
Translation table11 
GC content75% 
IMG OID 
ProductDNA polymerase III, alpha subunit 
Protein accessionYP_003637132 
Protein GI296129882 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.12539 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.70219 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGAGG GGGACCCGAG GGTGGGGACA GGCACGGGGA CGGACGCCCG GACGGGTGCG 
GGGCGCGGGG CCGGGGCCGA CTGGCCGCCG CGGTACGCCG AGCTGCACGC GCACTCGGCG
TTCAGCTTCC TCGACGGCGC CAGCCAGCCC GAGGAGCTCG CCGCCGAGGC GGCGCGCCTC
GGGCAGAGCG CGCTGGCCCT CACCGACCAC GACGGGCTGT ACGGCGTGGT GCGGTTCGCG
CAGGCCGCGC GTGCCGTCGG GCTGCCCACG GTCTTCGGCG CCGAGCTGCA CCTGCCCGCG
CCCGACCCGC GCCGGCACCC GCGCGAGAAG CGCCCGACCC CCGGCCCGCC CGTGCTCGAC
GCGCCCACCG GCGTACCCGA CCCGCGCGCG TCCCACCTGC TGGTGCTCGC CCGCGGCGCC
GACGGGTACC GCGCGCTGTC GCGCGCGATC GCCGAGGGGC ACCTGCGCAC CGGGCGCAAG
GGCGCCGCGG AGTATCACCT GGAGGAGCTC GCCGAGGCGG CCGCCGGGCA GTGGCTCGTG
CTCACCGGCT GCCGCAAGGG CGCCGTGCGC CGTGCGCTCG CCGGGGGCGA CCCGTCCGGG
GTCCTGGGCG TCGCACCGGG CGGGATCGAG GCGGCGCGCA CCGAGCTCGA CCGGCTCGTC
GCGCTGTTCG GGCGGGACAA CGTCGCCGTC GAGACCACGA TGCACGGCGA CGCGTACGAC
ACCGACCGCG CCGACGCGCT CGCGACCCTC GCCGCCGACG CGCGCCTGCC GCTCGTCGCG
ACCGGCAACG TGCACTACGC CACCGAGCGC GACGCGGACC TCGCGCTCGC GCTCGCGGCG
GTGCGCGCGC GCTCGTCGCT CGACGACCTC GACGGCTGGC TGCCCGGCGC CCCCGTGGCG
CACCTGCGCT CGGCGGCCGA GATGCTGCAC CTGCACCGCC GGCACCCGAG CGCCGTGACC
ACCGCGGCGG ACCTGGCTGC CGAGTGCGCG TTCGACCTGT CGCTCGTCGC ACCGAGCCTG
CCGCCGTACC CCGTCCCGGA CGGGCACACC GAGGCGACGT GGCTCCGCGA GCTCGTGCGC
CGCGGCGCGG CGGAGCTGTA CGGCCCACCG GACGCCGAGC GGGTGCCCGG CGCGTACGCG
CAGCTCGAGC ACGAGCTGCG CGTCATCGAG GACCTCGGGT TCCCCGGGTA CTTCCTCGTC
GTCTACGACC TCGTCGACTT CTGCCGCCGC CAGGGGATCA TGGCGCAGGG GCGCGGCTCA
GCCGCCAACT CCGCGGTCTG CTACGTGCTG CGCGTCACGG CTGTCGACCC CGTGAAGCAC
GGGCTGCTGT TCGAACGGTT CCTCGCCCCG GAGCGTGACG GCCCGCCGGA CATCGACGTC
GACATCGAGT CCGCGCGCCG CGAGGAGGTC ATCCAGCACG TCTACGCCAC GCACGGGCGC
TCGCACGCCG CCCAGGTCGC CAACGTCATC TCCTACCGGC CGCGCTCGGC CGTGCGGGAC
GCGGCGCGCG CGCTGGGGTA CGACGCGGGG CAGCAGGACG CGTGGTCCAC GTCGATCGAG
CGGTGGGGGA GCCTGCGCGG CCCGGAGAAG CCCAGCGCGT GGTGGCACCT CACGCGCTCG
GGGCCGGTCG GGCCGGGCAG CGAGGTCGCC GGCATGCCGA CGGCCGACAA CCACGACGTC
GTCCCCACGC GCCTGCCGCC GTCGGCCGCG GAGGCCGAGG AGATCCCCGA GCACGTCATC
GACCTCGCCG AGCGGTTCCT GCGCCTGCCG CGCCACCTGG GCATCCACTC CGGCGGCATG
GTGATGTGCG ACCGGCCCGT CATCGAGGTG TGCCCCGTGG AGTGGGCGCG CATGGAGGGG
CGCACGGTCC TGCAGTGGGA CAAGGAGGAC TGCGCCGACG CGGGCCTGGT GAAGTTCGAC
CTGCTGGGGC TGGGCATGCT CACCGCGCTG CGCCTGGCGT TCACCGAGGT CGAGAAGCAC
GAGGGCGTCA CGCTCGACCT GCACGGCCTG CCGCACGAGG ACCCCGCGGT GTACGAGCTG
CTGTCCGCCG CCGACACCGT CGGGGTGTTC CAGGTGGAGT CGCGCGCGCA GATGGGGACG
CTGCCGCGCC TGCGGCCCTC GACGTTCTAC GACATCGTGG TCGAGGTCGC GCTCATCCGG
CCCGGGCCCA TCCAGGGCGG ATCGGTGCAC CCGTTCATCA ACCGCGCCAA GGGCCGTGAG
CCCGTCACGT ACCTGCACCC GCTGCTGGAG AAGTCGCTCG GCAAGACCCT CGGTGTCCCG
CTGTTCCAGG AGCAGCTCAT GCAGATGGCC ATCGACGTCG CGGACTTCAC GCCCGCCGAG
GCCGACCAGC TGCGCCGCGC GATGGGGTCG AAGCGGTCGA TGGAGCGCAT GGAGGCGATC
CGCTCACGGC TCATGGAGGG CATGGCCGCC AACGGCATCG GTTCGCAGGT CCGCGAGCAG
ATCTTCGACA AGCTCAAGGC GTTCGCGGAC TTCGGGTTCC CCGAGTCGCA CGCCTACTCG
TTCGCGTTCC TCGTCTACGC CAGCTCGTGG CTCAAGGTGC ACCACCCCGC CGCGTTCTAC
GCCGGGCTGC TCGCGGCGCA GCCCATGGGG TTCTACTCGC CGCAGTCGCT GGCGGCGGAC
GCGCGCCGCC ACGGCATCGA GGTGCTGCGC CCCGACGTGC TCGCGTCCGA GGTGCTGGCC
GTCGTCGAAC GCCTCGGACC ACCGCCGCAC GGCGGGGAAC CCCGGCTGGT GCCGCAGCCC
AGCGGCACCG GTCCGACCCG ACCCGTGGGG GTGCGCACCG GGGAGGGGTC GGTGCGCACC
CTCGCGGTGC GGCAGGGGCT CACGCAGGTG CGGACCATCG GTGAGGACGT CGCGCGTGCG
CTGGTCGACG CGCGCACGGC CGACGGCCCG TTCACCGACC TGCAGGACCT CGTGCGGCGC
GTGCACCTGA CGACCGCGCA GATCGAGGCG CTGGCCACAG CCGGCGCGCT GGACTCCCTC
GGCGTCGACC GCCGCTCGGG ACTGTGGGCC GCAGGCGCGC TCGCGCAGGA GGGGCCGGAC
ACGCTGCCGG GCGTCGCGGT GGGGGTGAAG GCCCCGGCCC TGCCGGGCCT GTCGGGCGTC
GAGGTCGCGA CGGCCGACGT GTGGGCCACG GGCGTGTCCG TGGACTCCTA TCCGACGCAG
TTCGTCCGCG ACGGCCTCGA CGCGGCCGGT GTGCTGACCG TCGAGCAGGC GTTCCGCACC
GAGGAGGGAC GCCGCGTTGC CGTCGCCGGC GTCGTCACCC ACCGCCAGCG TCCCGGCACC
GCGCAGGGCG TGACGTTCCT GTCCCTGGAG GACGAGACCG GCCTGCTCAA CATCGTGTGC
TCCGCGGGCC TGTGGCAGAG GTTCCGGCGC ACGGCCCGTA CCGCCAAGGC GATGGTCGTG
CGCGGGCGGA TCGAGAAGGC CGACGGTGCG ACGAACCTCG TCGCGGAGCA CCTGAGCCCG
CTGTCGCTGA AGGTGCGCAG CCGGTCGCGG GACTTCCAGT GA
 
Protein sequence
MTEGDPRVGT GTGTDARTGA GRGAGADWPP RYAELHAHSA FSFLDGASQP EELAAEAARL 
GQSALALTDH DGLYGVVRFA QAARAVGLPT VFGAELHLPA PDPRRHPREK RPTPGPPVLD
APTGVPDPRA SHLLVLARGA DGYRALSRAI AEGHLRTGRK GAAEYHLEEL AEAAAGQWLV
LTGCRKGAVR RALAGGDPSG VLGVAPGGIE AARTELDRLV ALFGRDNVAV ETTMHGDAYD
TDRADALATL AADARLPLVA TGNVHYATER DADLALALAA VRARSSLDDL DGWLPGAPVA
HLRSAAEMLH LHRRHPSAVT TAADLAAECA FDLSLVAPSL PPYPVPDGHT EATWLRELVR
RGAAELYGPP DAERVPGAYA QLEHELRVIE DLGFPGYFLV VYDLVDFCRR QGIMAQGRGS
AANSAVCYVL RVTAVDPVKH GLLFERFLAP ERDGPPDIDV DIESARREEV IQHVYATHGR
SHAAQVANVI SYRPRSAVRD AARALGYDAG QQDAWSTSIE RWGSLRGPEK PSAWWHLTRS
GPVGPGSEVA GMPTADNHDV VPTRLPPSAA EAEEIPEHVI DLAERFLRLP RHLGIHSGGM
VMCDRPVIEV CPVEWARMEG RTVLQWDKED CADAGLVKFD LLGLGMLTAL RLAFTEVEKH
EGVTLDLHGL PHEDPAVYEL LSAADTVGVF QVESRAQMGT LPRLRPSTFY DIVVEVALIR
PGPIQGGSVH PFINRAKGRE PVTYLHPLLE KSLGKTLGVP LFQEQLMQMA IDVADFTPAE
ADQLRRAMGS KRSMERMEAI RSRLMEGMAA NGIGSQVREQ IFDKLKAFAD FGFPESHAYS
FAFLVYASSW LKVHHPAAFY AGLLAAQPMG FYSPQSLAAD ARRHGIEVLR PDVLASEVLA
VVERLGPPPH GGEPRLVPQP SGTGPTRPVG VRTGEGSVRT LAVRQGLTQV RTIGEDVARA
LVDARTADGP FTDLQDLVRR VHLTTAQIEA LATAGALDSL GVDRRSGLWA AGALAQEGPD
TLPGVAVGVK APALPGLSGV EVATADVWAT GVSVDSYPTQ FVRDGLDAAG VLTVEQAFRT
EEGRRVAVAG VVTHRQRPGT AQGVTFLSLE DETGLLNIVC SAGLWQRFRR TARTAKAMVV
RGRIEKADGA TNLVAEHLSP LSLKVRSRSR DFQ