Gene Cfla_1969 details

Gene Information

Locus tag: Cfla_1969
Symbol:
ID: 9145863
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 2191218
End bp: 2193944
Gene Length: 2727 bp
Protein Length: 908 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: DNA polymerase I
Protein accession: YP_003637063
Protein GI: 296129813
COG category:
COG ID:
TIGRFAM ID:
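
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which the following minimal sketch checks. It assumes the coordinates are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2191218
end_bp = 2193944

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes one terminal stop codon that is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # 2727 bp
print(protein_length, "aa")   # 908 aa
```

Under these assumptions the computed values match the reported 2727 bp and 908 aa.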


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.223022
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGTGCCG ACCAGCCAGC GACCAGCACC TCGACGACCC CGCGCCTGCT CCTCATCGAC 
GGGCACTCGA TGGCCTACCG CGCGTTCTTC GCGCTGCCGG TCGAGAACTT CTCGACGTCG
TCCGGGCAGC CGACGAACGC GGTGTTCGGG TTCACCTCGA TGCTCGCCAA CCTGCTGCGC
GACGAGGAGC CGACGCACGT GGCGGTCGCG TTCGACGCCG GGCGCACCAC GTTCCGCACC
GAGCGGCTCG AGTCCTATAA GGGCAACCGC TCCGCGACGC CCGAGCCGTT CCGCGGCCAG
GTCGACGTCA TCCGCCAGCT CCTCGCGACG ATGCACGTGC AGGTGCTCGA CAAGCCCGGC
TTCGAGGCCG ACGACATCCT CGCGACCCTG ACGGCCCAGG CCGGCGAGCA GGGCATGGAG
GTCCTCGTCT GCTCGGGCGA CCGTGACACG TTCCAGCTGG TCGGGCCGCA GGTCACCGTC
CTGTACCCCG TGCGGGGCGT GTCGGAGATG TCGCGCATGA CCCCCGAGGC GGTCGAGGCG
AAGTACGGTC TGCCGCCCGC GCGCTACCCG GACCTCGCCG CGCTCGTGGG GGAGACGAGC
GACAACCTGC CCGGCGTCCC CGGCGTCGGC CCCAAGACGG CCGCCAAGTG GATCACGCAG
TACGACGGCC TGGCCGGCGT GCTCGAGAAC GCCGAGCGCA TCACCGGCAA GGCCGGCGAG
TCGCTGCGCG CGAACCTCGC GCAGGTCGCG CTCAACCGCG AGCTCAACGA GCTGCGGACC
GACCTCGACC TGCCGCTCGG ACCCGAGGAC CTCGCGGTCC GCCCGTGGGA CCGGGCGGCC
CTGCACCAGA TGCTCGACGA GCTCGAGTTC CGCACGCTGC GGGACCGGCT CTTCGCGATG
CTCCCGGACG AGTCCCGTGA CGAGCGCGTC GCGACCGTCG CGGCGCTCGA CCTCGTCGAG
ACGGGCGTGG GCGGTCTCGG TGCCTGGCTG GACGCGCGCG TCGACCAGGT CCTCGGCCTC
GACGTGCGCG GAACGGGCGC CCCGGGGCGC GGTGACGCGT GGGGCGTCGC GGTCGCCGAC
GGTGCCGGTC AGGCCGTGGC CTACGACCTC ACCGCGATCG ACCCTGCGGA CGAGACGGCT
CTCGCCGCCT GGCTGGCGGA CCCGCAGCGA CCCAAGGCGC TGCACGCCGC CAAGGAGGCG
TCGCACGCGC TCGCGGGCCG CGGCCTGGAC CTCGAGGGCG TCACGTTCGA CACCGAGCTG
GCGGCGTACC TGTGCCAGCC GGACCGGCGC GCGTACGACC TGCCGGACCT CGCGATCGGC
TACCTGCGCC GCGAGCTCGG TGGCGACGAC GGCTCGTCGG CCGGCCAGGG CGCGCTCGAC
CTCGAGGTGG ACGGGGCCGA CGAGGGTCGC CGGGCCGCCG TGCGGGCCGC TGCCGTGCGG
GACCTCGTCG ACGTCCTCGG CGGTGAGGTC GCCGATCGTG GGGCCACGAC GCTGCTGTCG
GACCTCGAGC TCCCGCTGCA GGCCGTGCTC GCCCGCCTCG AGCGCACCGG CATCGCGATC
GACCACGCGT ACCTCTCGGG ACTCGAGCGC GAGTTCGACG GCCAGGTGCA GGGGGCGGCC
GCCGACGCGT ACGCGGTCAT CGGCCGCGAG GTGAACCTCG GCTCGCCGAA GCAGCTCCAG
GAGGTGCTGT TCGACCAGCT CAGGATGCCG AAGACCAAGC GCATCAAGAC CGGCTACACG
ACGGACGCCA ACGCCCTCAC CGACCTGTTC GCGCGCACCG GGCACCCCTT CCTCGAGCAC
CTGCTGGCGC ACCGCGACGC CATCCGGCTG CGCCAGACGG TCGAGGGGCT GCTGCGGTCC
GTCGCCGACG ACGGTCGCAT CCGCACGACG TTCCAGCAGA CCATCGCGGC GACCGGCCGG
CTGTCCTCGG CGGACCCGAA CCTGCAGAAC ATCCCGATCC GCACCGACGC GGGCCGGCAG
ATCCGCCGGG CATTCGTCGT GGGCCCCGGC TACGCGACGC TCCTGACCGC CGACTACTCC
CAGATCGAGA TGCGCATCAT GGCGCACCTG TCGGGCGACG AGGGGCTCAT CGCGGCGTTC
CGCTCGGGGG AGGACCTGCA CAGCTACGTG GGCTCGCGGG TGTTCGGCGT GCCCACGGAC
GAGGTCACAC CGACGATGCG GTCGAAGATC AAGGCGATGA GCTACGGCCT GGCGTACGGC
CTGTCGTCGT ACGGGCTCTC GCAGCAGCTC GCGATCGAGG TGTCGGAGGC GGCGGCGCTC
ATGACGGACT ACTTCGAGCG GTTCGGCGGC GTGCGCGACT ACCTGACCGG CGTCGTGGAC
CAGGCCCGCG CGACGGGCTA CACCGCGACG GTCCTCGGCC GACGCCGCTA CCTTCCGGAC
CTCACGAGCG ACAACCGCCA GCGTCGCGAG GCCGCCGAGC GCATGGCGCT CAACGCGCCG
ATCCAGGGCA GCGCGGCGGA CCTCATCAAG GTCGCGATGC TCGGCGTCGA CGGTGAGCTC
ACCCGTCGGG GGCTGCGCTC GCGGATGCTC CTGCAGGTGC ACGACGAGCT GGTGCTCGAG
GTCGCCGAGG GTGAGCGCGA GGAGGTCGAG GAACTCGTCC GGACGCAGAT GGCGGCGGCG
GGCAGCGGCC TGCCCGACGG CCCGCTGGAC GTCCCCCTCG ACGTCTCCGT GGGCGTCGGC
GAGAGCTGGC ACGCTGCCGG GCACTGA
 
Protein sequence
MSADQPATST STTPRLLLID GHSMAYRAFF ALPVENFSTS SGQPTNAVFG FTSMLANLLR 
DEEPTHVAVA FDAGRTTFRT ERLESYKGNR SATPEPFRGQ VDVIRQLLAT MHVQVLDKPG
FEADDILATL TAQAGEQGME VLVCSGDRDT FQLVGPQVTV LYPVRGVSEM SRMTPEAVEA
KYGLPPARYP DLAALVGETS DNLPGVPGVG PKTAAKWITQ YDGLAGVLEN AERITGKAGE
SLRANLAQVA LNRELNELRT DLDLPLGPED LAVRPWDRAA LHQMLDELEF RTLRDRLFAM
LPDESRDERV ATVAALDLVE TGVGGLGAWL DARVDQVLGL DVRGTGAPGR GDAWGVAVAD
GAGQAVAYDL TAIDPADETA LAAWLADPQR PKALHAAKEA SHALAGRGLD LEGVTFDTEL
AAYLCQPDRR AYDLPDLAIG YLRRELGGDD GSSAGQGALD LEVDGADEGR RAAVRAAAVR
DLVDVLGGEV ADRGATTLLS DLELPLQAVL ARLERTGIAI DHAYLSGLER EFDGQVQGAA
ADAYAVIGRE VNLGSPKQLQ EVLFDQLRMP KTKRIKTGYT TDANALTDLF ARTGHPFLEH
LLAHRDAIRL RQTVEGLLRS VADDGRIRTT FQQTIAATGR LSSADPNLQN IPIRTDAGRQ
IRRAFVVGPG YATLLTADYS QIEMRIMAHL SGDEGLIAAF RSGEDLHSYV GSRVFGVPTD
EVTPTMRSKI KAMSYGLAYG LSSYGLSQQL AIEVSEAAAL MTDYFERFGG VRDYLTGVVD
QARATGYTAT VLGRRRYLPD LTSDNRQRRE AAERMALNAP IQGSAADLIK VAMLGVDGEL
TRRGLRSRML LQVHDELVLE VAEGEREEVE ELVRTQMAAA GSGLPDGPLD VPLDVSVGVG
ESWHAAGH
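
The gene sequence begins with GTG rather than ATG, yet the protein begins with M. In translation table 11, GTG is an accepted alternative initiation codon and is read as methionine at the start of a CDS. The sketch below illustrates this on the first ten codons only. The partial codon map and the function name `translate_prefix` are hypothetical helpers written for this example, not part of any database tool.

```python
# First 30 nucleotides of the gene sequence above, with spaces removed.
dna = "GTGAGTGCCGACCAGCCAGCGACCAGCACC"

# Partial codon map covering only the codons that occur in this prefix.
CODONS = {
    "GTG": "V", "AGT": "S", "GCC": "A", "GAC": "D", "CAG": "Q",
    "CCA": "P", "GCG": "A", "ACC": "T", "AGC": "S",
}

def translate_prefix(seq):
    """Translate a short CDS prefix, treating the first codon as an initiator."""
    usable = len(seq) - len(seq) % 3
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    residues = [CODONS[codon] for codon in codons]
    # An initiation codon is read as methionine even when it normally
    # encodes another residue (GTG usually encodes valine).
    residues[0] = "M"
    return "".join(residues)

peptide = translate_prefix(dna)
print(peptide)  # MSADQPATST
```

The result matches the first ten residues of the protein sequence listed above.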