Gene Cfla_0124 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_0124 
Symbol 
ID9143989 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp149535 
End bp152675 
Gene Length3141 bp 
Protein Length1046 aa 
Translation table11 
GC content79% 
IMG OID 
Productputative exonuclease 
Protein accessionYP_003635243 
Protein GI296127993 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0895241 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGCTGC GCTCGCTCAC CGTCCAGGCG ATCGGCCCGT TCGCAGGCCG GCACACGGTC 
GACCTCGACG CGCTCGGGCA GTCCGGGCTG TTCCTGCTGG AGGGGCCGAC CGGGTCGGGC
AAGTCGACGC TCATCGACGC CGTCGTGTTC GCGCTGTACG GGAAGGTCGC AGGCACCGAC
GCGTCCGACG AGCGCCTGCG CTCGGCGTAC GCGGCGGACG ACGTGGAGAG CGTCGTCGAC
CTGGTCTTCG AGGTGCCGTC GGGCGTCTAC CGCGTGCGGC GCACCCCCGC GTACCGCCGG
GCCAAGCGCC GTGGCGAGGG CACGACGACG GCGCAGGCGA GCGTCAAGGC CTGGCGCCTG
CCGGCCGACG TGGAGCTGGG CGAGGACCCC GAGGAGCTCG ACGGCGTCGG CGTGCTGCTC
GGCACGCGCC TGGACGAGGT GGGCCACGAG CTGCAGCGGA TCGTCGGGCT GGACCGCACG
CAGTTCGTCC AGACGGTCGT GCTGCCCCAG GGCGAGTTCG CGCGGTTCCT GCGGGCCACC
GGCGAGGAGC GCCGGGTGCT GCTGCAGAAG ATCTTCGGCA CCCAGGTGTA CGAGCAGCTG
CAGCAGCGGC TCGCCGCGCT GCGGGCCGAG GCGGCGCGGA CCGTCGAGGC GGCGCGTGCC
GGACTCGGGG AGGCGGTCGC CCACCTGCTC GGCGCCTGCG CGCTCGGCCC GGAGGAGCAC
GCGGCGGCAC GCGTCGCGCT CGACGAGGCC GTCGCGGGCG GCCCGGGCGT CGCGGTGCGC
GTCGCGGGCG TCGTCGTGGC CACGACGCGC GCGCTGGACG CCGCCGCCGG CACGCTCGCG
CAGGAGGCTG CCTCCGCCCG CACGTCATGG GACGCGGCAC GGGCCGCGCA CGAGGCGGCG
CGCGCGACCG CGGCTCTCGC CGCCCGGCGG GACGCGCTGC GCGCGGAGCG TCGCGCGCTC
GACGCCGCGG CCGGGCAGCA CGAGGACGAC GTGCAGCGCC TCGCACGGGC GCGCGCGGCT
GCCGCGGTGC GTCCGCTGCT CACCGGGTGG CAGGACGCGA GGACGGCGCA CGAGGCCGGG
ACCAAGTCGC TGGTCGCTGC GTGCGACACG GCCCCGGCGG ACCTGCTGCC GCCCGGCGGC
GCGGCGGTCC TCGTCGTCGG CGGGCGCGAC GGCGGTCCGG GACCCGACGC CGGCGGCACC
GAGCGTCTCG ACGGTGTGCT CGACGCGTGG CGGCCCCACC TGCGTGCCGA GCGTGACGCG
GCCGCGGACG CCGCCGCCGC GTTGCGTCGC ACGGTCGAGG TCGAGGCCGG GCTCACCGAG
CGCCGCCGCG CGGTGCGCGA GCTCGCGGCG GTGCTCGAGG AGCTGCGGTC GGAGGTCGAC
GCCGCGACCA CGTGGCTCGC CGGACGCCCG GGGAGCCGAG CGGCCCTCGA GCAGGAGCGG
GACGCCGCGC GCGCGCTCGC GGGCCGGTCC GACGCCGCCG AGCAGGCCCG GACGGCCGCG
CGGGCTCTCG TCGCGGACGT CGCCGCCCTG ACGGCGGCGC GCGCCGACCT CGCAGCCGCG
CAGGACGCCG TCGCCGGTGG TGCGGACGCC GCACGGGCCG CGGTCGCCGC CGAGGCGTCG
CTGCGTGCCG CACGCGTCGC CGGCCTGGCC GGCGAGCTCG CGGCCGGTCT CGTCGCCGGC
GACCCCTGCC CGGTGTGCGG TGCCTGTGAG CACCCTGCGC CGGCGTCGGT GGGCGCCGAC
CACGTGACCG CCGAGCAGGT GCGGGCCGCC GAGGAGGCGC GCGCCGCTGC CGAGTCGCAG
CTCGCCGCGC TCGGGGCGCG TCGGGCGGCG CTCGCCGAGC GCGTCGCCGG GCTGGCCGCG
CGCGTCGGGG AGCACGACGC CGGTGCGGCC GCGGCGCTGC TGCGGGCGGC CGAGGACGAC
GTCGCCGCTG CTGCCGCCGC CCACGCCCGG GTCGCCACCC TCGAGGCCGA CCTCACGGCG
CACGACGCGG CCACCCGGGA CCGTGCGCAG GTGCGCGAGG AGGTCCTCGG CCAGGTCCGC
GCCGCCGAGC TCACGCTCGA GACCGAGCGC GACGCTCTCG GGCGGGCCGA GGCCGAGGTC
GCGGACGCGC GCGCCGACCA CCCGACCGTC GCCGCCCGGC ACGCCGCCCT CGACGCGCGG
GCGCGCCAGG CGGTGGCGCT GCTCGACGCC CTCGACACCG AGCGTGCCGC GGCCGCCGAC
GAGCAGCGTC GTCACGCCGA GCTCGACGCC GCGCTGGCCG AGCACGGGTT CGCGGACGTC
GCCGACGCCC GCGGCGCCTG GTGCCCCGCG ACGGAGCTGG CCGACCTCGA GCGTCGCGTC
GTCGCCCGCA CCGCCGACGA GGCACGCGTC GACGCCGGCC TGGCCGACCC CGCGCTCGTC
GCGCTGCCCG AGGACGTCGC ACCCGACCTC GCCGGGACCG AGGGTGCCGA GCGTGCGGCG
CGGCGCAGCG CCGACGACCT CGAGGGCCGC GCCCGCGTCG CCGCGGCGCG CGCCGAGGCC
GCCGCGGACG CCGCAGCACG CGTGCGCGCC GCGGCCGAGG CGCTCGACGC CGCGGCAGCC
GCAGCGGCGC CCGTGACCCG CATGGCGAAC CTCGCGTCCG GGACGGGCGC CGACAACCCG
CACGCCCTGT CGCTGGCCAC CTACGTGCTC GGGCGTCGCT TCGAGGACGT CGTCGCCGCC
GCCAACGAGC GTCTCGCCGT GATGTCCGAC GGGCGGTTCG AGCTCGTCCG GTCCGACGAG
AAGGAGGACG TGCGTACCCG GGCGGTCGGA CTGGCGATGC GCGTCGTCGA CCACCGCACC
GAGCGTGCGC GTGACCCGCG GACCCTGTCC GGCGGCGAGA CGTTCTACGT GTCGCTGTGC
CTCGCGCTCG GCATGGCCGA CGTCGTCACG GCCGAGGCGG GAGGCGTCGA GCTCGGCACC
CTGTTCGTGG ACGAGGGCTT CGGTGCGCTC GACCCGCACG TCCTCGACCA GGTGCTGGCC
GAGCTCGGCC GGCTGCGGCA CGGCGGGCGC GTCGTCGGCA TCGTCTCCCA CGTGGAGACC
CTCAAGCAGG CGGTCGCCGA CCGGATCGAG GTGCGGCCGA CGCCCGCCGG TCCGAGCACG
CTCACCGTGC TCGCAGGCTG A
 
Protein sequence
MRLRSLTVQA IGPFAGRHTV DLDALGQSGL FLLEGPTGSG KSTLIDAVVF ALYGKVAGTD 
ASDERLRSAY AADDVESVVD LVFEVPSGVY RVRRTPAYRR AKRRGEGTTT AQASVKAWRL
PADVELGEDP EELDGVGVLL GTRLDEVGHE LQRIVGLDRT QFVQTVVLPQ GEFARFLRAT
GEERRVLLQK IFGTQVYEQL QQRLAALRAE AARTVEAARA GLGEAVAHLL GACALGPEEH
AAARVALDEA VAGGPGVAVR VAGVVVATTR ALDAAAGTLA QEAASARTSW DAARAAHEAA
RATAALAARR DALRAERRAL DAAAGQHEDD VQRLARARAA AAVRPLLTGW QDARTAHEAG
TKSLVAACDT APADLLPPGG AAVLVVGGRD GGPGPDAGGT ERLDGVLDAW RPHLRAERDA
AADAAAALRR TVEVEAGLTE RRRAVRELAA VLEELRSEVD AATTWLAGRP GSRAALEQER
DAARALAGRS DAAEQARTAA RALVADVAAL TAARADLAAA QDAVAGGADA ARAAVAAEAS
LRAARVAGLA GELAAGLVAG DPCPVCGACE HPAPASVGAD HVTAEQVRAA EEARAAAESQ
LAALGARRAA LAERVAGLAA RVGEHDAGAA AALLRAAEDD VAAAAAAHAR VATLEADLTA
HDAATRDRAQ VREEVLGQVR AAELTLETER DALGRAEAEV ADARADHPTV AARHAALDAR
ARQAVALLDA LDTERAAAAD EQRRHAELDA ALAEHGFADV ADARGAWCPA TELADLERRV
VARTADEARV DAGLADPALV ALPEDVAPDL AGTEGAERAA RRSADDLEGR ARVAAARAEA
AADAAARVRA AAEALDAAAA AAAPVTRMAN LASGTGADNP HALSLATYVL GRRFEDVVAA
ANERLAVMSD GRFELVRSDE KEDVRTRAVG LAMRVVDHRT ERARDPRTLS GGETFYVSLC
LALGMADVVT AEAGGVELGT LFVDEGFGAL DPHVLDQVLA ELGRLRHGGR VVGIVSHVET
LKQAVADRIE VRPTPAGPST LTVLAG