Gene Cfla_1803 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_1803 
Symbol 
ID9145696 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp2009618 
End bp2011327 
Gene Length1710 bp 
Protein Length569 aa 
Translation table11 
GC content77% 
IMG OID 
Productprotein of unknown function DUF349 
Protein accessionYP_003636899 
Protein GI296129649 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0809623 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0235589 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGAGC ACCCGACGAC GTCGTCCACG GACGACGCCG AGCAGGCTCC CCCCCTCGCG 
GGCGACGCCG GGGACACCGT CGCAGAGGCC GTCGCCGACG CGCCCGACGC GACGCCGGAG
GCGCTCCCGA CGCCGGAGAC CGAGGCCGCC CCGGCAACCG ACGCGACCGC CGAGGCCGCC
CCGGCAACCG ACGCGACCGC CGAGGCCGCC CCGGAAGCCG ACGCGGCCAC CGCAGACGCG
ACCGCCGACG ACGCGACCAC CGCAGACGCG ACCGCCGACG GCACACCCGC CGACGAGACC
ACGGACGGCA CACCGGCACC CGCCCGCCCG GACCGGCCCG GGCCCCGTCC CTCACCGGCG
TCGGTGCGTC CGCACCCCCG CCCGGGTCGG CCCGCCGCTG CCGCTCCCGT CGTGCCCGTC
ACCCCGGTGC CCACGCCCGA CGAGGAGGCC GCAGCCCAGC ACGCCGCGAC CTTCGGACGC
GTCGACGAGG ACGGCACGGT GCACGTCGTC GAGGCCGCCG GCGAGCGTGC GGTCGGGCAG
TTCCCCGGTG CGAGCGCCCC GGAGGCGCTC GCGCTCTACG TGCGCCGGTT CCTGGACCTG
CAGGCGAAGG TCGCGCTCTT CGAGGCCCGG CTCTCCGCGA CGGACCTGTC CGTCAAGGAG
ATCGACCAGA CCCTCACGCG CCTCTCCGAG GAGCTCGCCG AGCCGGCCGC CGTCGGCGAC
CTCGACGGGC TCCGCGCCCG CCTCGAGGGC CTGCGCGCCC GTGCGGGAGA GCGTCGTGCG
CAGGCGGAGG CCGAGCGCGC GGCGGCCCGT GAGGCCGCCG TCGCCGCCCG CACCGCGATC
GTCGAGCAGG CGGAGAGGAT CGCCTCGACC GACCCGTCCC GCATCCAGTG GCGGCCCGCG
GGTGAGCAGC TCCGCGGGCT GCTGGACCAG TGGAAGGACG CCCAGCGCTC CGGGCCGCGC
ATCGACCGAC CCACCGAGGA GTCGCTCTGG AAGCGGTTCA GCCACGCGCG CACCACCTTC
GACCGTGAGC GGCGGCACTT CTTCGCCGAG CTCGAGGCAC GCAACTCCGA GGCGAAGGCC
GTCAAGGAGC AGCTCGTCGC GGAGGCCGAG CGCCTCGCCT CGAGCACCGA CTGGGGTGGC
ACCTCTGCGG CGTTCCGTGA CCTCATGACC CGCTGGAAGG CGGCCGGCCG CGCGAACCGC
CAGGTCGACG ACGCGCTCTG GGCGCGGTTC CGGACGGCGC AGGACACGTT CTTCGCGGCA
CGCGACGCCG CGAACCAGGC GATCGACGAG GAGTTCCGCG CGAACCTCGT CGTCAAGGAG
GCGCTGCTCG TCGAGGCGGA GGCCCTGCTG CCGATCACCG ACCTCGGTGC CGCGAAGGCG
GCGCTGCGCT CGATCCAGGA CCGCTGGGAC GCCGCGGGCA AGGTCCCGCG CGCCGACGTC
CAGCGTGTCG AGGCCCGCAT GCGGGCGGTC GAGAGCGCCG TGCGCGAGGC CGACTCCGCG
CAGTGGCGTC GCACGAACCC CGAGACGCGA GCACGCGCCG AGGGCGCGGC CGCTCAGCTC
GAGCAGGCGA TCGCGGGTCT CGAGGCCGAT CTGGCCGCCG CCCAGGCCAA GGGCGACAAG
CGCAGGGTCG CCGAGGCGCA GGCTGCGCTC GACGCCCGGC GCGCGTGGCT CGAGCAGGTC
CAGCGCGCCG CGCAGGACGC GCGCGGCTGA
 
Protein sequence
MTEHPTTSST DDAEQAPPLA GDAGDTVAEA VADAPDATPE ALPTPETEAA PATDATAEAA 
PATDATAEAA PEADAATADA TADDATTADA TADGTPADET TDGTPAPARP DRPGPRPSPA
SVRPHPRPGR PAAAAPVVPV TPVPTPDEEA AAQHAATFGR VDEDGTVHVV EAAGERAVGQ
FPGASAPEAL ALYVRRFLDL QAKVALFEAR LSATDLSVKE IDQTLTRLSE ELAEPAAVGD
LDGLRARLEG LRARAGERRA QAEAERAAAR EAAVAARTAI VEQAERIAST DPSRIQWRPA
GEQLRGLLDQ WKDAQRSGPR IDRPTEESLW KRFSHARTTF DRERRHFFAE LEARNSEAKA
VKEQLVAEAE RLASSTDWGG TSAAFRDLMT RWKAAGRANR QVDDALWARF RTAQDTFFAA
RDAANQAIDE EFRANLVVKE ALLVEAEALL PITDLGAAKA ALRSIQDRWD AAGKVPRADV
QRVEARMRAV ESAVREADSA QWRRTNPETR ARAEGAAAQL EQAIAGLEAD LAAAQAKGDK
RRVAEAQAAL DARRAWLEQV QRAAQDARG