Gene Cfla_2003 details


Gene Information

Locus tag: Cfla_2003
Symbol:
ID: 9145898
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 2229966
End bp: 2231573
Gene Length: 1608 bp
Protein Length: 535 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: protein of unknown function DUF245 domain protein
Protein accession: YP_003637097
Protein GI: 296129847
COG category:
COG ID:
TIGRFAM ID:
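
The Start bp, End bp, Gene Length and Protein Length fields listed in this section can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (an assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2229966, 2231573)
print(length_bp)                  # 1608
print(protein_length(length_bp))  # 535
```

Under these assumptions the computed values agree with the Gene Length (1608 bp) and Protein Length (535 aa) given in the record.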


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.42457
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCGTGC GGCGCGTGAT GGGCATCGAG ACCGAGTACG GCGTCCTGCA GCCGGGGCGC 
CCCCTGGCCA ACCCGATGCT GCTGTCGAGC CACGTGGTCG CCGTCCACGC TGCGGCGCGG
GAGGCCGGCC GGGGACGCGC GCGCTGGGAC TACGACGACG AGGACCCGCT GCACGACGCA
CGCGGCTTCC ACCTGCAGCG TGCCTCGGCA CACCCGTCGA TGCTCACGGA CTCGCCGGCC
GTGCCGGCGC CGTCGGGCGA CGCTCCGCAG GAGATGCCGC GCTCGGAGGT CGAGGAGTAC
GAGGACCCGG GAGCGGCCAA CGTCATCCTC ACCAACGGCG CGCGGCTCTA CGTCGACCAC
GCTCACCCGG AGTACTCCGC ACCCGAGGTG ACGACGCCGC GCGACGCCGT GCGCTGGGAC
CGCGCCGGTG AGCTCGTCAT GCTCGACTCG GTACGGCGTC TGGCCGCGAA CCCCACGCTG
CCGGACGTGA CGCTGTACAA GAACAACGTG GACGGCAAGG GCGCCACGTA CGGCACGCAC
GAGAACTATC TCGTCGACCG CTCCGTGCCG TTCGGTGACC TCGTCGCGCG TCTCACGCCC
TTCCTGGTCA CGCGCCAGGT GTTCACCGGT GCCGGGCGCG TCGGGCTGGG GCAGCGCGGC
GAGCACCCCG GATTCCAGCT CTCGCAGCGC GCCGACTACA TCGAGGCCGA GGTCGGGCTT
GAGACGACGC TGCGCCGACC GATCGTCAAC ACGCGCGACG AGCCCCACGC CGACCCTGCC
CGCTGGCGCC GGCTGCACGT GATCATCGGC GACGCGACCA TGCTCGAGAC CGCGACGTAC
CTGCGTCTCG GCACGACGTC GCTCGTGCTG TGGCTGGTCG AGCAGGCGGA CGCCGGCGGC
GCGGCGCGCC GGCTCACGCA GGCCGTCGAC CGGCTCGCGC TGCGTGACCC CGTCCAGGCG
GTGCACCGGG TGTCCCACGA CCTGTCGCTC ACCGAGAAGC TGGAGCTCGC CGACGGCAGG
TACCTCACCG CGCTCGAGGT GCAGTCCGAG TACCTCGCGG CCGTGCGTGG CGCGCTCGAC
GCCGTCGGTG ACGGGCTCGA CGAGCAGACC GCGGACGTGC TCGACCGCTG GGAGTCGGTG
CTGCGGCGGC TGGGCGAGGA GCCCGCGTCG TGCGCGCGGG AGGTGGAGTG GGTCGCCAAG
CTGCGCCTGC TCGACGGCAT GCGCCGCCGC GACCACCTGG CGTGGGACCA CCCGCGACTC
GCGGCGGTCG ACCTGCAGTG GTCCGACGTG CGCCCCGAGC GCGGCCTGTA CCACCGGCTC
GTCGCCGCCG GCGCCGTCGA GCTCCTCGTG ACGCCCGAGG AGGTCGCCGA CGCGGTCGTG
CACCCGCCGC AGGACACCCG CGCCTACTTC CGCGGCGAGG CGGTCGCGCG CTACGGCGGA
CAGATCTCGG CGGCCAGCTG GGACTCGGTG GTGTTCGACG TGCCGGGCGC GCAGACGCTG
CAGCGCGTGC CGATGCGCGA CCCGCTGCGG GGCACGCGCG CGCACGTCGG GGAGCTGCTC
GACCGCAGCC CGGACGCGCG CTCGCTCCTG GCGGCGCTCG GGGGCTGA
 
Protein sequence
MTVRRVMGIE TEYGVLQPGR PLANPMLLSS HVVAVHAAAR EAGRGRARWD YDDEDPLHDA 
RGFHLQRASA HPSMLTDSPA VPAPSGDAPQ EMPRSEVEEY EDPGAANVIL TNGARLYVDH
AHPEYSAPEV TTPRDAVRWD RAGELVMLDS VRRLAANPTL PDVTLYKNNV DGKGATYGTH
ENYLVDRSVP FGDLVARLTP FLVTRQVFTG AGRVGLGQRG EHPGFQLSQR ADYIEAEVGL
ETTLRRPIVN TRDEPHADPA RWRRLHVIIG DATMLETATY LRLGTTSLVL WLVEQADAGG
AARRLTQAVD RLALRDPVQA VHRVSHDLSL TEKLELADGR YLTALEVQSE YLAAVRGALD
AVGDGLDEQT ADVLDRWESV LRRLGEEPAS CAREVEWVAK LRLLDGMRRR DHLAWDHPRL
AAVDLQWSDV RPERGLYHRL VAAGAVELLV TPEEVADAVV HPPQDTRAYF RGEAVARYGG
QISAASWDSV VFDVPGAQTL QRVPMRDPLR GTRAHVGELL DRSPDARSLL AALGG
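
The relationship between the gene sequence and the protein sequence can be illustrated with a short sketch, which is not part of the original record. Translation table 11 uses the same codon assignments as the standard code and permits alternative start codons such as GTG, which are read as methionine at the initiation position. As a simplification, the sketch always reads the first codon as M. The names `translate`, `gc_percent` and `clean` are hypothetical helpers.

```python
# Standard codon assignments (shared by translation table 11), in NCBI TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display, and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS; the first codon is read as M (simplifying assumption)."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    residues = []
    for index in range(0, usable, 3):
        amino_acid = "M" if index == 0 else CODON_TABLE[dna[index:index + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence shown above.
first_blocks = "GTGACCGTGC GGCGCGTGAT GGGCATCGAG"
print(translate(first_blocks))  # MTVRRVMGIE
```

The output matches the first block of the protein sequence. Applying `translate` and `gc_percent` to the full gene sequence would be the natural way to compare against the protein sequence and the GC content listed in the record.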