Gene Cfla_2057 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_2057 
Symbol 
ID9145953 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp2297190 
End bp2298677 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content70% 
IMG OID 
ProductRNA polymerase, sigma 70 subunit, RpoD subfamily 
Protein accessionYP_003637151 
Protein GI296129901 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.130961 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.746729 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTGGAC GGACCCACGG CAGCGTCGAC ACCGCCTCGG TGCGCGCCGC TTGCGAGGCT 
GCCGGCGTCG AGGGCCCCAA GCGGCTGCGC GCCGTCGTCC GCGGTCTGGG GACGGCCGGT
GTCGACGTGA ACGACCTGGG CACCGCCGGC CGGGCCGTCG CCGCGACGAG CGCGAAGTCG
CGCCCCGCGG CCAAGGCGAC CGTCGCGGAC GCCGGCACGC CCGCGGTGAG GTCGACCCGC
ACCACGAAGG CGGCGGCGAC CAAGGCCACG GCGACGAAGA CCGCCACCGC CAAGAAGGCC
GCATCGAAGG CGTCGCCCGC GACGGAGTCC GACGTCGCCG ACGAGCCGGT CGACGAGGTC
GAGGAGGACG TCGACGTCAC GGAGCTCGAG GACGTCGAGG TCACGGACGC CGACGTCGAG
ACCGACGACG TGGAGGACGT GGCCGAGGAC GACGCCGAGG AGACCGAGAC CAAGCCCGCC
GCCGCGAAGA AGGAGGACGA GCCCGAGGAC ACCGGGTTCG TCTACTCCGA CGCGGACGAC
GACGACGCCC CTGCCCAGCA GGTCGTCACC GCCGGTGCCA CCGCGGACCC GGTGAAGGAC
TACCTCAAGC AGATCGGCAA GGTCGCGCTG CTGAACGCCG AGCAGGAGGT CGAGCTCGCC
AAGCGCATCG AGGCCGGCCT GTTCGCCGAG GAGAAGCTCG CCGAGACACG CGACTCCCTC
GAGCCCAAGC TCCGCCGCGA GCTCGAGTGG ATCGCGCAGG ACGGTCGCCG CGCCAAGAAC
CACCTGCTCG AGGCGAACCT GCGACTCGTC GTCTCGCTGG CCAAGCGCTA CACGGGTCGC
GGCATGCTCT TCCTCGACCT GATCCAGGAG GGCAACCTCG GTCTGATCCG CGCGGTCGAG
AAGTTCGACT ACACCAAGGG CTACAAGTTC TCGACGTACG CCACGTGGTG GATCCGGCAG
GCGATCACGC GTGCGATGGC CGACCAGGCG CGCACCATCC GCATCCCGGT GCACATGGTC
GAGGTCATCA ACAAGCTCGC ACGCGTGCAG CGCCAGATGC TCCAGGACCT GGGCCGTGAG
CCCACCCCGG AGGAGCTCGC CAAGGAGCTC GACATGACGC CCGAGAAGGT CGTCGAGGTC
CAGAAGTACG GCCGCGAGCC CATCTCGCTG CACACCCCGC TGGGCGAGGA CGGCGACAGC
GAGTTCGGCG ACCTCATCGA GGACTCCGAG GCGGTCGTGC CCGCGGACGC CGTGAGCTTC
ACGCTCCTGC AGGAGCAGCT CCACCAGGTG CTCGACACGC TCTCCGAGCG CGAGGCCGGT
GTGGTGTCCA TGCGGTTCGG CCTCACCGAC GGCCAGCCCA AGACGCTCGA CGAGATCGGC
AAGGTCTACG GCGTGACGCG CGAGCGGATC CGTCAGATCG AGTCGAAGAC GATGTCGAAG
CTGCGCCACC CGTCGCGCTC GCAGGTGCTG CGCGACTACC TCGACTGA
 
Protein sequence
MRGRTHGSVD TASVRAACEA AGVEGPKRLR AVVRGLGTAG VDVNDLGTAG RAVAATSAKS 
RPAAKATVAD AGTPAVRSTR TTKAAATKAT ATKTATAKKA ASKASPATES DVADEPVDEV
EEDVDVTELE DVEVTDADVE TDDVEDVAED DAEETETKPA AAKKEDEPED TGFVYSDADD
DDAPAQQVVT AGATADPVKD YLKQIGKVAL LNAEQEVELA KRIEAGLFAE EKLAETRDSL
EPKLRRELEW IAQDGRRAKN HLLEANLRLV VSLAKRYTGR GMLFLDLIQE GNLGLIRAVE
KFDYTKGYKF STYATWWIRQ AITRAMADQA RTIRIPVHMV EVINKLARVQ RQMLQDLGRE
PTPEELAKEL DMTPEKVVEV QKYGREPISL HTPLGEDGDS EFGDLIEDSE AVVPADAVSF
TLLQEQLHQV LDTLSEREAG VVSMRFGLTD GQPKTLDEIG KVYGVTRERI RQIESKTMSK
LRHPSRSQVL RDYLD