Gene Cfla_1058 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_1058 
Symbol 
ID9144934 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp1174459 
End bp1176435 
Gene Length1977 bp 
Protein Length658 aa 
Translation table11 
GC content74% 
IMG OID 
Producttranscription termination factor Rho 
Protein accessionYP_003636162 
Protein GI296128912 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.534298 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACAGACA CCATCGACGC CGCCACGCCC GCCACGGGCG GACCGGGACC GGGCAGCATC 
GCCGCGATGC GCCTGCCGGA GCTCCAGGCG CTCGCCGCCC AGCTCGGCGT CAAGGGCACG
AGCAAGATGC GCAAGAGCGA CCTGGTCCAG GCGATCTCCG CCGCGCGCAC CGGTGACCGG
CCGCAGGCTC TGGAGACGCG CGCGACGACG CGCCGCGCCC GGGCCGCGGT CGCCGAGCGC
GTCACGACGA GCGCGCCCGC GCAGGACCTC CCGGTGGCGT CCCCGGCACC GTCGGGGGAG
CCCGGTGCTG CGGCGTCGGA GCGTCGGCCG GCCGGCCGCG ACGCGCTCGC CGGTCTGGAG
GCGGCGCTCG ACGCGCGGCT CGCGCCGGCC GAGCAGCGGG ACGACCGCGC GGCCCGTGCG
GCGGACGTCG TCGGCAACGT CGCAGGTGAA CGCCGCTCGC GCCGCGCCGG CCGCGGCGCG
GGTGCCCCGG AAGCCGCGCC GGCCGAGGAG CCGGGCGAGC GCCCCCAGGG TGAGCGCCGG
CAGAACGAGC GCCGCCACAA CGGTGAGCGC CCCCAGGGCG AGCGTCGCAC CGCGGAGCAG
GACGCCGACG CGCCCGCGCA GGGCGTCCCG GCCGGTGCGG GCGACAAGCC GGAGGACGAG
GAGCGCGGCG GCCGCCGGCG TCGCTCGCGG GACCGCTTCC GCGACCGGGA CCGGGACCGC
AAGAGGGGCC GGTCCCGCAC GGGCCAGGGC GACCTCGCCG GCCTCGACGA GGTCGAGGTC
ACCGACGACG ACGTGCTGCT GCCCGTGGCG GGCATCCTCG ACGTCCTCGA GTCCTACGCG
TTCGTGCGCA CCACCGGCTA CCTGCCGGGG CCCAACGACG TCTACGTCTC GCTCAACCAG
GTCAAGAAGC ACGGCCTGCG CCGGGGCGAC GCGATCACCG GCGCCGTCCG CCAGCCCCGC
GAGGGTGAGC AGCAGCCCAG CGGGGGCCGG CCCAACAAGT TCAACGCGCT CGTGCGGCTC
GACACGGTCA ACGGCCTGCT GCCGGACGAG GCCCGCGAGC GTCCCGAGTT CACCAAGCTC
ACGCCGCTGT ACCCGCAGGA GCGGCTGCGT CTGGAGACCG AGCCGGGCCG GCTGACGCCG
CGCGTCATCG ACATCGTCGC GCCCATCGGC AAGGGCCAGC GCGGCCTCAT CGTCGCGCCT
CCCAAGGCCG GCAAGACGAT CATCATGCAG CAGATCGCCA ACGCGATCAC GCACAACAAC
CCCGAGGTCC ACCTCATGGT CGTGCTCGTC GACGAGCGCC CCGAGGAGGT CACGGACATG
GAGCGGACGG TCAAGGGCGA GGTCATCGCC TCGACCTTCG ACCGCCCCGC GTCGGACCAC
ACGATCGTCG CGGAGCTCGC GATCGAGCGC GCCAAGCGCC TGGTCGAGCT CGGTCAGGAC
GTGGTCGTGC TGCTGGACTC GCTGACCCGG CTGTCGCGCG CCTACAACCT GGCCGCGCCC
GCGTCGGGCC GCATCCTGTC CGGGGGTGTC GACGCCTCGG CGCTCTACCC GCCGAAGCGG
TTCTTCGGTG CGGCGCGCAA CATCGAGAAC GGCGGCTCGC TGACGATCCT CGCCTCCGCG
CTGGTGGAGA CGGGCTCGAA GATGGACGAG GTCATCTTCG AGGAGTTCAA GGGCACCGGG
AACATGGAGC TGCGCCTGTC GCGCCAGCTC GCGGACAAGC GCATCTTCCC GGCGGTGGAC
GTCAACGCGT CCGGTACCCG CCGCGAGGAG GTGCTCATGA GCAACGACGA GCTGAAGATC
ATCTACAAGC TGCGCCGCGT GCTCGGCGGG CTGGACCAGC AGCAGGCCAT CGAGCTGCTG
CTCGGCAAGC TCCGCGAGAC GAAGTCCAAC GTGGAGTTCC TGCTCCAGGT GCAGAAGACG
ACGCCGGGCA ACAACGGTCC CGCGCTCGAG GAGGGCGTCG GCCGCACGGT CGTCTGA
 
Protein sequence
MTDTIDAATP ATGGPGPGSI AAMRLPELQA LAAQLGVKGT SKMRKSDLVQ AISAARTGDR 
PQALETRATT RRARAAVAER VTTSAPAQDL PVASPAPSGE PGAAASERRP AGRDALAGLE
AALDARLAPA EQRDDRAARA ADVVGNVAGE RRSRRAGRGA GAPEAAPAEE PGERPQGERR
QNERRHNGER PQGERRTAEQ DADAPAQGVP AGAGDKPEDE ERGGRRRRSR DRFRDRDRDR
KRGRSRTGQG DLAGLDEVEV TDDDVLLPVA GILDVLESYA FVRTTGYLPG PNDVYVSLNQ
VKKHGLRRGD AITGAVRQPR EGEQQPSGGR PNKFNALVRL DTVNGLLPDE ARERPEFTKL
TPLYPQERLR LETEPGRLTP RVIDIVAPIG KGQRGLIVAP PKAGKTIIMQ QIANAITHNN
PEVHLMVVLV DERPEEVTDM ERTVKGEVIA STFDRPASDH TIVAELAIER AKRLVELGQD
VVVLLDSLTR LSRAYNLAAP ASGRILSGGV DASALYPPKR FFGAARNIEN GGSLTILASA
LVETGSKMDE VIFEEFKGTG NMELRLSRQL ADKRIFPAVD VNASGTRREE VLMSNDELKI
IYKLRRVLGG LDQQQAIELL LGKLRETKSN VEFLLQVQKT TPGNNGPALE EGVGRTVV