Gene Cfla_3036 details


Gene Information

Locus tag: Cfla_3036
Symbol:
ID: 9146948
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 3373048
End bp: 3375801
Gene Length: 2754 bp
Protein Length: 917 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: MMPL domain protein
Protein accession: YP_003638118
Protein GI: 296130868
COG category:
COG ID:
TIGRFAM ID:
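
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. Neither convention is stated in the record, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumption)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


# Values copied from the record above.
length = gene_length_bp(3373048, 3375801)
print(length)                     # 2754
print(protein_length_aa(length))  # 917
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields.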


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.101076
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCCTCTG CCCTCTACCG CCTCGGCCGG GCCGCGTTCG CCCGACGACG TGCCGTCATC 
GGCGCCTGGG TGGGGCTGCT CGTCCTCATC GGCGCGGCCG CCGGCCTGCT CGGCGGCACG
CTGGACAACT CGGTGTCGAT CCCCGGCACC GAGTCCCAGG CCGCGCTGGA CCGGCTCACC
GCGACCTTCC CCCAGGCCGC GGGCACGACG GCCCAGGTGC TCGTCGTCGG CGAGGACGGT
GCACAGGTCG ACGACCCGGC CGTGGTCACC GCCGTCGAGG ACTCCGTCGA CGCGTTCCTC
GAGGTCGAGA GCGTCACGTC CGCCGTCTCG CCGTTCGACG ACACGCTGCC CGGCGCGTCG
GCCGTCAGCG ACGACGGCGA GGCCGCGCTG CTGACCCTCT CGCTCGAGGG CGAGGGCGTC
GCCATCGGCG ACGAGGTCAA GGACCGCCTG CGGGACGTCG CCGACGAGCT CGACGCCGCG
CTGCCCGACG GGTACGACGC GACGATCGGC GGACAGCTCT TCTCGCAGGA GTTCCCGGGC
CTGAGCATCG CGGAGGTGCT CGGCGTCGTC GTCGCGTTCA TCGTGCTGCT CGTGACGCTC
GGCGGGTTCG CCGCCGCGGG CATGCCGCTG CTCAACGCGC TGCTCGGCGT CGGCCTGTCC
ACGCTGCTGG TGCTCGTCGC CGCGGCGTTC ACGTCGGTCA CCAGCACCAC GCCGCTGCTC
TCGCTCATGC TCGGCCTGGC CGTCGGCATC GACTACGCGC TGTTCATCGT CTCGCGCTAC
CGCGAGCTGC TCGCCACCGG CCTGCCCACC CAGGAGGCCG CCGCCCGCTC CAACGCGACC
GCGGGGTCCG CCGTGATCTT CGCGGGCCTC ACCGTGATGA TCGCGCTCGT CGGCCTGGGG
GTCGCCGGCA TCCCGTTCCT CACCGTCATG GGCGTCGCCG GTGCCGCAGC CGTCGGTATC
GCCGTCCTGG TCTCCATCAC GCTCGTGCCC GCGATGCTCG GCGTCGCCGG CGAGCGCCTG
CGTCCACGCC CGTCGCGGCG TGCCCGCAAG GACGCGGCCG CCGGGACCGC GCCCGCCGCG
GCTCCCGCGC CCGCCGCCGA CGGCGACACC TGGGACCTGC CCGAGCACCA CAACCGGTTC
TTCGCCGGCT GGGTGCGCCT GGCCACGGCC CGCCCGTGGG TCACCGTCGT CGTGACGATC
GGCGCGCTCC TGGCCCTCGC GTTCCCGGCG CTCGACCTGC GCCTCGCGCT GCCCGACGCC
GGCGTCGCCC CCACCGACTC CTCGCAGCGC GTCACCTACG ACCGCATCAC CGAGCACTTC
GGCCCCGGCG CCAACGGCCC GCTCGTCGTC ACCGGCAGCA TCGTCACCAG CGACGACCCG
CTGGGACTCA TGGAGGACGT CGCCGACGAG CTGCGGGCCC TGCCCGGCGT CGACTCCGTG
CCCCTGGCGA CGCCCAACGA GTCCATCGAC ACCGGCATCG TGCAGGTCGT GCCCACCACG
GGCCCGACCG ACCCCGCGAC CGCCGACCTC GTCAACGCGA TCCGCGACCT GCGGCCGACG
ATCCTCGAGA AGCACGGGTT CGACCTGGCC GTCACGGGCT TCACGGCCGT CGGCATCGAC
GTGTCCGCCA AGCTGGGCGC GGCGCTGCTG CCGTTCGCGG TGTTCGTCGT CGGCCTGTCG
CTGATCCTGC TGACGATGGT GTTCCGCTCG ATCGCCGTGC CGCTCAAGGC GACGATCGGC
TACCTGCTGT CGGTCGCCGC CGCGTTCGGC GTCGTCACCG CCGTCTTCGA GCACGGCATC
GCGGCCGACC TGCTCCACGT CTCGCGCCTC GGCCCGATCA TCTCGTTCAT GCCGATCGTC
CTCATGGGCG TGCTCTTCGG CCTCGCCATG GACTACGAGG TGTTCCTCGT GTCCCGCATG
CGCGAGGACT ACGTGCACTC CGGCAAGGCG CGCGCGTCGA TCGCCACCGG GTTCGTCGGC
TCCGCCAAGG TCGTCACCGC GGCGGCCGTC ATCATGGTCG CGGTGTTCTT CGCCTTCGTC
CCCGAGGGGG ACATCAACAT CAAGCCCATC GCGCTCGGCC TGGCCGTCGG CGTCGCGGTC
GACGCGTTCG TCGTCCGCAT GACCCTCGTG CCGGCCGTCA TGCAGATCCT CGGCGAACGC
GCCTGGTGGA TGCCGAAGGG CCTGGACCGC GTGCTGCCGT CGTTCGACGT CGAGGGCGAG
GCGCTCCACC GCGAGATCAG CATGCAGGCG TGGCCGCACG ACCCGGACGT CGTCGTCGCC
GCACGCGGCC TGCGGCTCGC TGCGCTCGAC CGCACCGACG TCGTGGACCT CGCGGTGCGA
CGCGGTGAGG TGCTCGTCGC GCACGCCGAC GAGCCCGCCC GCCCCGCAGC CCTGCTGCTC
ACCGTCGCCG GGCGCCTGGC ACCCGAGGCG GGCGACCTCA AGGTCGACGG GCTGCTCCTG
CCCGTGCGCG CCGCCGCCGT GCGCCGCCGC GTCGGCTACG TCGACCTGCG CACCGAGGGC
GTCGACGCGC TCGACGCCGC GGTCGCCGAG CGGCCGCCCG TGCTCGCCGT CGACCGCACC
GACCTCGTCA CCGACCCGCA CGAGCGCGCG CACGTCGCCG CGGCACTGTC CCGTGCGCTG
GACGCGGGCG CCACGCTCCT GCTCGGCGTC GTCGGCAGCA CCCCCGCCGA CGACCTGCTC
CCCGCAGGCA CCCCCGTCAC GACCCTCGCA CCGCAGGCCG GAGCCCTCGC GTGA
 
Protein sequence
MSSALYRLGR AAFARRRAVI GAWVGLLVLI GAAAGLLGGT LDNSVSIPGT ESQAALDRLT 
ATFPQAAGTT AQVLVVGEDG AQVDDPAVVT AVEDSVDAFL EVESVTSAVS PFDDTLPGAS
AVSDDGEAAL LTLSLEGEGV AIGDEVKDRL RDVADELDAA LPDGYDATIG GQLFSQEFPG
LSIAEVLGVV VAFIVLLVTL GGFAAAGMPL LNALLGVGLS TLLVLVAAAF TSVTSTTPLL
SLMLGLAVGI DYALFIVSRY RELLATGLPT QEAAARSNAT AGSAVIFAGL TVMIALVGLG
VAGIPFLTVM GVAGAAAVGI AVLVSITLVP AMLGVAGERL RPRPSRRARK DAAAGTAPAA
APAPAADGDT WDLPEHHNRF FAGWVRLATA RPWVTVVVTI GALLALAFPA LDLRLALPDA
GVAPTDSSQR VTYDRITEHF GPGANGPLVV TGSIVTSDDP LGLMEDVADE LRALPGVDSV
PLATPNESID TGIVQVVPTT GPTDPATADL VNAIRDLRPT ILEKHGFDLA VTGFTAVGID
VSAKLGAALL PFAVFVVGLS LILLTMVFRS IAVPLKATIG YLLSVAAAFG VVTAVFEHGI
AADLLHVSRL GPIISFMPIV LMGVLFGLAM DYEVFLVSRM REDYVHSGKA RASIATGFVG
SAKVVTAAAV IMVAVFFAFV PEGDINIKPI ALGLAVGVAV DAFVVRMTLV PAVMQILGER
AWWMPKGLDR VLPSFDVEGE ALHREISMQA WPHDPDVVVA ARGLRLAALD RTDVVDLAVR
RGEVLVAHAD EPARPAALLL TVAGRLAPEA GDLKVDGLLL PVRAAAVRRR VGYVDLRTEG
VDALDAAVAE RPPVLAVDRT DLVTDPHERA HVAAALSRAL DAGATLLLGV VGSTPADDLL
PAGTPVTTLA PQAGALA
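
The gene sequence begins with GTG while the protein begins with M, which is consistent with translation table 11 (bacterial code), where several alternative initiation codons are read as methionine at the start of a coding sequence. The following is a minimal sketch of that translation, not the tool that produced this record. The function name `translate_cds` is hypothetical, the initiation codon set is the one listed for NCBI table 11, and only the first and last sequence blocks shown above are used as input.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (sense codons are identical in tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons of NCBI translation table 11; each is read as M in the first position.
START_CODONS_TABLE_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS_TABLE_11:
            protein.append("M")  # alternative start codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break  # stop codon ends translation
        protein.append(residue)
    return "".join(protein)


# First 60 bases of the gene sequence listed above.
first_block = "GTGTCCTCTG CCCTCTACCG CCTCGGCCGG GCCGCGTTCG CCCGACGACG TGCCGTCATC"
print(translate_cds(first_block))  # MSSALYRLGRAAFARRRAVI

# Last 54 bases, ending in the TGA stop codon.
last_block = "CCCGCAGGCA CCCCCGTCAC GACCCTCGCA CCGCAGGCCG GAGCCCTCGC GTGA"
print(translate_cds(last_block))   # PAGTPVTTLAPQAGALA
```

Both outputs match the corresponding ends of the protein sequence above. Passing the full gene sequence to the same function would be the natural way to check the remaining residues.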