Gene Cfla_3140 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_3140 
Symbol 
ID9147055 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp3489853 
End bp3491724 
Gene Length1872 bp 
Protein Length623 aa 
Translation table11 
GC content77% 
IMG OID 
Productpeptidase M48 Ste24p 
Protein accessionYP_003638221 
Protein GI296130971 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0822769 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000295634 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCACCA GGCTGCGCGC GCTCGTCGCG GTCGTCACGC TGGGCGGCTT CTACGTCGCC 
GCCCTCGCCG TCGTCGTGGG CCTCGGTGCG CTCACGGTGC TGGCCATGGA GGCCGGCACC
GGCGTCGTCG CCGGCAAGCT CGGGTTCGTC ACCCTCGCGG CGGCGGGCGG TCTGGTCGTC
GCGCTGTGGA AGGTCGCCCG TGCGCGCCCG CCGCAGCCCA CGGGCCCGGT GCTGCTGCGC
GCCGACGCCC CGGAGCTGTG GGGCATCGTC GACGAGCTCG CGAGCCTGAC CGGCACGCGC
GGCCCGGACG AGATCCGCCT CGCGCCGGAT GTCAACGCCG GCGTCTGGGA GGACGCGCGT
CTCCTCGGCC TCGTCGGCGG CACCCGCCGC ATGGTCCTCG GGGTCCCGCT GCTGCACGGG
CTGACCGTGG GGCAGCTCCG CTCGGTGCTC GCGCACGAGC TGGGGCACTA CTCGCACGAC
GACACGCGGC TGTCCGTCGT CGTCCACCGC GGGCGCGCCG TCATCGCTGC GACCCTCGCC
CAGCTCTCGG GGTCCGTCGC CGGCTGGCTG CTACGGCAGT ACGGCAAGCT CTACCTGCTC
GTCTCCGCCG CCACGAGCCG CTGTCAGGAG CTCGCCGCCG ACGCGCTGTC GGTGCGCGCC
GCGGGGCGCG CGACCGCGCA GTCCGCGCTC CGCGAGGTGC TCGTCATCGA CGCGGCCTGG
GACTTCTACC TCGACTGCTA CGTGGCCCCC GGGTGGGAGA TCGGACTGGC GCCCACGTCG
GACGCGTTCT TCGGCGGGTT CCGCGAGCTG CTCGCGGCCC GGACGCAGGA GCTGGGCTCC
GTGCGCGAGC GTCCCGCGGG CGAGCAGGGC AGCCGCTGGG ACAGCCACCC GCCCATCGGT
GAGCGGGTCG TCGCCATGGA CCGTCTGCCC GACGTCCCGG CACAGCCCGA CGACCGCCCG
GCGTCCGTGC TGGTGCCGCA CCTCGACGTG GTGGCGGCGC ACCTGGCCGA CGAGGTGCTC
GACGTCGCCG ACCGGCAGCG CCTGCCGTGG GACCAGCTCG TCCCGCCGAT GGCCGCCGCC
GCGCAGCAGC GCCGTGCCGA CGCCGTGCAC CGCGCCGCCG GCCGGCTCGC CGGGGTGCAG
CGCGCCACGC TCGGGACCGT CCTCGAGCTC GTCGAGCAGG GCCGTGGCGA CGACCTGGCG
CGCGAGCTGG GCATCGACCC GCGGCGGCTG ATGGTCGCGG CACCCGGTGA GCGCCCCCCG
CACCCCCTGG CCGGCGCGCT CGAGCCCGTG CTGGGCGCCG CGCTCGTGGC CGGGGGCGCC
GCCCGCTGGT GCCTCGAGTG GGCGGGGCCG GCGACGCTGC GCGACCGGGA GGGCGACGAG
CCCGACCTCA CCGCGTGGGC GGACCGCGCG GCCCGGCCCG GCGGCGTCGA GCACGTCCGG
GCGTGGCTCG CCGGGCTCGG CGTCGACCCG CGGGCCGTGG GCCAGGTGCA CGAACGCGCG
ACCGCTCACG GGGCCCAGGT GCTCGCCGGC CTGGCGAACG TCGCGGTCGA CGGCACGGAC
CACGACGTCG TCGTCCTCGA CCGCGGCCTG GTGCTGGTGC CCTGCCCCAA GAAGACGGAC
GGCGGCAAGG CGCGCATCCT CGGGGTGGTG CAGGCGGTGC CGGTCCACCA GCTCGCGCAG
GTGCACCGGT TCGTGCCCTA CGAGGAGGTC CGCACCGCCA CGGTCCACCG GCAGTCGCCC
GTGCACGCGA CCGTCGAGCT GCACGACGGC TCCCGGCTCG TGCTCAAGGA GCGCTGGTCC
GGGGAGTACC TGGTCAAGGG CTCGCAGGAG GTCCTCGTCG GTCACCTGCA CTCGCTCGCG
ACGACGCCCT GA
 
Protein sequence
MSTRLRALVA VVTLGGFYVA ALAVVVGLGA LTVLAMEAGT GVVAGKLGFV TLAAAGGLVV 
ALWKVARARP PQPTGPVLLR ADAPELWGIV DELASLTGTR GPDEIRLAPD VNAGVWEDAR
LLGLVGGTRR MVLGVPLLHG LTVGQLRSVL AHELGHYSHD DTRLSVVVHR GRAVIAATLA
QLSGSVAGWL LRQYGKLYLL VSAATSRCQE LAADALSVRA AGRATAQSAL REVLVIDAAW
DFYLDCYVAP GWEIGLAPTS DAFFGGFREL LAARTQELGS VRERPAGEQG SRWDSHPPIG
ERVVAMDRLP DVPAQPDDRP ASVLVPHLDV VAAHLADEVL DVADRQRLPW DQLVPPMAAA
AQQRRADAVH RAAGRLAGVQ RATLGTVLEL VEQGRGDDLA RELGIDPRRL MVAAPGERPP
HPLAGALEPV LGAALVAGGA ARWCLEWAGP ATLRDREGDE PDLTAWADRA ARPGGVEHVR
AWLAGLGVDP RAVGQVHERA TAHGAQVLAG LANVAVDGTD HDVVVLDRGL VLVPCPKKTD
GGKARILGVV QAVPVHQLAQ VHRFVPYEEV RTATVHRQSP VHATVELHDG SRLVLKERWS
GEYLVKGSQE VLVGHLHSLA TTP