Gene Cfla_3516 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_3516 
Symbol 
ID9147432 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp3904713 
End bp3907862 
Gene Length3150 bp 
Protein Length1049 aa 
Translation table11 
GC content76% 
IMG OID 
ProductTetratricopeptide TPR_4 
Protein accessionYP_003638587 
Protein GI296131337 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.535351 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGCTG AGCCGCCCGC GTCCCCAGCC GACGGCCTCG GAGAGGGCCT CGACGCCTGG 
CGCCGCTTCA TCGACACCGC CGTCGCCGCG GACGACCCGG ACACCACGCC CTTCGTGCTG
GCGTCGGCCG CCGCAGCGCT CGTGTCACCG TCGTTCCTTC CCAAGGACGC ACCGAGGCCG
GGCCTCCCCG AAGGTGATCC GAACACGGCG CTGCTGCGTC GCATCGGTGC GGAGCTGCTG
GGATGGTCCG GACGCACGGA CGGGACGCTC GGCGCCACCT GGCGCGTGCT GGGGCTGGCG
CTGGCCACAC GCCATCCGAA GGAGGGCCGA GGGCCGGTCG ATGAGCTCGC CCAGCGGGTC
CGCGAGGTGC AGGAGGCCGC CGACACGGCC GCGGGGATCG ACCCCACCCT GCACGCCTGG
GCGCTCATCG CGCTGGCCGC GATGCACCGC AACGCCCACG GTCTCGAGGC TGCCCTGGAG
GCGGCCGAGC ACGCGCTCGC GGTCGCCCGC ACTCACCGCT CCCAGGACGA CACCCCGATC
CCGGTGCTCG TCGAGGCGAC GGGAGGCGCC GCGTGGATCC CCGGGCACCG CGCCGTCCAG
ATCATGCTCG AGTCACGGGC CCTGGCCCGG GTCGCCATCG CACGGCACAT GCAGCGCGAC
TACGCAGGAG CAGCTGCGGC TCGTGACGAG CAGATCGACG TCACGGCGAC CGTCCGCGAC
CAGTACCCGT CGCTGTACGC GAGCGCGCTG TCCCTGCGCT CCGAGACGGC CCTCCACCTC
GGCGACCTCC CCACGGCCCT GCGCCGTGCC GACGAGCTCG CCGCCTGGGC GCAGGAGTGC
ACCGACACGA CCACCCGACG GACGCACCTG CGCCGGGCGT CCGAGCTCGC GCTCTTCCTG
GACGACTGGG ACCGGGCACG CGAGCTGCGT CTGGAGCGCC TGCGCCTGTG CCTCGAGCGG
GTCCTGGACC CGGTACCGGA GCTCGAGCCC GCGTCCGTGC ACGCCGAGCT GCCGGCGCTG
GTGCGGCGCG GGCGCCGGGC GGCGCTGACG GCGATCGGCA ACGACGCCTA CGAGCTGGCG
CGGCTGGCGC TGAGCTCGGG CGCGGCCGAG CACGACCCCA CGGTGCGCGC CGTGGCCCGC
GCCTGGCTCG ACGTCGCGGA CGACGCCTGG GAGGACATCG CGCTCAACGG CGCCGTCGCC
GTCCGGTACC GCCGACTCGA GGCCGACGCC CTCGACGGCG CCCTGCCCCC GCTCGAGGTG
GGACGGGGCA TGGTCGAGTG CAGCCGGAGC TGGCGGCGTG TCGCCGGCAA GCGCCGTTCG
GCGGTGCGCG CCGCCCGGCT GGGCGCTCCC GGTGACCCCG AGGTCCTCGC CCGGCTCGAG
GAGCTCGCGG TCGACGCCCC CACGCTGGAC CTCGCGCGCA TCGACCGCGG CATCGCGCGG
TGGCACCTGC GCGCCGCGGA CGAGCTGACG GGGGACGCGC GCGTGCAGGC GCTCGTCGCG
GCTCGTCGCC ACGCGGCGTC GTCCGCCGCC GGGCTCGTCC TGCGCCGCCC CGACGGCACG
GTGCTCGACC TCGATCCCGA GGCCCGCATC GACGCGCTCC AGCTCCAGGT CGACACGACG
GTGGCGCTGC GGGAGGCGGG CGCCGTCGTC GACGGCACGG ACGAGACCGC CGAGCTCGCG
CTGCGCGTGG CGACCCTGCC GGCTGTCGCC CAGCGCTTCG CCGCCTCGGG GAGCCGCACG
CGGCGCCTCG CGCTCGAGCG CCGCTACCGC GACTGGCTCC GCGACACGAT CGTCCTGGCG
GCGCGGCTGC AGGACGGCGA CGCGGTCGAC CAGGTCGCGG AGGTCCTGCG GCGCGACCTC
GTCGGGACGA TCCTCTACGG CCTCACGCAG GACGAGATGG CGCCCGGACA GATCGCCGAG
CTCGCGCGCG AGCTGACCGC GACGCTCAAC GCGACCGTCG AGGACCTCGA CGCGGCTGAC
GACCCGTCCC CTGGCGACCC CTCGACCGGC CACCGCGCCG TGACGCTCGA CGAGCAGCTC
GACAAGACGC TCGACGTCAT GGGGCAGGTG CTCGGCCCGG TCGCCCGCAC CCTGTTCGAC
CCCCGCACGG TCGGCGACCA CACGGTGGCG GGTGCGCTGG ACAGCGCCTA CGGCACCCGC
CGTGCGGCGG TCCTGTCGCT GGTGCTGCTG CCGTCGGACG AGCCGCAGCT GGTCCGCCAC
CTGGCCTGGC GCGCCGAGGA CGACGGCCCG GTGGCGGAGC ACCTCGACGT GGTGCCCGCG
CCCCGCTGGC TCGCGGGGTT CGAGGTGGGC CACGAGCCGG AGCGGTTCTT CGCGCGCCTG
ACGAGCCTCC CGCGGACCGT GCTGCCCGAC CACCTCGCCG GTCTGCTCGC GACGACGGAC
GCCGAGCACC CGCTGCCACT GACGGTCGTG CCGACCGGGC TGCTCGGCGT GCCCTTCGCG
GCCCTCGTGT CGGGCGGACG CCTCGTCCTG GAGACGGCGT CGGTCGCCGC GGCGCAGTCA
CTGCAGGCGG TCCGCACGCT CGCCGGCCTC ACCACGCCCG ACGCCGAGGG CGCACCCCCG
TGGGACGTGG CCGTCTACGA CCTCGTCACG CTCAAGCACA CCGAGGAGGA GCGGGCGGCG
CTGATCGCGC AGCGGCCCAG CACCCACGAG CCGCGGACGC TCGCGGAGAT GCGGGCCGCC
CTGGCCGACC CTGCGCGCCG GGGGGCGATC GGCATCCTCG CGCTCGCGGT GCACGGCACG
CGCGGCGCGG ACGGCTGGGC CCAGGTGAAG GTCCTCCCCA GCGGCGAGCT GCTGACGACG
GGCCACGTGC TGCAGTGGTA CCTGCCGCGG CTGGTCGTCG GGGCGTCGTG CAACACCGAC
ATCCGGTCCG ACGCGGGCGG CGAGCTCGGG GGCTTCCCGC TCGCGTTCCA GCTGCGCGGT
GCCGCGACGA TCGTCGGCAG CCTGCACTAC GTCGAGGACG CCGCGACGGC CGAGATCATG
GCGGTGTTCT ACGCCGCCGT CGGCGCGGGC ATGGGCACGG CGGACGCGCT GCGGCACGCG
CAGCTGACGT GGGTGAACCG GGACCGCGCG GCGCGGCTGG CCGACAAGTC ACGGTGGGCG
TACCTGCTCT GCTACGGGCT GCCGGGCTGA
 
Protein sequence
MPAEPPASPA DGLGEGLDAW RRFIDTAVAA DDPDTTPFVL ASAAAALVSP SFLPKDAPRP 
GLPEGDPNTA LLRRIGAELL GWSGRTDGTL GATWRVLGLA LATRHPKEGR GPVDELAQRV
REVQEAADTA AGIDPTLHAW ALIALAAMHR NAHGLEAALE AAEHALAVAR THRSQDDTPI
PVLVEATGGA AWIPGHRAVQ IMLESRALAR VAIARHMQRD YAGAAAARDE QIDVTATVRD
QYPSLYASAL SLRSETALHL GDLPTALRRA DELAAWAQEC TDTTTRRTHL RRASELALFL
DDWDRARELR LERLRLCLER VLDPVPELEP ASVHAELPAL VRRGRRAALT AIGNDAYELA
RLALSSGAAE HDPTVRAVAR AWLDVADDAW EDIALNGAVA VRYRRLEADA LDGALPPLEV
GRGMVECSRS WRRVAGKRRS AVRAARLGAP GDPEVLARLE ELAVDAPTLD LARIDRGIAR
WHLRAADELT GDARVQALVA ARRHAASSAA GLVLRRPDGT VLDLDPEARI DALQLQVDTT
VALREAGAVV DGTDETAELA LRVATLPAVA QRFAASGSRT RRLALERRYR DWLRDTIVLA
ARLQDGDAVD QVAEVLRRDL VGTILYGLTQ DEMAPGQIAE LARELTATLN ATVEDLDAAD
DPSPGDPSTG HRAVTLDEQL DKTLDVMGQV LGPVARTLFD PRTVGDHTVA GALDSAYGTR
RAAVLSLVLL PSDEPQLVRH LAWRAEDDGP VAEHLDVVPA PRWLAGFEVG HEPERFFARL
TSLPRTVLPD HLAGLLATTD AEHPLPLTVV PTGLLGVPFA ALVSGGRLVL ETASVAAAQS
LQAVRTLAGL TTPDAEGAPP WDVAVYDLVT LKHTEEERAA LIAQRPSTHE PRTLAEMRAA
LADPARRGAI GILALAVHGT RGADGWAQVK VLPSGELLTT GHVLQWYLPR LVVGASCNTD
IRSDAGGELG GFPLAFQLRG AATIVGSLHY VEDAATAEIM AVFYAAVGAG MGTADALRHA
QLTWVNRDRA ARLADKSRWA YLLCYGLPG