Gene Information |
Locus tag | Cfla_3516 |
Symbol | |
ID | 9147432 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | + |
Start bp | 3904713 |
End bp | 3907862 |
Gene Length | 3150 bp |
Protein Length | 1049 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | Tetratricopeptide TPR_4 |
Protein accession | YP_003638587 |
Protein GI | 296131337 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
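The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(3904713, 3907862)
print(length, protein_length_aa(length))  # 3150 1049
```

Both values match the Gene Length (3150 bp) and Protein Length (1049 aa) fields of this record.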
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.535351 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGGCTG AGCCGCCCGC GTCCCCAGCC GACGGCCTCG GAGAGGGCCT CGACGCCTGG CGCCGCTTCA TCGACACCGC CGTCGCCGCG GACGACCCGG ACACCACGCC CTTCGTGCTG GCGTCGGCCG CCGCAGCGCT CGTGTCACCG TCGTTCCTTC CCAAGGACGC ACCGAGGCCG GGCCTCCCCG AAGGTGATCC GAACACGGCG CTGCTGCGTC GCATCGGTGC GGAGCTGCTG GGATGGTCCG GACGCACGGA CGGGACGCTC GGCGCCACCT GGCGCGTGCT GGGGCTGGCG CTGGCCACAC GCCATCCGAA GGAGGGCCGA GGGCCGGTCG ATGAGCTCGC CCAGCGGGTC CGCGAGGTGC AGGAGGCCGC CGACACGGCC GCGGGGATCG ACCCCACCCT GCACGCCTGG GCGCTCATCG CGCTGGCCGC GATGCACCGC AACGCCCACG GTCTCGAGGC TGCCCTGGAG GCGGCCGAGC ACGCGCTCGC GGTCGCCCGC ACTCACCGCT CCCAGGACGA CACCCCGATC CCGGTGCTCG TCGAGGCGAC GGGAGGCGCC GCGTGGATCC CCGGGCACCG CGCCGTCCAG ATCATGCTCG AGTCACGGGC CCTGGCCCGG GTCGCCATCG CACGGCACAT GCAGCGCGAC TACGCAGGAG CAGCTGCGGC TCGTGACGAG CAGATCGACG TCACGGCGAC CGTCCGCGAC CAGTACCCGT CGCTGTACGC GAGCGCGCTG TCCCTGCGCT CCGAGACGGC CCTCCACCTC GGCGACCTCC CCACGGCCCT GCGCCGTGCC GACGAGCTCG CCGCCTGGGC GCAGGAGTGC ACCGACACGA CCACCCGACG GACGCACCTG CGCCGGGCGT CCGAGCTCGC GCTCTTCCTG GACGACTGGG ACCGGGCACG CGAGCTGCGT CTGGAGCGCC TGCGCCTGTG CCTCGAGCGG GTCCTGGACC CGGTACCGGA GCTCGAGCCC GCGTCCGTGC ACGCCGAGCT GCCGGCGCTG GTGCGGCGCG GGCGCCGGGC GGCGCTGACG GCGATCGGCA ACGACGCCTA CGAGCTGGCG CGGCTGGCGC TGAGCTCGGG CGCGGCCGAG CACGACCCCA CGGTGCGCGC CGTGGCCCGC GCCTGGCTCG ACGTCGCGGA CGACGCCTGG GAGGACATCG CGCTCAACGG CGCCGTCGCC GTCCGGTACC GCCGACTCGA GGCCGACGCC CTCGACGGCG CCCTGCCCCC GCTCGAGGTG GGACGGGGCA TGGTCGAGTG CAGCCGGAGC TGGCGGCGTG TCGCCGGCAA GCGCCGTTCG GCGGTGCGCG CCGCCCGGCT GGGCGCTCCC GGTGACCCCG AGGTCCTCGC CCGGCTCGAG GAGCTCGCGG TCGACGCCCC CACGCTGGAC CTCGCGCGCA TCGACCGCGG CATCGCGCGG TGGCACCTGC GCGCCGCGGA CGAGCTGACG GGGGACGCGC GCGTGCAGGC GCTCGTCGCG GCTCGTCGCC ACGCGGCGTC GTCCGCCGCC GGGCTCGTCC TGCGCCGCCC CGACGGCACG GTGCTCGACC TCGATCCCGA GGCCCGCATC GACGCGCTCC AGCTCCAGGT CGACACGACG GTGGCGCTGC GGGAGGCGGG CGCCGTCGTC GACGGCACGG ACGAGACCGC CGAGCTCGCG CTGCGCGTGG CGACCCTGCC GGCTGTCGCC CAGCGCTTCG CCGCCTCGGG GAGCCGCACG CGGCGCCTCG CGCTCGAGCG CCGCTACCGC GACTGGCTCC GCGACACGAT CGTCCTGGCG 
GCGCGGCTGC AGGACGGCGA CGCGGTCGAC CAGGTCGCGG AGGTCCTGCG GCGCGACCTC GTCGGGACGA TCCTCTACGG CCTCACGCAG GACGAGATGG CGCCCGGACA GATCGCCGAG CTCGCGCGCG AGCTGACCGC GACGCTCAAC GCGACCGTCG AGGACCTCGA CGCGGCTGAC GACCCGTCCC CTGGCGACCC CTCGACCGGC CACCGCGCCG TGACGCTCGA CGAGCAGCTC GACAAGACGC TCGACGTCAT GGGGCAGGTG CTCGGCCCGG TCGCCCGCAC CCTGTTCGAC CCCCGCACGG TCGGCGACCA CACGGTGGCG GGTGCGCTGG ACAGCGCCTA CGGCACCCGC CGTGCGGCGG TCCTGTCGCT GGTGCTGCTG CCGTCGGACG AGCCGCAGCT GGTCCGCCAC CTGGCCTGGC GCGCCGAGGA CGACGGCCCG GTGGCGGAGC ACCTCGACGT GGTGCCCGCG CCCCGCTGGC TCGCGGGGTT CGAGGTGGGC CACGAGCCGG AGCGGTTCTT CGCGCGCCTG ACGAGCCTCC CGCGGACCGT GCTGCCCGAC CACCTCGCCG GTCTGCTCGC GACGACGGAC GCCGAGCACC CGCTGCCACT GACGGTCGTG CCGACCGGGC TGCTCGGCGT GCCCTTCGCG GCCCTCGTGT CGGGCGGACG CCTCGTCCTG GAGACGGCGT CGGTCGCCGC GGCGCAGTCA CTGCAGGCGG TCCGCACGCT CGCCGGCCTC ACCACGCCCG ACGCCGAGGG CGCACCCCCG TGGGACGTGG CCGTCTACGA CCTCGTCACG CTCAAGCACA CCGAGGAGGA GCGGGCGGCG CTGATCGCGC AGCGGCCCAG CACCCACGAG CCGCGGACGC TCGCGGAGAT GCGGGCCGCC CTGGCCGACC CTGCGCGCCG GGGGGCGATC GGCATCCTCG CGCTCGCGGT GCACGGCACG CGCGGCGCGG ACGGCTGGGC CCAGGTGAAG GTCCTCCCCA GCGGCGAGCT GCTGACGACG GGCCACGTGC TGCAGTGGTA CCTGCCGCGG CTGGTCGTCG GGGCGTCGTG CAACACCGAC ATCCGGTCCG ACGCGGGCGG CGAGCTCGGG GGCTTCCCGC TCGCGTTCCA GCTGCGCGGT GCCGCGACGA TCGTCGGCAG CCTGCACTAC GTCGAGGACG CCGCGACGGC CGAGATCATG GCGGTGTTCT ACGCCGCCGT CGGCGCGGGC ATGGGCACGG CGGACGCGCT GCGGCACGCG CAGCTGACGT GGGTGAACCG GGACCGCGCG GCGCGGCTGG CCGACAAGTC ACGGTGGGCG TACCTGCTCT GCTACGGGCT GCCGGGCTGA
|
Protein sequence | MPAEPPASPA DGLGEGLDAW RRFIDTAVAA DDPDTTPFVL ASAAAALVSP SFLPKDAPRP GLPEGDPNTA LLRRIGAELL GWSGRTDGTL GATWRVLGLA LATRHPKEGR GPVDELAQRV REVQEAADTA AGIDPTLHAW ALIALAAMHR NAHGLEAALE AAEHALAVAR THRSQDDTPI PVLVEATGGA AWIPGHRAVQ IMLESRALAR VAIARHMQRD YAGAAAARDE QIDVTATVRD QYPSLYASAL SLRSETALHL GDLPTALRRA DELAAWAQEC TDTTTRRTHL RRASELALFL DDWDRARELR LERLRLCLER VLDPVPELEP ASVHAELPAL VRRGRRAALT AIGNDAYELA RLALSSGAAE HDPTVRAVAR AWLDVADDAW EDIALNGAVA VRYRRLEADA LDGALPPLEV GRGMVECSRS WRRVAGKRRS AVRAARLGAP GDPEVLARLE ELAVDAPTLD LARIDRGIAR WHLRAADELT GDARVQALVA ARRHAASSAA GLVLRRPDGT VLDLDPEARI DALQLQVDTT VALREAGAVV DGTDETAELA LRVATLPAVA QRFAASGSRT RRLALERRYR DWLRDTIVLA ARLQDGDAVD QVAEVLRRDL VGTILYGLTQ DEMAPGQIAE LARELTATLN ATVEDLDAAD DPSPGDPSTG HRAVTLDEQL DKTLDVMGQV LGPVARTLFD PRTVGDHTVA GALDSAYGTR RAAVLSLVLL PSDEPQLVRH LAWRAEDDGP VAEHLDVVPA PRWLAGFEVG HEPERFFARL TSLPRTVLPD HLAGLLATTD AEHPLPLTVV PTGLLGVPFA ALVSGGRLVL ETASVAAAQS LQAVRTLAGL TTPDAEGAPP WDVAVYDLVT LKHTEEERAA LIAQRPSTHE PRTLAEMRAA LADPARRGAI GILALAVHGT RGADGWAQVK VLPSGELLTT GHVLQWYLPR LVVGASCNTD IRSDAGGELG GFPLAFQLRG AATIVGSLHY VEDAATAEIM AVFYAAVGAG MGTADALRHA QLTWVNRDRA ARLADKSRWA YLLCYGLPG
|
| |
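The GC content field can in principle be recomputed from the gene sequence shown above. The following is a minimal sketch, assuming the sequence is supplied as text in the same 10-base, space-separated layout; `gc_fraction` is a hypothetical helper name, and the short string used here is only the first 20 bases of the gene, not the full sequence.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First two 10-base blocks of the gene sequence, used as a small demonstration.
snippet = "ATGCCGGCTG AGCCGCCCGC"
print(f"{gc_fraction(snippet):.0%}")  # 80%
```

Passing the complete gene sequence instead of the snippet would give the value to compare against the GC content field.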