Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cfla_1148 |
Symbol | |
ID | 9145027 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | + |
Start bp | 1289993 |
End bp | 1292860 |
Gene Length | 2868 bp |
Protein Length | 955 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | |
Product | Fibronectin type III domain protein |
Protein accession | YP_003636251 |
Protein GI | 296129001 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.950672 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0000360258 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | TTGACCACCA CCACTGGCCT GCTGCGACGC CTGCGGGGCC GCGCCGGAGA CCGGCAGCGC GGCTTCGCCG AGGCGGGCGT CGCCATGACG GGCGCCGCCC TCGTGCTGGG GGCGGCGCTC GGCTCCGGCG TGGCGTCGAC CGTCGTCAGC ATGTCCGACG GCGTCACGTG GCTGCCGGAC CCCGAGACGG GCCAGGTCGT GCAGATCAAC CCCGGGACGG GCCGGGCGGA GCGCCGGCTG CAGGTCGGCG AGCCCGGCGC GGAGCTGCTC ATCAGCCAGC GCGACGGTCG CCTCGTCGTC GCGGACGGCA CGGGGACGCT GCGCAGCATC GACCTGTCGA CGCTGCTGGC GGGCGGTGAG CGCCGAGCGG ACGAGCCGAC GAAGGTGCTG GTCGGCGGCG GGCGGGTCTA TCTCGTGACG CCCGGGTCCG GTGTCGTGCA CGCCGTGGAC GCGCTCACGC TGCGCGACCT CGGCGCCCCC TTCCGCGCCG GCTCGGCGCT CGCGGACGCG GTCGTCGACG ACCAGGGTGC CGTGTGGCTC GTGACAACGG GCGGCGACCT GCGCCGCGTG GCGTGGCAGC CGGACGCGCG GCGGTTCGAC GTCGGCGCAG CGCAGCGGGT GCACTCGGCC GGGCCCGGCA CCCGGCTGCT GCCGCACGCC CGCGGCGTCA CGGTGTTCGC TCCCGACAGC GGCTCGCTGC TGCAGGTCGG CGCGGGTCGC GACCTCTCGG TCGCCGTCCC CGACCTCACG GGGGAGGTGC TGCCGGCCGC CACGTCGCCC GGTGACCTCG CTCCCGCCGG GGTCCCCGGC CGGTCCGCGG TCGTCATGCT CTCCGGCCGC CAGGTCCTCG ACGTCGCGGT CGGCACGCTG GGGTGCACCC GCCCCGGCAG GCCCGCGGTC TTCACCGGCC TGGTGTACGT GCCGTGCACC GGCGCCGGTC GGGTCGTCGT GCTGCGCCCG GACGGCTCGC GTGCGCGTGC CGACGTGGTG CTGCCCGCCG GGCGGGACCC GCAGCTGCTC GTGGACGACG GGCGGCTGGT CGTCCACACG GAGGACGGCT CGCGCGCCGT GGTCGTCGAG GCCGACGGGC GCACGCGCGT GATCGACACC GGACGCTCGC AGGTCCCGGT GCACGACCCG CGCGACGGTC GCAGGGCCCC GGTCTCGGCG CAGCCGCCGC AGCGGGAGGC CGGCGCGCCC GGCGTCGGCG GCGTGGGTGG TGCGCAGGGC CCGGGCGGTG CGCCGGTCGT GGCCCCTGGC GGCGCGCTGG GCGCGGAGGT CCCCGGCGTC CCGGGGGCCG GTCCGACCCC GGCCGGTGGC GTGGCGCCGA CGGCAGGGTC TGCACGCCCC CTCGCCACGG CCGCCGCCAC GCCGTCGGCG ACCGCGCGCA CCGTGCCGGG CCCGACGCAC GTCGTCGCGA CCCCCGGTGA GCCCGACGAG CACGGCCTCA CCGAGGTCGT CGTGCGGTGG CGCGACGCGG CGTCGCGTCC CGACTCCTAC GTCGTGCGCT CGTCCGCGCC GAACGTCGCC GCCGTCGAGG TCGACGGGCG CACCACGACC GCGACGGTGC GCGGCGTCGT CTGCGGCACG CTCGCGACGT TCAGCGTGGA GGCCGTCGCC GACGGCGTCG TGTCGGCGCC CGCGTCCTCC CGCATGGTGC GCACCGAGGG CTGCCCGCCC CCGCCGGCCG CCCCGACGGG CGTGAGTGCC GTCGCGGGCG AGGACGGGAC GGTGACGGTG TCGTGGACGC CCTCGGGCGA CGACGTGGAC TCCTACCTCG TCGGGCCGGC GGGCGGGTCC ATGACGACCG TGGAGGCGGC CGCGACCTCG GTCGTGCTGC GTGACGTGCC CGCCGGGGAG AGCCTGCGGT TCGCCGTGCG TGCGGCCCGC GGCGGCCTGA CCAGCGCAGC GGCGCTGTCC CCGGCGGTCG TCGTCCCGGG CGTCCCGGGC ACCGTCGGGC CGCTGACGGT GGAGATCGAG CGACGTCTCG GGGACGAGCT CGGGTTCGTC GTGCGCTGGA CGCTGCCCGA GGACAACGGT TCCACCCTCG AGCGGTACGT GGTGACGTGG TCCGGCAGGG GCTTCTCCGG GTCCCACACC CCGCCCGCGG TGTCGGGCTG CTCCGCGCGA CTGGGCTGCG CCGCGACCGG CGGCGGGGTG TTCCGCACCG ACGTCACGAT CGCTGCGGCG TGCACCGGCG GCGCGGCGTG CCTCGACGAG GACCGCATCA CCGTCACGGT GACGCCCGAG AACGGTGTCG GCGCCGGGCC GGTCGCGACC ACGGAGGTCT GGATCCCGCC CTTCGTCCCG ATCATGCAGC CCGTCGAGGG GATCGACCCC GTCGTGCCGG ACCCGACCGA CCCCGACGTC CGGATGGTCG TGCACCTCCG CTCGCCGGCG CAGATGGCCG AGCACCAGGG CGCGTGCGAG CTCCAGATCC AGGACGTCGA CGGCAGCCGG TCGACGGTCC CGTGGTCGTG CCAGTCCGGC GACGTCGTCA TCGGTCCGTT CGCGCCCGGC AACATCGTGG TGACCCCGGT CGCGGTCGTG CCCGGTGGTC CGAACATCAT CGCGATGAAC TACAACGCCG AGGTCCCGCC GCGCTCGACC TGGCGCTACT GCGACTCGAC GACCGGCGTG TGCACGGAGC CCGTGAGCCG GGACGGGGAC CCGCCGGTCA CCGTCCGCAC CGTCCCGTGG ACGCCCCGGC TGCCGCAGGG CGACGAGCGC CCGCCGCTGG CGGCCGCCGG CGCCGGCCTG CTGCTGGCGG CCGGCGCGCT GCGCGCGTCC CGCCTGCGCC GCTCCGTCGC CGTGACCGGC ACCGACGTCC CGACCCCGTA CGCCCCCACC GAGGAGACCT CCGCATGA
|
Protein sequence | MTTTTGLLRR LRGRAGDRQR GFAEAGVAMT GAALVLGAAL GSGVASTVVS MSDGVTWLPD PETGQVVQIN PGTGRAERRL QVGEPGAELL ISQRDGRLVV ADGTGTLRSI DLSTLLAGGE RRADEPTKVL VGGGRVYLVT PGSGVVHAVD ALTLRDLGAP FRAGSALADA VVDDQGAVWL VTTGGDLRRV AWQPDARRFD VGAAQRVHSA GPGTRLLPHA RGVTVFAPDS GSLLQVGAGR DLSVAVPDLT GEVLPAATSP GDLAPAGVPG RSAVVMLSGR QVLDVAVGTL GCTRPGRPAV FTGLVYVPCT GAGRVVVLRP DGSRARADVV LPAGRDPQLL VDDGRLVVHT EDGSRAVVVE ADGRTRVIDT GRSQVPVHDP RDGRRAPVSA QPPQREAGAP GVGGVGGAQG PGGAPVVAPG GALGAEVPGV PGAGPTPAGG VAPTAGSARP LATAAATPSA TARTVPGPTH VVATPGEPDE HGLTEVVVRW RDAASRPDSY VVRSSAPNVA AVEVDGRTTT ATVRGVVCGT LATFSVEAVA DGVVSAPASS RMVRTEGCPP PPAAPTGVSA VAGEDGTVTV SWTPSGDDVD SYLVGPAGGS MTTVEAAATS VVLRDVPAGE SLRFAVRAAR GGLTSAAALS PAVVVPGVPG TVGPLTVEIE RRLGDELGFV VRWTLPEDNG STLERYVVTW SGRGFSGSHT PPAVSGCSAR LGCAATGGGV FRTDVTIAAA CTGGAACLDE DRITVTVTPE NGVGAGPVAT TEVWIPPFVP IMQPVEGIDP VVPDPTDPDV RMVVHLRSPA QMAEHQGACE LQIQDVDGSR STVPWSCQSG DVVIGPFAPG NIVVTPVAVV PGGPNIIAMN YNAEVPPRST WRYCDSTTGV CTEPVSRDGD PPVTVRTVPW TPRLPQGDER PPLAAAGAGL LLAAGALRAS RLRRSVAVTG TDVPTPYAPT EETSA
|
| |