Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cfla_3716 |
Symbol | |
ID | 9147632 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | + |
Start bp | 4106624 |
End bp | 4108414 |
Gene Length | 1791 bp |
Protein Length | 596 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003638783 |
Protein GI | 296131533 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00638349 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGACGGAAG AGGTCGGCCG GGGCACGGTG CTCGCCGGGC GCTACCGGGT GGACGAGCCG CTGCCGTCGG ACCTCGCGGG AGTGTCCGTG TGGCGCGCCA CCGACCAGAT CCTCGACCGT CCCGTGCGTG TCCGGGTGCT GGAGTCCGGG GCCGTCGCGC CGGCGCTCGA CGCCGCACGT CGCGCCGCCC TCGTCACCGA CGCCCGCCTC GTGCGCGTGC TCGACGTCGG CATGCACGAG GGCGTCGGCT ACGTCGTCTC CGAGCAGATC ACCGGCGCAT CCCTCACCCA GCTCGTCGAG CGGGGGCCGC TGACGCCCGA CCAGGCCCGC GCGGTCGTCG GCGAGGCCGC GGCCGCGCTC GAGGTCGCGC GACGCCGCGG CGTGCACCAC CTCGCGCTGC GCCCGTCGGT CGTGCACGTG TCGGCCGACG GCCGGGTGCT CGTCTCGGGC CTCGCGATCG ACGCCGCACT CCTCGGCGCG CCCCCGGGGG ACGCGCGCAC GACGAGCCGC ACCGATGCCG TCGACCTCGT CCGGCTCCTC TACACGGGCC TCACGGGCCG CTGGCCCGCG GGGCGTGACG ACGCCCTCGC GCCCACCGTG CAGCCTGCAC CCGTCCTGGA CGGCCTGCCC GTGCCGCCCG CGGAGCTGGC GCCCGGCGTC CCGAACGACC TCGACACGCT GTGCGTCGTG ACCCTCGGCC CGAACCAGGA CGGCCCGTTC TCCCCCGGCG ACGTCGTGCA CGAGCTCGAG CCCTGGGGTG AGATCCGCAT CGGGCGCCCG GCCGACGACG ACCGTGCGGC AGGTGAGGGT GCCGTGGCCG CTGCCGCCGC GCCGGCCGTC GAGCCCGAGC GTCCCGCCCC GCCGGTCCGC GTCGCGCGCC AGTCCGTGCG GTCCGCGTTC GACGAGCTGC CCGCCGGTGC TCCGCTTCCC GGCACTCCGC CGCCGGCCGC CCCGGCGCGC GGCGGCATCC CCTCGGGGCG GGTCGAGCGC ACCGGTGTGC TGCCCGCCGG CGCCGCGTAC GGTGCCGGGG CGGTCCCGCC GCCCGGCCCC CCGCCGAGCC ACGCCCCCGA GCCCGATTTC TGGGACGAGG GTCCCGGTTT CGCGGACGAC CCGTTCGCGT TCGTGGAGGA CGACGAGCCG CGCCGCCGGT TCGACCCGAC GGCGCTCGTG CTCGTCGTCG TCGGACTGGC CGTGCTGATC GGGCTGGTGT TCGCCGCACG CTCGCTCTTC ACGTCGCCGG TCGGCGACCG TGACCCCGTC GCGGACGACA CCCCGAGCCA GCAGGAGCCC GCGACGCCGT CGGAGGGCGG CGCCACGGAG CCCACGCCGC AGGAGACCGT CGACGAGGCG CCCGACCCGG GCGTGCCGCC GGCGATCGAG TCCGTCCGGA CGTTCGACCC GACGGACCCC GCGGGCGAGC GGGTCGCCAA CGTCGAGCTC ACGCACGACG GCGACCCGTC GACGTTCTGG TTCTCCTACA CCTACAACAA CCCGGCGTTC GGCGGGCTCA AGGAGGGCAT GGGCCTCGAG GTCACGCTCG CCGCCGAGGC CCCGGTGTCC GGCGTGACGC TGAACGTGAA CGGCTCGGGC GGCAACGTCG AGGTCCGTGC GACCACCGCG TCGACGCCCA CGGAGGGTGC GGCCCTCGGT GGCGGGCCGC TGGGCCCCGA GACCGTCCTC GACTTCGAGG AGCCGGTGAC GACGTCGACC CTCGTCCTGT ACTTCACCGA GCTGCCCACC AACGCGGCCG GGCAGTACCG GATCGAGGTC ACGGAGATCA CCGTCCGGTA G
|
Protein sequence | MTEEVGRGTV LAGRYRVDEP LPSDLAGVSV WRATDQILDR PVRVRVLESG AVAPALDAAR RAALVTDARL VRVLDVGMHE GVGYVVSEQI TGASLTQLVE RGPLTPDQAR AVVGEAAAAL EVARRRGVHH LALRPSVVHV SADGRVLVSG LAIDAALLGA PPGDARTTSR TDAVDLVRLL YTGLTGRWPA GRDDALAPTV QPAPVLDGLP VPPAELAPGV PNDLDTLCVV TLGPNQDGPF SPGDVVHELE PWGEIRIGRP ADDDRAAGEG AVAAAAAPAV EPERPAPPVR VARQSVRSAF DELPAGAPLP GTPPPAAPAR GGIPSGRVER TGVLPAGAAY GAGAVPPPGP PPSHAPEPDF WDEGPGFADD PFAFVEDDEP RRRFDPTALV LVVVGLAVLI GLVFAARSLF TSPVGDRDPV ADDTPSQQEP ATPSEGGATE PTPQETVDEA PDPGVPPAIE SVRTFDPTDP AGERVANVEL THDGDPSTFW FSYTYNNPAF GGLKEGMGLE VTLAAEAPVS GVTLNVNGSG GNVEVRATTA STPTEGAALG GGPLGPETVL DFEEPVTTST LVLYFTELPT NAAGQYRIEV TEITVR
|
| |