Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cfla_3229 |
Symbol | |
ID | 9147145 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | - |
Start bp | 3593589 |
End bp | 3596162 |
Gene Length | 2574 bp |
Protein Length | 857 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | protein of unknown function DUF214 |
Protein accession | YP_003638310 |
Protein GI | 296131060 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.248518 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0371702 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGCGCG TCACCCTGCG CAGCGTGCGC GCGCACGCGG TGCGCTTCGC GTTGTCGATC CTGGCCGTGG CGCTCGGCGT CGCGTTCGTC GCCGGCACGT TCGCGCTGCG CACCATGCTC GCCGACACGT TCCACGGCAT CGTCGACGGG CAGGCGCCCG CCGACGCGTA CGTCCGCGGT GACGAGCCCC TGGCCGGTGG GAGTCAGACC GGCGGGGCCC TGACCCTCGG GGAGCAGCGG GTGCCGGTGC CGCTCGCGCT CGCGGACGAG GTCGCCGACG TCGACGGCGT GCGGGCCGCG CTGCCCGACC TCGCCGGGCA GGTCGTGCTC GTGGGCGCCG ACGGCACGGC CGTGCAGTCC ACGCAGGCGC CGTCGCTGGG CGTCGCGTTC CACGCGACCG ACCCGACGCT CGACCTGCGC GAGGGGCGTG CCCCCGACGC GGACGACGAG GTCGCGATCG AGTCCGCGAC CCTCGCGTCG TCCGGGCTGG CCGTGGGCGA CACCACCCGG CTGGTGGCGT CCGGCGAGGT CCGCGACGTC ACGGTCGTCG GCGAGGTCGA CGCCGGCGGC CCGCTCGCCG GCGCCACGCT CGTCTACCTG CCCGTCGACG TCGCCACGGC CGCGTTCGCG CCCGAGGGTG TCGTGCCGAG CGTCGCCGTG CACGCCGCCG ACGGGGTCGA CGAGGACACG CTCGCCGAGC GCGTCGCGGC CGCGCTGGAC GACGCACCGG GTGCCGAGGT CCTCACGGGC GACACCATGC GGGAGCAGCT CCGCGCCGAC ATCGACCAGA CGCTCGGGTT CATCACGTCG TTCCTGCTGA TCTTCGCCGT CGTGTCGCTG TTCGTCGGCG GGTTCCTCAT CTCCAACACG TTCGCGATGG CCGTGCGGCA GCGTGTGCGG GAGTTCGCGC TGCTGCGGGC CGTGGGCGCG TCGCCCGCGC AGGTGTTCGG CGTCGTCGTG GGGCAGGCGG CCGTCGTCGG GCTCGTCGGC TCGGCGATCG GCGTGGCCGG GGGTGTGGGT CTGGTGAGCG GGCTGCGCGT GGTGTTCGCG CAGGTCGGCA TGGACCTCGT CGGGGACGTG CCGGTCGAGA CGACGAGCGT CGTGACGTGC CTCGTGCTGG GCACCGTCGT GTCGGCGGTC GCCGCGGCCG TGCCCGCACG CCGCGCCGCG CTCGTCGCAC CCGTGGAGGC GATGCGTGGC GAGGTCACCG TGCCCGAGCG CTCGCTGCAC GTGCGGGGCG TCGCCGGTGG CCTCGTCGTC GCGCTGGGCG TCGCGGCCGT CGCGTACGCG GCGCTGCGGC CCGACGCGAC CGGGGCGGAG CTGCTCCTCG GCGCCGGCGC GGTGGTCGTG CTCGCCGGGG TGCTCGTGGT CGCGCCGTCG CTCGCGCGGG CCGTGCTGCG GGTGCTCGCC CTGCCGTTCG TGCACGCGCT GCCGCCGCTG GGGCGCCTCG CGCAGGGCAA CGTCGTGCGC AACCCCCACC GGACCGCCTC GACGGCGGGG GCGCTCGTCA TCGGGATGGC GCTCGTGGGG GCGGTGTCGG TGATCGCCGC GACGGGCCAG GCGTCCCTCG TGCGCGTCGT CGAGAGCGCG ACGAACGCCG ACCTCGTGCT GCGCGGCGCG ACGAACGCGG TGCCCGCCGG CGCCGTCGAT GACGTCACCG CGCTGCCGGA GGTCGGCCGG GCGGACGCGA CCGCCTTCGC GTTCGGCGGG ATGTCCCCGC GGCCGGGTGC CGTGCCCGGG CCCGACGACG GCGCGTTCCT GGTGGGCCTC GCGCCGGGGG TCCTGGGTGG GTCGCTCGTG GTCGAGGTCC TCGCGGGCGA CGTCGACGCG CTGGACGACA CGCACGCGGT CGTCAACGAG CGCATCGCCG GCGAGGGGTG GGAGGTGGGC GACGAGCTGA CCGTGAGCAC CGCCGCCGGT GAGCGGACCC TGGAGGTCGC GGCGGTCGTC AGCACCCCCG TCATGTCGGG GTCCGTGGTC GTCACGCAGG ACGTGCTCGA CGAGCTCGCA CCCGGGCCCG CGCAGACCAC CGACACGGTC TTCGTCGACG CGGCCGACGG CGTCGCACCG GCCGAGCTGC GGGACGCCGT GACGGCGGCC GTCTCGCCCT ACGTGGTGGT GTCCGTGCAG GACCGCGACG AGTTCGTCGA CCAGATGGCC GCCCAGGTCG ACCAGCTGCT CGTCATCCTC TACGCGCTGC TCGGCCTGTC CCTCGTCATC GCGGTGCTGG GCATCGTCAA CACGCTCGCG CTGTCGGTCA TCGAGCGCAC GCGCGAGATC GGGCTGCTGC GGGCCGTGGG CCTGGGGCGG CTGCAGCTCG CGGGCGTCGT CACGGTCGAG TCCGTGCTCA CCGCGGTGTT CGGCACGGTC GTCGGACTCG CGGTGGGTGT GGGGCTGGGG TCGACCCTGC CGAGCGTCTA CGCCGACGAG GGCCTGGACC GCCTGTCCGT CCCGTGGTCC GGCCTGGCCG TCATGGTGGG GCTCGCGCTG GTCGTCGGCG TGCTGGCGGC GGTGTGGCCC GGTGCGCGCG CCGCGCGCCT GCGGGTCCTC GACGCGATCG CCACCCCCGA CTGA
|
Protein sequence | MLRVTLRSVR AHAVRFALSI LAVALGVAFV AGTFALRTML ADTFHGIVDG QAPADAYVRG DEPLAGGSQT GGALTLGEQR VPVPLALADE VADVDGVRAA LPDLAGQVVL VGADGTAVQS TQAPSLGVAF HATDPTLDLR EGRAPDADDE VAIESATLAS SGLAVGDTTR LVASGEVRDV TVVGEVDAGG PLAGATLVYL PVDVATAAFA PEGVVPSVAV HAADGVDEDT LAERVAAALD DAPGAEVLTG DTMREQLRAD IDQTLGFITS FLLIFAVVSL FVGGFLISNT FAMAVRQRVR EFALLRAVGA SPAQVFGVVV GQAAVVGLVG SAIGVAGGVG LVSGLRVVFA QVGMDLVGDV PVETTSVVTC LVLGTVVSAV AAAVPARRAA LVAPVEAMRG EVTVPERSLH VRGVAGGLVV ALGVAAVAYA ALRPDATGAE LLLGAGAVVV LAGVLVVAPS LARAVLRVLA LPFVHALPPL GRLAQGNVVR NPHRTASTAG ALVIGMALVG AVSVIAATGQ ASLVRVVESA TNADLVLRGA TNAVPAGAVD DVTALPEVGR ADATAFAFGG MSPRPGAVPG PDDGAFLVGL APGVLGGSLV VEVLAGDVDA LDDTHAVVNE RIAGEGWEVG DELTVSTAAG ERTLEVAAVV STPVMSGSVV VTQDVLDELA PGPAQTTDTV FVDAADGVAP AELRDAVTAA VSPYVVVSVQ DRDEFVDQMA AQVDQLLVIL YALLGLSLVI AVLGIVNTLA LSVIERTREI GLLRAVGLGR LQLAGVVTVE SVLTAVFGTV VGLAVGVGLG STLPSVYADE GLDRLSVPWS GLAVMVGLAL VVGVLAAVWP GARAARLRVL DAIATPD
|
| |