Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cfla_2237 |
Symbol | |
ID | 9146137 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | - |
Start bp | 2497377 |
End bp | 2498927 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | |
Product | Curculin domain protein (mannose-binding) lectin |
Protein accession | YP_003637327 |
Protein GI | 296130077 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.120186 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00817186 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGCGACGTG TGGGGGCCAC GGTCGTGGCA GCCGTGCTGG CGTCGGCCCT CGCGGTGCCC GCGGCCGCCG GACCGGGCGA GGACGTGCTG CGCCCCGGCG AGCAGCTCGC GCCGGGCCAG GCCCTGCTCG CCGCCGGCGG CGGTCACGTG CTCGTCGTGC AGCCGGACGG CGCGCTCGGC CTGTACGCGG TCACGGGCGA CGTGACCGAC GCGATCGTGC GCTGGTCCTC CGGCCGCGGC GTGGCCGGCG CGACGCTCGT CGCCGACGCG TCGGGGGACG TACGCCTCGT CGCCCCCGAC GGCGCCGTGC TGTGGAGCAC CGGCACGGTG GGCTCGGGCG GCGCGCTGCG GTTGCGCGAC GACGGCGAGG TCGTCGTCGA GGCGGCGGAC GGCACGGCCG TGTGGGGGAG CGGCACGGCG CTGGCGCCCT CGGTCCTCGT GGGCCCCGGC AGGATCGCGG GGGACGTCGT GCTGTCGTCG CCGGACGGGC GCCACCTGCT GCACGTCGAC CCGGACTCCG GTGTGCAGCT GCGCGGACCC GACGGCACCG TGCGGTGGGC GCCGCCGGCC GGCGACGGCG CCGAGGCCGA CCCGGCGGTC GCCCTCGAGC TGCGTGCGGA CGGCAATCTC GTGGCGGTGG ACGCCGACGG CGACCCCGTG TGGCGCAGCC GCACCGCGGG GCGCGGGGCG GTGTCGCTCC TGCTCCAGGA CGACGGGAAC CTCGTGCTGC TCGGTGCCGA CGGTGCGCCC GTGTGGGACG CGGGCAGGCC CATCGGACCC TCGGGGCTCG ATGCGACGGG TGCGCTCGCG GGCGAGGCGT CGCTCGGCTC GCCGTCCGGG CACCTCGGTG TGCGCGTGAC GGCGGGTGCG CTGGTCGCGA CGTGGGACGG CGCGCCGATC TGGTCGTCGT CGACCGTCGG GGGCGTCACC GCGCGCGTGC AGGAGGACGG TGACCTCGTG CTCCTCGACG CCGCCGGTGC GGCCGTGTGG CGGTCCGGCA CCGCCGGCCG GCCGGGGGCG CGGCTCGTGG TGGAGGAGAG CGCGGTGCTG CTCGTCGCCC CCAACGGCGA GGTGCTCTGG CAGGTCGCGG TCCCGGAGGC GCTCGTGCCG ACGGGCGTGC AGCCGACCGA CTGCGACGAC GTCGACGGGC CGGTCGCCGC CGGCGACGTG GTACGCACCC GCAGCGGCAT CGTCGTCCAC CCGTGCCTCG CCGAGGCCCT GGACGCCATG GTGGCAGCGG CGCGCGCCGA CGGCATCGAG CTGCACGGGG GCGGGTGGCG CAGCCCCGAG CAGCAGGTCG CGCTCCGGCG TGCGCACTGC GGGCCGAGCG ACGCCGACGT GTACGACAAG CCGGCCTCCG CGTGCCGACC GAGCACCGCG CGGCCCGGCA GCTCGCGGCA CGAGCGCGGC CTCGCGGTCG ACCTCACGTC CGGCGGGCGG TCCCTGACCG CGGGTTCGGC GGCGTACGCC TGGCTCGTCG AGCACGCGGG CGCCTACGGT CTGGAGAACC TGCCGGGCGA GCCGTGGCAC TGGAGCGTCG ACGGGTCCTG A
|
Protein sequence | MRRVGATVVA AVLASALAVP AAAGPGEDVL RPGEQLAPGQ ALLAAGGGHV LVVQPDGALG LYAVTGDVTD AIVRWSSGRG VAGATLVADA SGDVRLVAPD GAVLWSTGTV GSGGALRLRD DGEVVVEAAD GTAVWGSGTA LAPSVLVGPG RIAGDVVLSS PDGRHLLHVD PDSGVQLRGP DGTVRWAPPA GDGAEADPAV ALELRADGNL VAVDADGDPV WRSRTAGRGA VSLLLQDDGN LVLLGADGAP VWDAGRPIGP SGLDATGALA GEASLGSPSG HLGVRVTAGA LVATWDGAPI WSSSTVGGVT ARVQEDGDLV LLDAAGAAVW RSGTAGRPGA RLVVEESAVL LVAPNGEVLW QVAVPEALVP TGVQPTDCDD VDGPVAAGDV VRTRSGIVVH PCLAEALDAM VAAARADGIE LHGGGWRSPE QQVALRRAHC GPSDADVYDK PASACRPSTA RPGSSRHERG LAVDLTSGGR SLTAGSAAYA WLVEHAGAYG LENLPGEPWH WSVDGS
|
| |