Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cfla_3225 |
Symbol | |
ID | 9147141 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | + |
Start bp | 3587997 |
End bp | 3589691 |
Gene Length | 1695 bp |
Protein Length | 564 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | alpha amylase catalytic region |
Protein accession | YP_003638306 |
Protein GI | 296131056 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.262228 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0000603449 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGAGGATCC GCGACACCGG TGACCTGTGG TGGAAGTCCG CGGTCATCTA CTGCCTCGAC GTCGAGACGT ACATGGACTG GAACGACGAC GGCATCGGTG ACCTGCCGGG CCTCGTCCAG CGCATCGACC ACCTCGCCGA GCTCGGTGTG ACGTGCCTGT GGCTCATGCC GTTCTACCCG ACGGCGGACC GCGACGACGG CTACGACATC ACCGACTACC TGGGCGTCGA CCCCCGCCTC GGGGACGCCG GCGACCTCGT CGAGCTGATC CGCACGGCCA ACGACCGCGG CATCCGGGTC ATCGCCGACC TCGTCGTCAA CCACACGTCG GACCGGCACC CGTGGTTCAG GAGCGCGCGG CGCTCGCCGG ACAGCCCGTA CCGCGACTGG TACGTGTGGC GGCAGGACGC CCCGCCGGAC ACGTCCGACC AGGTCGTCTT CCCCGACAAG GAGGACGGCA TCTGGACGTA CGACGAGCAG GCGCAGGCCT GGTACCGGCA CCGGTTCTAC AAGCACCAGC CGGATCTCAA CACCGCGAAC CCGCAGGTGC GGGACGCGAT CGCGAAGACG ATCGGCTACT GGGCGCAGAT GGGCCTGGAC GGGTTCCGCG TCGACGCGGT GCCGTTCTTC CTCGCCGACG CCGCGAACAC CCAGGGCGAC GACATCGACC ACCCGCACGA GTTCCTGCAG CAGCTGCGCG CGTTCCTGTC GCGGCGCACC GGCGACTCGA TCCTCATGGG CGAGGTCAAC CTGCCGTACG ACCAGCAGCG CCTGTTCTTC GGCGACCACA CGGGCGACGG CGACCACGAG GGCGCCGAGC TGAACCTGCA GTTCGACTTC GTGCTCATGC AGCAGATGTA CCTGGCGCTG GCCCGGCACG ACGCGGGGCC GATCGCGGAC ACCCTGCGCG CGCGACCCGT CATCCCGTCG GACGCGCAGT GGGCGACGTT CGTGCGCAAC CACGACGAGC TGACGCTCGA CAAGCTCTCC GACGACGAGC GCCAGGAGGT GTTCGACGCG TTCGGCCCCG AGCCCGAGCA CCAGCTGTTC GACCGCGGCC TGCGCCGACG GCTGCCCCCG ATGCTCGAGG GCGACCCGCG ACGCGTGCGG ATGGTCTACT CGCTGCTGTT CAGCCTGCCG GGCACGCCCG TGCTGTTCTA CGGCGAGGAG ATCGGCATGG GCGAGAACCT CGCCGCCGAG GGACGGCTCG CGGTGCGCAC GCCCATGCAG TGGACGTCGG GGAAGAACGG CGGCTTCTCG CGCGCCGCGC CGTCGCGGCT GTCCGGCCCG GTGCCCGACG GCCCCTACGC ACCGGCGCAC GTCAACGTCT CGGACCAGCG CCGCGACCCG GACTCGCTGC TGAACTTCGT CCAGAAGCTC GCACGGCGGT ACCGCGAGAG CCCCGAGCTG GGCTGGACGG AGCGCGCCGA GATCCTCGAC CAGCCGCACC CGAGCGTCCT GGCGCACCGG TCGACGTGGC AGGACGCGTC GATGGTGGCG CTGCACAACC TGGGGCCCGA GGCGGTGCGG GTGCCCCTGC ACCTGCCGGA CACGGACGAG GGGTGGCGCC TCATCGACCT GCTGCAGGAC GACGAGCACG TGCCGAGCAG CACGGGGCGG GTCGAGATCG AGCTCGAGGG GTACGGGTAC CGCTGGCTGC GCCTCGCGAG CCCGGGGTCG CGGCGCCTGC TGTGA
|
Protein sequence | MRIRDTGDLW WKSAVIYCLD VETYMDWNDD GIGDLPGLVQ RIDHLAELGV TCLWLMPFYP TADRDDGYDI TDYLGVDPRL GDAGDLVELI RTANDRGIRV IADLVVNHTS DRHPWFRSAR RSPDSPYRDW YVWRQDAPPD TSDQVVFPDK EDGIWTYDEQ AQAWYRHRFY KHQPDLNTAN PQVRDAIAKT IGYWAQMGLD GFRVDAVPFF LADAANTQGD DIDHPHEFLQ QLRAFLSRRT GDSILMGEVN LPYDQQRLFF GDHTGDGDHE GAELNLQFDF VLMQQMYLAL ARHDAGPIAD TLRARPVIPS DAQWATFVRN HDELTLDKLS DDERQEVFDA FGPEPEHQLF DRGLRRRLPP MLEGDPRRVR MVYSLLFSLP GTPVLFYGEE IGMGENLAAE GRLAVRTPMQ WTSGKNGGFS RAAPSRLSGP VPDGPYAPAH VNVSDQRRDP DSLLNFVQKL ARRYRESPEL GWTERAEILD QPHPSVLAHR STWQDASMVA LHNLGPEAVR VPLHLPDTDE GWRLIDLLQD DEHVPSSTGR VEIELEGYGY RWLRLASPGS RRLL
|
| |