Gene Information |
Locus tag | Acid345_0427 |
Symbol | |
ID | 4069653 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 499063 |
End bp | 501198 |
Gene Length | 2136 bp |
Protein Length | 711 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637982431 |
Product | cellulase precursor |
Protein accession | YP_589506 |
Protein GI | 94967458 |
COG category | |
COG ID | |
TIGRFAM ID | |
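
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as below. This is a minimal illustration, not part of the database record: it assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return length_bp // 3 - 1


length = gene_length_bp(499063, 501198)
print(length)                              # 2136
print(expected_protein_length_aa(length))  # 711
```

The strand field does not affect this calculation, since the length depends only on the span between the two coordinates.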
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0137531 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATGTA TTCGGAAGCA GTGTCTCTTC GTCTCCCTGA TTGCGATTTG CTCAAGCGTC ATTGGGCTCG GGCAAACGAT TCGGGTAGAT ATCAGCCACC CCACCAATTC GTTCGTTCCA AACGAATCTC TCGGTGCTGG CATTGATCGT ATCGGAGCGG CAGCGGCGGC GAAGTACTTC AGCGAGCCTC CGTCGAAAGC CGTGCTGGAT GCCGGTTGGC AGCCTGTGAC CTATCGGCAG AACACCGAGT TGGCAGTTGA GGCCTGGCAC TGGAATCCGC AGGGCACGTG GAGCGATCCG AGCGGCAAAG GATACTTCAC CGGTAGCGCC GAACCGGGAC CGGAGATCAT TAAGCATTCG TACGGATATG TTTTGCCCCA TCGTGGCTTT ACGCGTAACG ATGGAACCGA CACGAGCGGC TTCGGTAGAA TGACCGACGG CGATCTCAAC ACCTATTGGA AGAGCAATCC GTATCTCACG AAACATTTCA CTGGCCAGGA AGATTCCGAA CATCCGCAGT GGATCGTGAT TGATCTTGCA ACGACACAAT CCATAAACGC TTTGCGGATC GCCTGGGCAG AGCCTTACGC CAAGCAATAC CTAATTCAGT ATTGGACCGG CGACGACGCG ATCAAATTAC CGAGCAAAGG ATCGTGGGTC GCGTTCCCCG CGGGCGTGGT AAGCGCCGGC AAGGGTGGGA CGGTCACCCA CCAACTCTCG AATTCGCCGA TGCCGGTGCG TTTCTTGCGC GTCTGGATGA CGGAGCCATC GAATACCTGC GACACACATG GCTCGGCCGA TATTCGCAAT TGCGTTGGTT ATGCAATCCG CGAGGTGTTC ATCGGGACGA CCGACAGGAA TGGCTTCCAT GATGTGGCGC GGCACACCCC CGACCAGGAT CAGACCACGA CTTACTGCTC TTCGATTGAC CCTTGGCACG AGCCTTCGGA CTTGGGATCG ACGCAACACG AACACGTTGG CATGGATCTG TTTTATCGCA GCGGATACAC ACGCGGCTTG CCGGCAATGA TTCCAACCGC GTTGCTCTAT GGCACGCCCG AAGATTCCGC TGCTGAAATC GCGTACGTCG AGAAGCGCGG CTACCCGATC TCCTATGTGG AACTCGGCGA AGAGCCGGAT GGCCAGTACA CGCTTCCGGA AGACGACGCT GAACTCTACC TGCAATGGGC GAGGGCAATC CACAAGGTGG ATCCGAAGCT GAAACTTGGC GGGCCGGTAT TTACGGGCCA GAACGAAGAC ATTTTGTCGT GGCCCGATGC GCAAGGTCAG ACATCGTGGA CGCGCCGGTT CCTGAACTAT TTGAGGGCCC GCGGCGGTCT GCGAGAGCTC GCCTTCTTCT CGTTCGAACA CTATCCATTC GAACCGTGCA AGGTGAATTG GAGCAGCCTT TACGACGAGC CGCAGCTCAT GACTCACATC ATGCAGGTTT GGCGTGACGA CGGTCTGCCT GCCGATGTGC CAATGTTCGT TACCGAATCG AACATCACGT GGAACAGTGG CGAGTCGTCG GTAGACATCT TCGGAGCGCT CTGGCTCGCG GATTATGTTG GCTCGTTTTT CACCGCAGGC GGCAAGGGAC TTTACTACTT CCATTATTTA CCGCTGGGTG TGCACCCGGG GTGTAATCAG TCCGGCGGTA CGTTTGGTAT GTTCACGACG AAGGGCAATT TCGAAGTCGA CAAGCCGACG TCGCAGTTCT TCTCGAGCCA GTTAATCAAC ACCGAATGGG TGCAGCCTGG TGACGGAGTG CACGAAACCT ACGCCGCGAC TGGCGATCTC 
ATGGACGCCG CCGGGCACGC CTTAATCACC GCCTACGCCG TGAAGCGCCC TGATGGCCAA TGGTCGCTGC TCGTTGTGAA TCGCGATCAG GAGAATGCGC ATAAGGTGAC GATCGATTTC TCTGATTCTG GGCGGGGCAA GACTGGATTT GCAGGGCCAG TGCAGTTGCT GACCTTCGGC AGCACGCAAT ATAAGTGGAA TCCCACAAGG GAAGGCGGCT TCCCCGATCC AGATGGGCCG GTTGCAAAAT CCAGCATTAA CGCATCGGCC GACACGGTTT ATGAATTGCC GAAAGCATCC ATGACTGTAA TTCGCGGGTC GCTTTCACAT CAGTAA
|
Protein sequence | MKCIRKQCLF VSLIAICSSV IGLGQTIRVD ISHPTNSFVP NESLGAGIDR IGAAAAAKYF SEPPSKAVLD AGWQPVTYRQ NTELAVEAWH WNPQGTWSDP SGKGYFTGSA EPGPEIIKHS YGYVLPHRGF TRNDGTDTSG FGRMTDGDLN TYWKSNPYLT KHFTGQEDSE HPQWIVIDLA TTQSINALRI AWAEPYAKQY LIQYWTGDDA IKLPSKGSWV AFPAGVVSAG KGGTVTHQLS NSPMPVRFLR VWMTEPSNTC DTHGSADIRN CVGYAIREVF IGTTDRNGFH DVARHTPDQD QTTTYCSSID PWHEPSDLGS TQHEHVGMDL FYRSGYTRGL PAMIPTALLY GTPEDSAAEI AYVEKRGYPI SYVELGEEPD GQYTLPEDDA ELYLQWARAI HKVDPKLKLG GPVFTGQNED ILSWPDAQGQ TSWTRRFLNY LRARGGLREL AFFSFEHYPF EPCKVNWSSL YDEPQLMTHI MQVWRDDGLP ADVPMFVTES NITWNSGESS VDIFGALWLA DYVGSFFTAG GKGLYYFHYL PLGVHPGCNQ SGGTFGMFTT KGNFEVDKPT SQFFSSQLIN TEWVQPGDGV HETYAATGDL MDAAGHALIT AYAVKRPDGQ WSLLVVNRDQ ENAHKVTIDF SDSGRGKTGF AGPVQLLTFG STQYKWNPTR EGGFPDPDGP VAKSSINASA DTVYELPKAS MTVIRGSLSH Q
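
The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that formatting, compute GC content, and run a basic sanity check on a coding sequence. It is illustrative only: the helper names are hypothetical, and checking for an ATG start is a simplification, since translation table 11 also permits alternative start codons.

```python
# Stop codons in translation table 11 (same as the standard code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove the spaces/newlines between blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_complete_cds(raw: str) -> bool:
    """Simplified check: whole codons, ATG start, recognized stop codon."""
    seq = clean_sequence(raw)
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")   # assumption: ignores alternative starts
        and seq[-3:] in STOP_CODONS
    )


print(clean_sequence("ATGAAATGTA TTCGGAAGCA"))  # ATGAAATGTATTCGGAAGCA
print(looks_like_complete_cds("ATG AAA TAA"))   # True
```

Passing the full Gene sequence field to `gc_percent` would give a value to compare against the GC content field in the record; the result is not reproduced here.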