Gene Information |
Locus tag | Xcel_1948 |
Symbol | |
ID | 8649478 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Xylanimonas cellulosilytica DSM 15894 |
Kingdom | Bacteria |
Replicon accession | NC_013530 |
Strand | - |
Start bp | 2104758 |
End bp | 2106590 |
Gene Length | 1833 bp |
Protein Length | 610 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | Collagen triple helix repeat protein |
Protein accession | YP_003326525 |
Protein GI | 269956736 |
COG category | |
COG ID | |
TIGRFAM ID | |
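The coordinate and length fields in the table above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only and rests on two assumptions not stated in the record: that Start bp and End bp are 1-based inclusive coordinates, and that the gene length includes the 3 bp stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 2104758
END_BP = 2106590
STOP_CODON_BP = 3  # assumption: the reported gene length includes the stop codon


def gene_length(start, end):
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Number of amino acids encoded, excluding the stop codon."""
    return (gene_len_bp - STOP_CODON_BP) // 3


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1833 610
```

Under those assumptions the result matches the reported Gene Length (1833 bp) and Protein Length (610 aa).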
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.584968 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAGCACCA CCACGTCGAG CCCGACCCGT CACGGCGCAG GCGGGGCGCA CCCGGTCGGC TGCGGCTGCG GGGGCGGCGG CCAGTGCACC TACGGCCCGT TCACCCGCAA CAGCTACTGG TACGGCAAGC TGATGCTGCC GCAGGACTTC ATCGACGAGC AGCAGTACGT GCGCGACAAG ATCCGCCACC ACAACCAGCG GCTGCACGGC TGCGGCGTCG CGTGCGGGCT CGTCGTCGAG CAGGACACGG CGCCGGACTG CCGGGACCGG ATCGTCGTCA TCACGCCCGG GACCGCTATC GACTGCTGCG GCAACGAGAT CATCGTGACC GACCGCTACC GGCTCGAGCT GGCGACCCTG CCCGCACTCG CGGCACTGTC CGGACCTGGA GCCGATCCCG AGGCCGTGCA CGAAGTGCGG CTGTGCCTGC GCTACCGCGA GTGCGACACC GATCCGGTGC CGGTGCTGTA CGACGACTGC GGGGGCGACG ACGGCCAGAC CGCCCCGAAC CGGGTGCTCG AGTCCTGGGA CGTCGACGCC GTCGTCCTCC CGCCCGGGCC GGACGAGCCT GCGCCGGACG AGCCTGCGCC GGACGAGCCT GCGCCGGACG AGCCTGCGCC GGACGAGCCT GCGCCGGCGG AGGCTGGGCT GGAGGAGCCC GGGCTGGACG AGCCCGGGCT GGACGAGCCC GGGCCCGAGC CCGAGCCCGA CGTTCCGGCG ACGGGCGCCT GCACGGAGCA CTGGAACACG CTGCCCGGGT GCCCGATCTG CGAAGACTCG GCGTGCTCGT GCGTCGTGCT GGCCACCATC CACGGGTACC GGCCCGGCTT CGTGGTCCTC GACGCCGACG CCGAGGCGAC CGCCGAGGCC GACCTGGCGG CCCAGATCGC CCGCATCGAC AACCACGCAG GCCGGAGCGT GCTGCGCAGC ACGCAGGTGA TCAGCGAGAC CGTCGAGTGC CTGCTGGAGC ACGGCGGCAC GGGAGGGGAG ACCGGGCCTG CGGGGCCAGA GGGACCAGTG GGGCCGGCGG GGCCAGAAGG GCCAGAAGGG CCAGAAGGGC CGGCGGGGCC AGAGGGGCCC GCGGGGGAGA AGGGCGACAC GGGTGAACCC GGGCCGCAGG GTCCACCGGG GGCCGCCGGT GCCGCCGGTG CAGACGGTGC GACGGGTTCC GCAGGGGTGC CCGGCCCACA AGGGCCCGCA GGCCCGGGGC TGGAGGCGGG CCTGGTCCAG ATCGCCGCCC TGAGCTGGAC GCACGCCGAC ATGATCCTGG TCCAGGACCT GGAGACGCTG GAGATCGACG GCCGGCGGCG CCGGGGCGTG ACGATCCAGT TCACCGAGAC CGTGCACCTG GCCCCGGAGG GGTTCACGCT GCCTGACGTC CAGCACGTGC TCACGGTCGA GGCGCCGCAC GTGCGGTACA CGTTCCCGCC ACGAGAGGCG GAGGACCGGG AGGTGGCCGC GAAGCAGGCC GAGCTCGACG CCTTCTACCG GTGCCGCTGT CACGTGCTCG GCCAGGTGGT GCTGACCGAG GTGCAGGCGA TCGACGCGAC CGGCCGGATC ACGTTGGCGA CGGACAAGAC GAGCGAGCCC GACGCCCTGT CGTTCGTCTT CCATGAGCGC TTCCTCGACG CCTTGTTCGG CGCACTGGGG GACCCGCGCG GTGTCGACCT GTGGGTCAAG CTCCGCGGCG AGTTCGTGCT GGACGCGCGA GAACGCGCGG TGGACGCGGA GTTCGCGCGC GCCGGTCTCC CCACGGGCGA CCGCCCGAAG GGCGAGAAGC ACGGCATCCA GGGCGGCACC 
TTCGAGAGCT GGCTCTACCC GGTGCTCGAC TGA
|
Protein sequence | MSTTTSSPTR HGAGGAHPVG CGCGGGGQCT YGPFTRNSYW YGKLMLPQDF IDEQQYVRDK IRHHNQRLHG CGVACGLVVE QDTAPDCRDR IVVITPGTAI DCCGNEIIVT DRYRLELATL PALAALSGPG ADPEAVHEVR LCLRYRECDT DPVPVLYDDC GGDDGQTAPN RVLESWDVDA VVLPPGPDEP APDEPAPDEP APDEPAPDEP APAEAGLEEP GLDEPGLDEP GPEPEPDVPA TGACTEHWNT LPGCPICEDS ACSCVVLATI HGYRPGFVVL DADAEATAEA DLAAQIARID NHAGRSVLRS TQVISETVEC LLEHGGTGGE TGPAGPEGPV GPAGPEGPEG PEGPAGPEGP AGEKGDTGEP GPQGPPGAAG AAGADGATGS AGVPGPQGPA GPGLEAGLVQ IAALSWTHAD MILVQDLETL EIDGRRRRGV TIQFTETVHL APEGFTLPDV QHVLTVEAPH VRYTFPPREA EDREVAAKQA ELDAFYRCRC HVLGQVVLTE VQAIDATGRI TLATDKTSEP DALSFVFHER FLDALFGALG DPRGVDLWVK LRGEFVLDAR ERAVDAEFAR AGLPTGDRPK GEKHGIQGGT FESWLYPVLD
|
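The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below is a minimal illustration, not the pipeline used to produce this record. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TGA even though the gene is on the minus strand), and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces that group the sequence into 10-base blocks."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last blocks of the gene sequence, copied from the record.
first_codons = "ATGAGCACCA CCACGTCGAG CCCGACCCGT"
last_codons = "TTCGAGAGCT GGCTCTACCC GGTGCTCGAC TGA"
print(translate(first_codons))  # MSTTTSSPTR
print(translate(last_codons))   # FESWLYPVLD
```

The two outputs match the first and last ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the reported GC content; that figure is not recomputed here.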