Gene Information |
Locus tag | Xcel_1991 |
Symbol | |
ID | 8649521 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Xylanimonas cellulosilytica DSM 15894 |
Kingdom | Bacteria |
Replicon accession | NC_013530 |
Strand | - |
Start bp | 2146998 |
End bp | 2148647 |
Gene Length | 1650 bp |
Protein Length | 549 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | RNA polymerase, sigma 70 subunit, RpoD subfamily |
Protein accession | YP_003326568 |
Protein GI | 269956779 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.918536 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGAAAG GTCGGTTTGT GGCGTCCACG TCCCTAAGTA CCGCACAGCA GAACATCACG CTGCCCCGCG AGTTCGAGCA CCCAGGTCTG AAGGACCTCC TCGCCCGCGG CGCTGCCGAG GGCCGCGTCG ATGCCGAGTC GTTCCGCGCT GCCTGCGAGG GCGCCGGTGT CGGCGACGCC AAGCGCCTCA AGGCAGTCCT CAAGGCGCTG TCTCTCGCGG GGATCGAGGT CGGAATGCCG ACGGTGGCAA AGGTCGCCGC CGCGACGTCG ACCCGCTCCA CGCGCACGAC GGCGAAGGCA CCGGCGACCC GCAAGACGGC CACGAAGGCC GCCACGGCCT CCGGGGACGC CGACGACGCC GCTGAGGCGA CCGAGGCACC CGCCGCGAAG GCCGCGGCCA AGGCGCCCGC CAAGAAGGCT GCCCCCGCAC GCAAGAGCGC CGCGAAGGCG CCCGCCAAGA AGACCGCCGC GTCGGCGAAG TCCGCGAAGG GCGGCGACGA CGCCGTCGAG GACGACGTCG AGATCGACGA GACCGAGCTC GTGGCGGTCG ACGCCGACGT GACGGACACG GACGACGCTG CCGAGACCAC CACGGACGAC GCCGGTGAGG CGAAGCCTGC GGAGAAGGAG GGCGAGAAGG AGGAAGACGC CGGCTTCGTC GTCTCCGACT CCGACGACGA GGACCAGCCC GTCCAGCAGG TCGTCACGGC CGGTGCGACG GCCGACCCGG TCAAGGACTA CCTCAAGCAG ATCGGCAAGG TCGCGCTGCT GAACGCCGAG CAGGAGGTCG AGCTCGCGAA GCGCATCGAG GCCGGCCTGT TCGCGGAGGA GAAGCTCTCC AAGGACTTCG CCGACTTCGA CCGGACCAAG GCGGACGCTG ACACGCGCCG GCTGGTGCGT GACCTGCAGT GGATCGCCCA CGACGGCAAG CGCGCCAAGA ACCACCTGCT CGAGGCCAAC CTGCGCCTGG TCGTCTCGCT CGCCAAGCGC TACACCGGCC GCGGCATGCT GTTCCTGGAC CTGATCCAGG AGGGAAACCT CGGTCTGATC CGCGCGGTCG AGAAGTTCGA CTACACCAAG GGCTACAAGT TCTCGACGTA CGCCACCTGG TGGATCCGTC AGGCGATCAC CCGCGCCATG GCAGATCAGG CGCGCACCAT CCGCATCCCG GTGCACATGG TCGAGGTCAT CAACAAGCTG GCCCGCGTGC AGCGCCAGAT GCTCCAGGAC CTGGGCCGCG AGCCCACCCC GGAGGAGCTG GCCAAGGAGC TGGACATGAC CCCCGAGAAG GTGGTCGAGG TCCAGAAGTA CGGCCGCGAG CCCATCTCGC TGCACACGCC GCTGGGCGAG GACGGCGACT CGGAGTTCGG TGACCTCATC GAGGACTCCG AGGCCGTGGT CCCGGCCGAC GCCGTGTCCT TCACGCTGCT CCAGGAGCAG CTCCACCAGG TGCTCGACAC GCTCTCCGAG CGTGAGGCCG GCGTGGTCTC GATGCGCTTC GGCCTGCAGG ACGGCCAGCC CAAGACGCTC GACGAGATCG GCAAGGTCTA CGGCGTGACG CGCGAGCGCA TCCGCCAGAT CGAGTCCAAG ACGATGTCGA AGCTGCGCCA CCCGTCGCGC TCGCAGGTGC TGCGGGACTA CCTCGACTGA
|
Protein sequence | MPKGRFVAST SLSTAQQNIT LPREFEHPGL KDLLARGAAE GRVDAESFRA ACEGAGVGDA KRLKAVLKAL SLAGIEVGMP TVAKVAAATS TRSTRTTAKA PATRKTATKA ATASGDADDA AEATEAPAAK AAAKAPAKKA APARKSAAKA PAKKTAASAK SAKGGDDAVE DDVEIDETEL VAVDADVTDT DDAAETTTDD AGEAKPAEKE GEKEEDAGFV VSDSDDEDQP VQQVVTAGAT ADPVKDYLKQ IGKVALLNAE QEVELAKRIE AGLFAEEKLS KDFADFDRTK ADADTRRLVR DLQWIAHDGK RAKNHLLEAN LRLVVSLAKR YTGRGMLFLD LIQEGNLGLI RAVEKFDYTK GYKFSTYATW WIRQAITRAM ADQARTIRIP VHMVEVINKL ARVQRQMLQD LGREPTPEEL AKELDMTPEK VVEVQKYGRE PISLHTPLGE DGDSEFGDLI EDSEAVVPAD AVSFTLLQEQ LHQVLDTLSE REAGVVSMRF GLQDGQPKTL DEIGKVYGVT RERIRQIESK TMSKLRHPSR SQVLRDYLD
|
| |
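The coordinates, gene length, and protein length listed under Gene Information can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes that Start bp and End bp are 1-based and inclusive, and that the protein length excludes the terminal stop codon; both assumptions agree with the values in this record but are not stated by the source. The function names `span_length`, `expected_protein_length`, and `gc_percent` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table in this record.
START_BP = 2146998
END_BP = 2148647
GENE_LENGTH_BP = 1650
PROTEIN_LENGTH_AA = 549


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one stop codon, assuming an unspliced CDS."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    cleaned = "".join(sequence.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


if __name__ == "__main__":
    # Both checks should agree with the table if the assumptions hold.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Passing the Gene sequence field (with its space-separated 10-base blocks) to `gc_percent` gives a value that can be compared against the listed GC content, which the record reports rounded to a whole percent.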