Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Xcel_3077 |
Symbol | |
ID | 8650627 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Xylanimonas cellulosilytica DSM 15894 |
Kingdom | Bacteria |
Replicon accession | NC_013530 |
Strand | - |
Start bp | 3408528 |
End bp | 3411470 |
Gene Length | 2943 bp |
Protein Length | 980 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | |
Product | type III restriction protein res subunit |
Protein accession | YP_003327637 |
Protein GI | 269957848 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACCACGG ACTTCACCTG GAACCAGCAG TTCAGGCTCG ATGTCGACTT CGGCTTCCTC AGCACGCAGA CGCACGGAGC ACCGCGACAC CACAATCCAC GCATCATCCT GAACGGCGAC GGCAGCACGG TGCTGCACGA GATCCTCAGC GAGCTGCGTC GCTGCAGCTC CTTCACGTTC TCCGTCGCCT TCGTCGCGCC CGCAGCTGTT GCCCAGCTCA AGCAAGCCCT CGTGGAGTTC GACGGTGTCG GGCGGATCAT CACCTCCGAC TACCTCGGCT TCAACTCACC CGAGGCCTTC GCCGAGCTGC ACAATCTGCG GGCCCTCGGG ATCGACGTGC GCTTGCACAA TGCCGACGGG TTCCATCCCA AGGGCTACAT CTTCGAACAC GCAAACGCCG TGACGACGAT GATGGGCAGC TCGAATCTCA CCCCAAGCGC CCTGCTCAGG AACCACGAGT GGAACCTCAA GGTTTCGGCA TCGCCGGACA GTGACCTCGC CGCGCAGCTC GCACACCTCG TCGAGCTCCA GGTCGCGGAC TCGGTCCCGA TCACCCAGGA ATGGATCGAG CAGTACGCGA AGACGTACGT GCGGCCTGCG CAGCGACCGC CACGGGTGCC CCGCGACGTC TCAGCCCTGG TGCCGCACCC GACGGCGCCG CTTGAGGAAC GGTCGCCGTC GCTCCTGCCG ACGACGATCA CACCCAACCG GATGCAGCGT GATGCGCTGA ACGCGCTCGC CGCGGTCCGG GACAGTGGCG AGAAGCGTGC CATCGTCATC TCGGCGACCG GTACGGGCAA GACGATCCTG TCTGCGCTCG ACGTACGGGC CGTCAACCCG CGGCGCCTCC TCTTCGTCGT CCACCGTGAA CAGATCCTCG ACCGCACCAT CCAGGAGTAC CGCAAGGTGC TGGGCGGCGA CGCGAGCGAC TACGGCAAGC TGACCGGATC GTCGAAGGAC TTCGCCGCCC GCTACCTGTT CGCCACCGTG CAGACGCTGG CTCAGCCCGA CGTGCTCGCC CGGTTCCCCG CTGACGTCTT CGACTACGTC ATCGTCGACG AAGCCCACCG TGGGGGATCC CCGACGCATC GGCGGGTCAT CGGACATTTC GACCCGGTGT TCATGCTCGG CATGACGGCG ACACCCGAGC GCACCGACGG GTTCAACGTC TTCGAGCTCT TTCACTACAA CGTGCCGTAC GAGATCCGGC TCAACCGGGC ACTCGACGAG GACATGCTGA CCCCGTTCCA CTACTACGGG ATCACGGACG CCACGTTCGA CGACGACACG ACCGTCGACG CCCTCAGTGA TCTCGACCTG CTCGTCTCGC CGCAGCGGGT CAGCCATCTG ATCTGGGCAT TGGAGACGTA CGCCCAGGCC GGCACGGCCC CGCGCGGGTT GATCTTCTGC AGTCGCACCC AGGAGGCACG CCGGCTCTCG GACGTGCTGA ACCGCTCGCT GCTGCGTGGC CGCCCGTTGC GTACGGCGTC GCTCACCGGC GTGGACTCGA TCGAGCACCG CGAGAGAACC GTTGAGCAGC TGGAGTCGGG CGAGCTCGAC TACATCCTGA CCGTGGACGT GTTCAACGAG GGCGTGGACA TCCCCTCGAT CAACCAGGTC ATCATGCTGC GCCAGACGCA GTCGGCCATC GTGTTCGTCC AGCAGCTGGG GCGCGGGCTG CGCAAGTGCG ACCAGAAGGA GTACCTGGTC GTCCTCGACT TCATCGGGAA CTATGCCAAC AACTTCCTGA TCCCGATTGC GTTGTTCGGC GATGACTCGC TCAACAAGGA GTCCTTGCGC CAGAACCTCA TCGCCGTCGA GGAAGCAGGG GCGCTCCCGG GACTGTCCAG CGTGCAGTTC GACAAGATCG CGCAGCAGCG GGTCCTCGAG TCGATCCGGG ACACGAAGCT CGACGACATG GCGCGTCTCA AGGCAGCCGT CGTCGCGATG CGCAACCGGG TGGGCGCGGT TCCCCGGCTC TGGGACTTCT TCCGCTTCGA GTCTGTCGAC CCCGTCGTGC TCGCGACGAA GAAGGAGCAC TACCCGGCCC TGGTCAGGTC CCTTCTCAAG GAGGAGATCG ATCTGTCGGA GACGAGCAGT CGGGCCTTGC AGCTCCTGTC TCACGAGGTG CTTCCAGCGA AGCGAGGCCA CGAGTTCGTC CTGCTCCGCG CACTGCTCAA TGACGGCCGA TTGACGTCGT CGCAGATCGA CGAACGCTTT GCCGCGGCGG GTCTGCCGAC GTCGCCCGCC TACGTCAAGA GCGCGGTCGA CACGCTCACC CTGGACGGCT TTGCCGAGGC GGACGTCAGA CGCTACGGGA GCGGTATCGC CGTCCGCGAC GGCTCCGCCG TGACGCTCAA GAGCGAGGTC GCGTCCGCCT ATGCGTCGCC GACGGACTTC CCGAGTGCGG TCGACGACAT CATCGACACG GGACTGTCGC TCGTCCGCAA GGGGTTCGAC CCGTCTGCTC GCTTCACGGT CGGCCGCCAG TACTCCCGCA AGGAAGCGCT GCGACAGCTC TGCTGGCCAC GGAGCTGGGC ATCCACCGTC TATGGGTACA AGGTCGACCG TGCGAGCGGC GCCTGCCCGA TCTTCGTCAC CCTGCACAAG TCCGACGAGA TCTCGGCGAG CACCGCGTAC GAGGATGCCC TCCTCGATCC GTCGTCGATG CTCTGGTACA CGCGAAGCCG CCGCACGCTG CGGAGCTCCG AGGTTCAAGC GATCGTCTCG GGCGACGTGG CGCTGTACGT CTTCGTCAAG AAGGACGACG CCGAGGGGAC GGGCTTCTAC TTCCTTGGTC AGGCCACGGC GCATGACGCC GAGGAGGCGA CGATGCCGGA TGCGTCGGGC AACTCGCTCG ACGTCGTCCG CATGATCCTG CGCTTCGAGA AGCCCGTCGA GACGGCCCTC TTCGACTACT TCCACTCGAC GCTCATCGAC TGA
|
Protein sequence | MTTDFTWNQQ FRLDVDFGFL STQTHGAPRH HNPRIILNGD GSTVLHEILS ELRRCSSFTF SVAFVAPAAV AQLKQALVEF DGVGRIITSD YLGFNSPEAF AELHNLRALG IDVRLHNADG FHPKGYIFEH ANAVTTMMGS SNLTPSALLR NHEWNLKVSA SPDSDLAAQL AHLVELQVAD SVPITQEWIE QYAKTYVRPA QRPPRVPRDV SALVPHPTAP LEERSPSLLP TTITPNRMQR DALNALAAVR DSGEKRAIVI SATGTGKTIL SALDVRAVNP RRLLFVVHRE QILDRTIQEY RKVLGGDASD YGKLTGSSKD FAARYLFATV QTLAQPDVLA RFPADVFDYV IVDEAHRGGS PTHRRVIGHF DPVFMLGMTA TPERTDGFNV FELFHYNVPY EIRLNRALDE DMLTPFHYYG ITDATFDDDT TVDALSDLDL LVSPQRVSHL IWALETYAQA GTAPRGLIFC SRTQEARRLS DVLNRSLLRG RPLRTASLTG VDSIEHRERT VEQLESGELD YILTVDVFNE GVDIPSINQV IMLRQTQSAI VFVQQLGRGL RKCDQKEYLV VLDFIGNYAN NFLIPIALFG DDSLNKESLR QNLIAVEEAG ALPGLSSVQF DKIAQQRVLE SIRDTKLDDM ARLKAAVVAM RNRVGAVPRL WDFFRFESVD PVVLATKKEH YPALVRSLLK EEIDLSETSS RALQLLSHEV LPAKRGHEFV LLRALLNDGR LTSSQIDERF AAAGLPTSPA YVKSAVDTLT LDGFAEADVR RYGSGIAVRD GSAVTLKSEV ASAYASPTDF PSAVDDIIDT GLSLVRKGFD PSARFTVGRQ YSRKEALRQL CWPRSWASTV YGYKVDRASG ACPIFVTLHK SDEISASTAY EDALLDPSSM LWYTRSRRTL RSSEVQAIVS GDVALYVFVK KDDAEGTGFY FLGQATAHDA EEATMPDASG NSLDVVRMIL RFEKPVETAL FDYFHSTLID
|
| |