Gene Information |
Locus tag | Rxyl_2538 |
Symbol | |
ID | 4115598 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rubrobacter xylanophilus DSM 9941 |
Kingdom | Bacteria |
Replicon accession | NC_008148 |
Strand | - |
Start bp | 2557768 |
End bp | 2560899 |
Gene Length | 3132 bp |
Protein Length | 1043 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 638037309 |
Product | coagulation factor 5/8 type-like protein |
Protein accession | YP_645267 |
Protein GI | 108805330 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
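The coordinate, length, and protein-length fields above can be cross-checked with simple arithmetic. The sketch below is only a consistency check on the values in this record. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information rows above.
start_bp = 2557768
end_bp = 2560899
gene_length_bp = 3132
protein_length_aa = 1043

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
# Assumption: the last codon is the stop codon and adds no residue.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # span matches "Gene Length"
print(remainder == 0)                            # length is a multiple of 3
print(expected_protein_aa == protein_length_aa)  # matches "Protein Length"
```

Under these assumptions all three checks agree with the record: 3132 bp is 1044 codons, or 1043 residues plus a stop codon.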
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCCTCG CGCTGGTGAT TCTGGCCTTC GCTTCTGGAG AGTCCCGGGC GCTGAACGGC GGCGTGCAGG AGGAGGCCGG TTCAGGAGCG GGGGAGAGGA CGTTCGACGC TCGCACCTCC CGCGTCGAGC CGACAAAAGA GCAACGCGAG GCCGCAAGCC GGCTTCTCCG GAAAGCCGGC GCCGGGACCC GGATGGCGTG GGACCGCAGG TTCGGCACGC CGCACACCAT CTACCCGGCA CGCGGCACCC TCACCGGCCC GCGGGAGGGC ACACCGGCAG AGATTGCCCG GTCTTGGATC CGGGAGAACC GCGCCCTCTT CGGCCTCGCC GCGGGCGACG TGGATGACCT GAACGTTACG CAGAGCCACG GGCTGAGCAC CGGCACCCGC ATCATCTCCT TCGCCCAGAC CTTCGATGGC GTCGAGGCGG TGCATGGCGG CTCCATGACC GTCGTGGTGC GCAAGGACGG GTCGGTCGAG TCCTATGCCG GGCAGTCGGC GCGTCACGGC GAGCTGTCCG GGGGCTGGGA GCTCAGCGGC GCCCAGGCGC TGGAGAGGGT CGCGGGCAGG CTCGCCGGGG CCACCGCGTT TGCCGCCGAG GCTGTCGGCG AGCAGGCCGG GTACACGCAG TTCGCCAGGG GAGACTTCGC GGCCGGCTCC TACGTCAGGA AGGCGGTCTT CGTCACGGGG GAGGGGGCGC TTCCGGCCTA CCAGGTGCTC TTCATCGAGA AGCTCGACCG GGCCTGGGAC GTGATGGTGG ATGCGCGGAC CGGCGAGATC CTGTTCCGCG ACAGCCTCGT GGACCACAGC AGGGCCGAGG GGACCGTCTA CGAGAACTTC CCGGGAGACG CCGGGAGCGG CGGCCAGCCC GTCATCAAGA GCTTCGAGCC CACGCCGCAG TCCCCGTCGG GATACGTGGA CCCGACCGGC CTCGCCGGGC TCGACGGCCC CACGACGCTC GGCAACAACG CCAACAGCTA CGCCAACTGG TCGAACTTTT TGGTCCCCGC TGATCAGGCG CCGCGCCCCG TGAGCCCGCT CTCGCACTTC AACTACTCCT TCAGCGATCA CTGGGGCCGC TCGGGCTGCC AGGCCGTACC GCCTTCTTAC GCCCAGGACC TAGAGCCGGC GGCGACCAAC CTCTTCTACC ACCACAACCG CATCCACGAC GAGTTCTATC GCCTCGGGTT CACCGAGACG GGCGACAACT TCCAGGTCAA CAACTACGGC AGGGATTCCG GTGGGGGGGA CCCGATCCTG GGGCTGGTGC AGGCGGGAGC TGCCACCGGC GGCGCGCCGC TCTATACCGG GCGCGACAAC GCCTACATGC TCACGTTGCC CGACGGCATA CCGCCGTGGA GCGGGATGTT CCTGTGGGAG CCGATCAACG ACGCCTTCGA GGGGCCCTGC CGCGACGGCG ACTTCGACGC GTCGGTGATC GAGCACGAGT ACGCCCACGG CCTCACCAAC CGCTACGTCT CCGCGGAGGA CAACGCCCTG GGCACCCACC AGTCCGGCTC GATGGGCGAG GGCTGGGGCG ACTGGTACGC CCTGAACTAC CTCCACCGCG AGGGGCTCTA CGGCAAGTCC GTAGTCGGCG AGTACGTCAC CGGCAACCCC GCACGGGGCA TCCGCAACTG GAGTTACGGC AAGAACCCGA CCACCTACGC GGACATCGGC TACGACATCG TCGGTCCCGA GGTGCACGCG GACGGTGAGA TCTGGACCGC AATACTTTGG GACTACCGGC AGGCCCTCGT CGCGGCGTTC GGGCAGGCTA GGGGCGCGGA GATCGCCCAG 
AGGACCATCA CCGACGCTAT GCCGCGCTCT CCGGCAAACC CCTCGTTCCT CGACATGCGC GACGCCATCG CGCTGGCCAT CGACGACCGC TACCACGACT CGTCGAGCTA CGAGAGGATC TTCGACGTCT TCTGGACGGA GTTCGCCCGG CGCGGCGCGG GCTTCCACGC CCGGACCGCG GGCGGCGACG ACCTGGACCC GACGCCGGCC TTCGACCATC CGAACGGCTC GCGAAACGGG ACGCTCGTCG GCAGGGTCGT CAACGCGGCT ACCGGCGAGC CCGTGGCGGA CGCCCGCGTG ATGCTCGGGC GGTTCGAGGC ACGCGTCTCG CCGCTGCGCA CCACCGGCTC CGGGGGCGGC TTCTCCGCCC CGGTCGTTGC GGGGACCTAC CCGGTCACGA TCCAGGCCCG CGGCTTCGGA TCGCGCACCT TCGAGAACGT GAGGGTGAGG GCGGGCGAGA CGACCTCGCT GCGCTTCCCG CTCAGCCCGA ACCTGGCCTC CGGGGCGAAC GGCGCGAAGG TCGTCTCCGC GACGAGCCCC AACGCTCGCG CGCTCATAGA CGATACCGAG GCCAGCACCT GGACGAGCCG GAGGCGGGGC AACGCCGTAA TCGAGCTGGC CAGGCCGGCG GAGATCACAT CCGTGCGGGT CAGCGCATAC ACGACCTCCC GCTTCGAGGC CCTGCGCGAC TTCACGCTCC AGGTCTCGAC CGACGGCAGC GTGTGGCGCA ACGCGCTCAT CGAGAAGGAG GCGTTCGCCT ACCGGAAGCC GCGCCCGGTG GCGCCCGACG TGCACTACAA GACGTTCCGG CTCGAGAACC CCACACGCGC GAAGTTCGTG CGCTTCTACA CCGACGCCCC GATGGGCGAG ACGAAGGCCC GGGTGCAGGC CGCCGAGCTT CAGGTCTTCT CCGGCACGGT CAAGGACGTC GAGCCGCTGC CGCCACCGCC GCCGGACCCG CCCTACGAGG ACGAGGGAAC CATCGCCTTC GGCACCCCCG TGGGGGACGC CACGAGCGGC GGCGTAACCG CGGTGGACTT CCAGACGAAC TGCACCTTCC CGCCGGCTAC CCAGGGCTCG GACGGTTGGG TCACGAGGCT CCCGGAGTCC TTCGGGGACG GGCTGCGCCA GGTCTCGGTG AAGGGCACCT CGCCTGCGCC GCACGACCTG GACCTGTACT TCTACTCCGC CGACTGTGAG CTGACCGGTT CCGCGGCGTC CGCGGCGGCG GACGAGTCGG GGACCATCCC CAGCGGCACC CGCTATGTTC TCACTCACCT GTGGCTGGGG GCGGGCGAAA GCTTCGAGCT GAGGGCGACA GATGCCCGGT AG
|
Protein sequence | MALALVILAF ASGESRALNG GVQEEAGSGA GERTFDARTS RVEPTKEQRE AASRLLRKAG AGTRMAWDRR FGTPHTIYPA RGTLTGPREG TPAEIARSWI RENRALFGLA AGDVDDLNVT QSHGLSTGTR IISFAQTFDG VEAVHGGSMT VVVRKDGSVE SYAGQSARHG ELSGGWELSG AQALERVAGR LAGATAFAAE AVGEQAGYTQ FARGDFAAGS YVRKAVFVTG EGALPAYQVL FIEKLDRAWD VMVDARTGEI LFRDSLVDHS RAEGTVYENF PGDAGSGGQP VIKSFEPTPQ SPSGYVDPTG LAGLDGPTTL GNNANSYANW SNFLVPADQA PRPVSPLSHF NYSFSDHWGR SGCQAVPPSY AQDLEPAATN LFYHHNRIHD EFYRLGFTET GDNFQVNNYG RDSGGGDPIL GLVQAGAATG GAPLYTGRDN AYMLTLPDGI PPWSGMFLWE PINDAFEGPC RDGDFDASVI EHEYAHGLTN RYVSAEDNAL GTHQSGSMGE GWGDWYALNY LHREGLYGKS VVGEYVTGNP ARGIRNWSYG KNPTTYADIG YDIVGPEVHA DGEIWTAILW DYRQALVAAF GQARGAEIAQ RTITDAMPRS PANPSFLDMR DAIALAIDDR YHDSSSYERI FDVFWTEFAR RGAGFHARTA GGDDLDPTPA FDHPNGSRNG TLVGRVVNAA TGEPVADARV MLGRFEARVS PLRTTGSGGG FSAPVVAGTY PVTIQARGFG SRTFENVRVR AGETTSLRFP LSPNLASGAN GAKVVSATSP NARALIDDTE ASTWTSRRRG NAVIELARPA EITSVRVSAY TTSRFEALRD FTLQVSTDGS VWRNALIEKE AFAYRKPRPV APDVHYKTFR LENPTRAKFV RFYTDAPMGE TKARVQAAEL QVFSGTVKDV EPLPPPPPDP PYEDEGTIAF GTPVGDATSG GVTAVDFQTN CTFPPATQGS DGWVTRLPES FGDGLRQVSV KGTSPAPHDL DLYFYSADCE LTGSAASAAA DESGTIPSGT RYVLTHLWLG AGESFELRAT DAR
|
| |
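As a spot check that the gene and protein sequences above correspond, the first 30 and last 12 nucleotides of the listed gene sequence can be translated by hand. The sketch below is a minimal illustration, not a full translator: its codon table is deliberately partial and covers only the codons in these two excerpts (their assignments are the same in the standard code and in translation table 11). Although the record gives the strand as "-", the sequence as listed already begins with ATG, so the sketch assumes it is in coding orientation and applies no reverse complement. The function name `translate` and the variable names are hypothetical.

```python
# Partial codon table: ONLY the codons that occur in the two excerpts below.
# A real translation needs the full 64-codon table for translation table 11.
CODONS = {
    "ATG": "M", "GCC": "A", "CTC": "L", "GCG": "A", "CTG": "L",
    "GTG": "V", "ATT": "I", "TTC": "F", "GAT": "D", "CGG": "R",
    "TAG": "*",  # stop codon
}


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()  # drop the 10-base grouping spaces
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODONS[dna[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# Excerpts copied from the "Gene sequence" row (start and end of the CDS).
first_30 = "ATGGCCCTCG CGCTGGTGAT TCTGGCCTTC"
last_12 = "GATGCCCGGT AG"

print(translate(first_30))  # compare with the start of the protein sequence
print(translate(last_12))   # compare with the end of the protein sequence
```

The two results, `MALALVILAF` and `DAR`, match the first ten and last three residues of the protein sequence listed above.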