Gene Rxyl_2538 details


Gene Information

Locus tag: Rxyl_2538
Symbol:
ID: 4115598
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rubrobacter xylanophilus DSM 9941
Kingdom: Bacteria
Replicon accession: NC_008148
Strand:
Start bp: 2557768
End bp: 2560899
Gene Length: 3132 bp
Protein Length: 1043 aa
Translation table: 11
GC content: 70%
IMG OID: 638037309
Product: coagulation factor 5/8 type-like protein
Protein accession: YP_645267
Protein GI: 108805330
COG category:
COG ID:
TIGRFAM ID:
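
The start, end, and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(2557768, 2560899)
print(length, protein_length_aa(length))  # 3132 1043
```

Under those assumptions, both values agree with the Gene Length and Protein Length fields of this record.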


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCCTCG CGCTGGTGAT TCTGGCCTTC GCTTCTGGAG AGTCCCGGGC GCTGAACGGC 
GGCGTGCAGG AGGAGGCCGG TTCAGGAGCG GGGGAGAGGA CGTTCGACGC TCGCACCTCC
CGCGTCGAGC CGACAAAAGA GCAACGCGAG GCCGCAAGCC GGCTTCTCCG GAAAGCCGGC
GCCGGGACCC GGATGGCGTG GGACCGCAGG TTCGGCACGC CGCACACCAT CTACCCGGCA
CGCGGCACCC TCACCGGCCC GCGGGAGGGC ACACCGGCAG AGATTGCCCG GTCTTGGATC
CGGGAGAACC GCGCCCTCTT CGGCCTCGCC GCGGGCGACG TGGATGACCT GAACGTTACG
CAGAGCCACG GGCTGAGCAC CGGCACCCGC ATCATCTCCT TCGCCCAGAC CTTCGATGGC
GTCGAGGCGG TGCATGGCGG CTCCATGACC GTCGTGGTGC GCAAGGACGG GTCGGTCGAG
TCCTATGCCG GGCAGTCGGC GCGTCACGGC GAGCTGTCCG GGGGCTGGGA GCTCAGCGGC
GCCCAGGCGC TGGAGAGGGT CGCGGGCAGG CTCGCCGGGG CCACCGCGTT TGCCGCCGAG
GCTGTCGGCG AGCAGGCCGG GTACACGCAG TTCGCCAGGG GAGACTTCGC GGCCGGCTCC
TACGTCAGGA AGGCGGTCTT CGTCACGGGG GAGGGGGCGC TTCCGGCCTA CCAGGTGCTC
TTCATCGAGA AGCTCGACCG GGCCTGGGAC GTGATGGTGG ATGCGCGGAC CGGCGAGATC
CTGTTCCGCG ACAGCCTCGT GGACCACAGC AGGGCCGAGG GGACCGTCTA CGAGAACTTC
CCGGGAGACG CCGGGAGCGG CGGCCAGCCC GTCATCAAGA GCTTCGAGCC CACGCCGCAG
TCCCCGTCGG GATACGTGGA CCCGACCGGC CTCGCCGGGC TCGACGGCCC CACGACGCTC
GGCAACAACG CCAACAGCTA CGCCAACTGG TCGAACTTTT TGGTCCCCGC TGATCAGGCG
CCGCGCCCCG TGAGCCCGCT CTCGCACTTC AACTACTCCT TCAGCGATCA CTGGGGCCGC
TCGGGCTGCC AGGCCGTACC GCCTTCTTAC GCCCAGGACC TAGAGCCGGC GGCGACCAAC
CTCTTCTACC ACCACAACCG CATCCACGAC GAGTTCTATC GCCTCGGGTT CACCGAGACG
GGCGACAACT TCCAGGTCAA CAACTACGGC AGGGATTCCG GTGGGGGGGA CCCGATCCTG
GGGCTGGTGC AGGCGGGAGC TGCCACCGGC GGCGCGCCGC TCTATACCGG GCGCGACAAC
GCCTACATGC TCACGTTGCC CGACGGCATA CCGCCGTGGA GCGGGATGTT CCTGTGGGAG
CCGATCAACG ACGCCTTCGA GGGGCCCTGC CGCGACGGCG ACTTCGACGC GTCGGTGATC
GAGCACGAGT ACGCCCACGG CCTCACCAAC CGCTACGTCT CCGCGGAGGA CAACGCCCTG
GGCACCCACC AGTCCGGCTC GATGGGCGAG GGCTGGGGCG ACTGGTACGC CCTGAACTAC
CTCCACCGCG AGGGGCTCTA CGGCAAGTCC GTAGTCGGCG AGTACGTCAC CGGCAACCCC
GCACGGGGCA TCCGCAACTG GAGTTACGGC AAGAACCCGA CCACCTACGC GGACATCGGC
TACGACATCG TCGGTCCCGA GGTGCACGCG GACGGTGAGA TCTGGACCGC AATACTTTGG
GACTACCGGC AGGCCCTCGT CGCGGCGTTC GGGCAGGCTA GGGGCGCGGA GATCGCCCAG
AGGACCATCA CCGACGCTAT GCCGCGCTCT CCGGCAAACC CCTCGTTCCT CGACATGCGC
GACGCCATCG CGCTGGCCAT CGACGACCGC TACCACGACT CGTCGAGCTA CGAGAGGATC
TTCGACGTCT TCTGGACGGA GTTCGCCCGG CGCGGCGCGG GCTTCCACGC CCGGACCGCG
GGCGGCGACG ACCTGGACCC GACGCCGGCC TTCGACCATC CGAACGGCTC GCGAAACGGG
ACGCTCGTCG GCAGGGTCGT CAACGCGGCT ACCGGCGAGC CCGTGGCGGA CGCCCGCGTG
ATGCTCGGGC GGTTCGAGGC ACGCGTCTCG CCGCTGCGCA CCACCGGCTC CGGGGGCGGC
TTCTCCGCCC CGGTCGTTGC GGGGACCTAC CCGGTCACGA TCCAGGCCCG CGGCTTCGGA
TCGCGCACCT TCGAGAACGT GAGGGTGAGG GCGGGCGAGA CGACCTCGCT GCGCTTCCCG
CTCAGCCCGA ACCTGGCCTC CGGGGCGAAC GGCGCGAAGG TCGTCTCCGC GACGAGCCCC
AACGCTCGCG CGCTCATAGA CGATACCGAG GCCAGCACCT GGACGAGCCG GAGGCGGGGC
AACGCCGTAA TCGAGCTGGC CAGGCCGGCG GAGATCACAT CCGTGCGGGT CAGCGCATAC
ACGACCTCCC GCTTCGAGGC CCTGCGCGAC TTCACGCTCC AGGTCTCGAC CGACGGCAGC
GTGTGGCGCA ACGCGCTCAT CGAGAAGGAG GCGTTCGCCT ACCGGAAGCC GCGCCCGGTG
GCGCCCGACG TGCACTACAA GACGTTCCGG CTCGAGAACC CCACACGCGC GAAGTTCGTG
CGCTTCTACA CCGACGCCCC GATGGGCGAG ACGAAGGCCC GGGTGCAGGC CGCCGAGCTT
CAGGTCTTCT CCGGCACGGT CAAGGACGTC GAGCCGCTGC CGCCACCGCC GCCGGACCCG
CCCTACGAGG ACGAGGGAAC CATCGCCTTC GGCACCCCCG TGGGGGACGC CACGAGCGGC
GGCGTAACCG CGGTGGACTT CCAGACGAAC TGCACCTTCC CGCCGGCTAC CCAGGGCTCG
GACGGTTGGG TCACGAGGCT CCCGGAGTCC TTCGGGGACG GGCTGCGCCA GGTCTCGGTG
AAGGGCACCT CGCCTGCGCC GCACGACCTG GACCTGTACT TCTACTCCGC CGACTGTGAG
CTGACCGGTT CCGCGGCGTC CGCGGCGGCG GACGAGTCGG GGACCATCCC CAGCGGCACC
CGCTATGTTC TCACTCACCT GTGGCTGGGG GCGGGCGAAA GCTTCGAGCT GAGGGCGACA
GATGCCCGGT AG
 
Protein sequence
MALALVILAF ASGESRALNG GVQEEAGSGA GERTFDARTS RVEPTKEQRE AASRLLRKAG 
AGTRMAWDRR FGTPHTIYPA RGTLTGPREG TPAEIARSWI RENRALFGLA AGDVDDLNVT
QSHGLSTGTR IISFAQTFDG VEAVHGGSMT VVVRKDGSVE SYAGQSARHG ELSGGWELSG
AQALERVAGR LAGATAFAAE AVGEQAGYTQ FARGDFAAGS YVRKAVFVTG EGALPAYQVL
FIEKLDRAWD VMVDARTGEI LFRDSLVDHS RAEGTVYENF PGDAGSGGQP VIKSFEPTPQ
SPSGYVDPTG LAGLDGPTTL GNNANSYANW SNFLVPADQA PRPVSPLSHF NYSFSDHWGR
SGCQAVPPSY AQDLEPAATN LFYHHNRIHD EFYRLGFTET GDNFQVNNYG RDSGGGDPIL
GLVQAGAATG GAPLYTGRDN AYMLTLPDGI PPWSGMFLWE PINDAFEGPC RDGDFDASVI
EHEYAHGLTN RYVSAEDNAL GTHQSGSMGE GWGDWYALNY LHREGLYGKS VVGEYVTGNP
ARGIRNWSYG KNPTTYADIG YDIVGPEVHA DGEIWTAILW DYRQALVAAF GQARGAEIAQ
RTITDAMPRS PANPSFLDMR DAIALAIDDR YHDSSSYERI FDVFWTEFAR RGAGFHARTA
GGDDLDPTPA FDHPNGSRNG TLVGRVVNAA TGEPVADARV MLGRFEARVS PLRTTGSGGG
FSAPVVAGTY PVTIQARGFG SRTFENVRVR AGETTSLRFP LSPNLASGAN GAKVVSATSP
NARALIDDTE ASTWTSRRRG NAVIELARPA EITSVRVSAY TTSRFEALRD FTLQVSTDGS
VWRNALIEKE AFAYRKPRPV APDVHYKTFR LENPTRAKFV RFYTDAPMGE TKARVQAAEL
QVFSGTVKDV EPLPPPPPDP PYEDEGTIAF GTPVGDATSG GVTAVDFQTN CTFPPATQGS
DGWVTRLPES FGDGLRQVSV KGTSPAPHDL DLYFYSADCE LTGSAASAAA DESGTIPSGT
RYVLTHLWLG AGESFELRAT DAR
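
Both sequences above are printed in space-separated blocks of 10, so they need cleaning before use. The following minimal sketch strips the grouping, computes GC fraction, and translates a gene fragment. It relies on translation table 11 assigning the same amino acids as the standard genetic code (it differs in which codons may initiate translation). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical and written for this illustration.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Table 11 uses the same
# assignments as the standard code; only the permitted start codons differ.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Simplification: an alternative start codon (e.g. GTG) is not converted
    to M here; this gene starts with ATG, so that case does not arise.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_block = "ATGGCCCTCG CGCTGGTGAT TCTGGCCTTC"
print(translate(first_block))  # MALALVILAF
```

The first 30 bases of the gene sequence translate to MALALVILAF, which matches the start of the protein sequence. Applying `translate` and `gc_fraction` to the full gene sequence would allow the same comparison against the complete protein and the listed GC content.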