Gene Rxyl_3090 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRxyl_3090 
Symbol 
ID4114889 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRubrobacter xylanophilus DSM 9941 
KingdomBacteria 
Replicon accessionNC_008148 
Strand
Start bp3099621 
End bp3101579 
Gene Length1959 bp 
Protein Length652 aa 
Translation table11 
GC content70% 
IMG OID638037857 
Productpolysaccharide biosynthesis protein CapD 
Protein accessionYP_645809 
Protein GI108805872 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1086] Predicted nucleoside-diphosphate sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGACGGA TACGCATCCC GGGGCCGGTG AAGGAGCGGG CCACCCGCCC GTTCCTCTGG 
CTGCAGCGCA GCTTCCGGCA GTCGCCGCCC GCCTTCCGGC GCGGGGCGGC GATGCTGGTG
GACGCCGCCA TCGTCATGGA GTCCTTTGCG GTGGCGCTGC TGTTCCGGTT CGAGGGGGAC
GTGCAGTGGG AGTTCTGGGT CTCCTTCTGG CCGTTCGCGC TGGCCGGGGC GGCGCTCTTC
GTGCTGCTCC TGCACATAAA CGGCGTCTAC AAGAGCATCC TCCGCTACAC CGGCATCTAC
CAGGGGGTGC GCATCGCGAG CGCCACCTCC ATCGCCACCG GGCTTCTGTT CATCGCCGAC
GTGACCTTCG ACGAGGTGCT CGGCTACTAC CCCGCCCCGC GCTCGGTGGT CCTGGTGGGG
GCGGCGCTGG CGTACATGCA GCTCGTTGCG GTGAGGCTCT ACCCGCGGGT CTTCTACGAG
CTCTCGCTGC GGGAGGTGGG CCGCCGGAAG CGGACCGCCA TCGTGGGGAC GGGAGAGCAG
GGGGTCGCGC TCGCGGGCCA CATCTGGCGC ACGGCGGCGA TGAACACGCA GGTGGTGGGC
TTCGTGAGCG ACAGCCCCTC GGAGGTCGGC AACCACATCG AGGGGGTCCC GGTCGTCGGG
ACCATAGACG GGATCGAGGA GATCATCGCG GGCCACGGCC TGGACCAGGT GATCATCGCG
ACCCCGCAGG CCAGCCGGGA GCAGGTGGAC CGCATCTGGC GCACCTGCGT CCGCTCCCGG
GTCGAGGTGA AGGTGATGCC GGACCTGGGG GAGCTGCTCG CCGAGGGGAC CATCCGCCTG
AGGGAGCTGC AGATAGAGGA CCTCCTGGGC CGCGAGCCGG TGGACATAGA CCTCGAGGCC
CTCTCCGGCT ACATAAACGG CAAGCGGGTA CTCGTCACCG GGGCGGGGGG CTCCATCGGC
AGCGAGCTCT CGCGCCAGAT CTCTCGCCTC GGGCCCGCCA GGCTCGTGCT GATGGACCGC
GACGAGAGCG GGCTCTACTA CCTGGGCGGG GAGCTGCGCC GGGAGGAGTT CAACGCCGCC
GAGCTCGTCG TCGGGGACGT GACCAACCCC GAGCGGGTGG GCTACGTCTT CGAGCGGTTC
CGGCCGCAGC TGGTCTTCCA CGCCGCCGCC TACAAGCACG TCCCCATGAT GGAGCTGCAG
GCCACCGAGG CGATCATCAA CAACGTCTAC GGCACCCTCA ACGTGGCCCG GGCGGCCGGG
GCCTACGGGG CGGAGAAGTT CGTCAACGTC TCCACCGACA AGGCCGTCAA CCCCGCCAAC
GTGATGGGGG CCACCAAGCG GCTCTCGGAG ATGATCGTGC GGGAGATGGC CGGGGTGTAC
CCGGAGACGG TCTACGCCTC GGTGCGCTTC GGCAACGTGC TCGGGAGCCG CGGCTCGGTG
GTGCCGACCT TCCGCCAGCA GATAGAGGCC GGGGGGCCGG TGACGGTGAC GCACCCGGAG
ATGATCCGGT ACTTCATGAC CATCCCGGAG GCGGTCTCCC TGATCCTGCA GGCCGGGGCG
ATGGCCGAAG GATACGCGAC CTACGTGCTG GAGATGGGCC GCCCGGTGCG CATCCTGGAC
CTGGCGCGCA ACATGATCGA GATAATGGGC GCCCCGGACG TCCAGATAAA GTTCGTCGGC
CTGAGGCCCG GCGAGAAGCT GCGGGAGGAG CTCTCGGAGG AGGGCGAGCA GCGGCTGCCC
ACCGGCCACC CGATGGTCTA CCGGCTGGTC TCGGAGAACG AGGGCCCGCC CGGCGGGGGC
GACCTGCAGG AGCTGGTGGA CGCGATGGTC TACGAGGCCA GAAACCAGGA GGCCGAGCGG
GCGGTGCGCC TCCTGCAGCG GGCGGTGCCG AACTACTCGG CCGCAGACCT CCCGGAGGTG
GCCCCCCGGG AGGAGGACGG GCCCTTCTTC TCCTCCTAG
 
Protein sequence
MRRIRIPGPV KERATRPFLW LQRSFRQSPP AFRRGAAMLV DAAIVMESFA VALLFRFEGD 
VQWEFWVSFW PFALAGAALF VLLLHINGVY KSILRYTGIY QGVRIASATS IATGLLFIAD
VTFDEVLGYY PAPRSVVLVG AALAYMQLVA VRLYPRVFYE LSLREVGRRK RTAIVGTGEQ
GVALAGHIWR TAAMNTQVVG FVSDSPSEVG NHIEGVPVVG TIDGIEEIIA GHGLDQVIIA
TPQASREQVD RIWRTCVRSR VEVKVMPDLG ELLAEGTIRL RELQIEDLLG REPVDIDLEA
LSGYINGKRV LVTGAGGSIG SELSRQISRL GPARLVLMDR DESGLYYLGG ELRREEFNAA
ELVVGDVTNP ERVGYVFERF RPQLVFHAAA YKHVPMMELQ ATEAIINNVY GTLNVARAAG
AYGAEKFVNV STDKAVNPAN VMGATKRLSE MIVREMAGVY PETVYASVRF GNVLGSRGSV
VPTFRQQIEA GGPVTVTHPE MIRYFMTIPE AVSLILQAGA MAEGYATYVL EMGRPVRILD
LARNMIEIMG APDVQIKFVG LRPGEKLREE LSEEGEQRLP TGHPMVYRLV SENEGPPGGG
DLQELVDAMV YEARNQEAER AVRLLQRAVP NYSAADLPEV APREEDGPFF SS