Gene Rxyl_3023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRxyl_3023 
Symbol 
ID4115959 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRubrobacter xylanophilus DSM 9941 
KingdomBacteria 
Replicon accessionNC_008148 
Strand
Start bp3029834 
End bp3031024 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content61% 
IMG OID638037793 
Productpyridoxal-5'-phosphate-dependent enzyme, beta subunit 
Protein accessionYP_645745 
Protein GI108805808 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1171] Threonine dehydratase 
TIGRFAM ID[TIGR01747] diaminopropionate ammonia-lyase family 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCGGA TACGCGTGTT GCTTAACCCG CTTGCCTCTG CGAGCCCGGT AGATGCCGGG 
CCTCCTGGAA GAGAGCCTCT GGAGTTCCAC CGCCGCCTGC CGGGGTACGC CCCAACTCCT
CTCATGGATA CTCCAAGGCT GGCAGACTCT CTTGGTGTCA GACGGGTGTG GGTGAAGGAC
GAGTCGCACC GGCTCGGGCT GCCGGCCTTC AAGATCCTTG GTGCCTCCTG GGCCACCTAC
CGTGCCCTTG AAGAGCATGC AAGAGAGCGT CTTGGCAGCG GATTTGAGCC ATGGCAGAGC
ATTGAAGAGC TCAAGGAGAA GGTTGAGCAT CTCAAGCCCC TCACCCTAGC GGCGGCCACC
GACGGCAACC ACGGGCGGGC GGTTGCCCGC ATGGCGAGGT TGCTTGGGCT ACGCTCGCGT
ATCTTTGTGC CTTCAGAGAT GGTTCGGGCG CGCATCGAGG CCATTGAGTC CGAGGGAGCA
GAAGTTGTTG TGGTCGACGG GACCTACGAC GACGCCGTAG CGCGCTCGGC CGAGGAAGCG
GCCGAGCGGT GTCTGGTGAT CTCCGACACC TCTTGGCCCG GGTACACCAG TGTACCGCGC
TGGGTGATAG AAGGCTACTC GACCATTCTG TGGGAGATAG ACCGGGAGCT TGAGCGCCGC
GAAGAGAAGG GCCCCGACCT TGTGGTAGTC CAGATTGGGG TGGGTGCCTT TGCCGCGGCG
GTGACTCGCC ACTATCGCGC TCCCGGAGCC CCGAGTAGAC CCAAGCTGCT CGGTGTTGAG
CCGGAGAGAG CTGCTTGCAT GCTCTCCTCG GTTGAGGCCG GACACCCGGT TCAACTTCCC
GGACCGCACA CCTCTGTTAT GGCGGGTCTG AGTTGCGGCA CGCCGTCTCT TATCGCTTGG
CCTCTGGTCT CCAGAGGAGT GGACGTTTTT ATTGCCATAG AAGATGAGTG GGCAAAAGAG
GCGATGAGGG AGTTGGCCCG CTCAAACATA GTTTCTGGCG AGACGGGTGC GGCCGGGCTT
GCGGGACTTC TGGCGTTGCT GAAAAGCAAG ACCGGCGCGG AGGCTCGGAA ATTAATCGGG
CTTAACGAGG AGGCCAGTGT TCTGATATTC AACTGCGAGG GAGCAACGGA CCCGGAGTCT
TACGCACGCG TGGTCTACGG AGCAAACAAA GCTGTGTCCG AAAATCCTTG A
 
Protein sequence
MSRIRVLLNP LASASPVDAG PPGREPLEFH RRLPGYAPTP LMDTPRLADS LGVRRVWVKD 
ESHRLGLPAF KILGASWATY RALEEHARER LGSGFEPWQS IEELKEKVEH LKPLTLAAAT
DGNHGRAVAR MARLLGLRSR IFVPSEMVRA RIEAIESEGA EVVVVDGTYD DAVARSAEEA
AERCLVISDT SWPGYTSVPR WVIEGYSTIL WEIDRELERR EEKGPDLVVV QIGVGAFAAA
VTRHYRAPGA PSRPKLLGVE PERAACMLSS VEAGHPVQLP GPHTSVMAGL SCGTPSLIAW
PLVSRGVDVF IAIEDEWAKE AMRELARSNI VSGETGAAGL AGLLALLKSK TGAEARKLIG
LNEEASVLIF NCEGATDPES YARVVYGANK AVSENP