Gene Rxyl_3032 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRxyl_3032 
Symbol 
ID4115967 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRubrobacter xylanophilus DSM 9941 
KingdomBacteria 
Replicon accessionNC_008148 
Strand
Start bp3040211 
End bp3041257 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content50% 
IMG OID638037801 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_645753 
Protein GI108805816 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000338526 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGAATA GATTACATAG TAGTACAGTT ACCAGGAGGG ACTTCCTCAA GGTTTTTTGC 
GGTGCTAGCG TAGGGTTTAT TGGGGCGGGA GGTCTGTTAA CCTCGTGCGG CTCTAGCTCG
AGCGGGAGCA AAAGAGCGAC CTCTGTAACT CACCAGCTTG GGTGGCTCAA GATTTCACAG
TTCTCTGGCT TCTTTGCAGG TCTCGAGAAG GGCTACTACA AAGATGAAGG CATAGCGGCA
AAGTTCAATG CCGGCGGACC TAACATTATC GCCTCCCAGG TCGTAGCGTC AGAGAGAGCA
TTGGTCGGTG ACGACGACAA TACCACTGTG CTCCAGGCTA TAGACAAGGG CCAACCCATT
GTAGTTTATG GCACTATTTT TCAAAAATCT CCATATGCGA TAATGAGTTA CAAAGACAAT
CCAATTAGAA CGTTACAGGA CTTCGCTGGA AAAACTATAG CCTTGAGCGA AGCGACACGT
CCGCAGCTCA CCCCTTTGCT TGAAAAGGCA GGAGTTGATC TCAAAGAGGT TAAATACGTT
CCAGCAGGCC CTGATCCCTC GCAGCTTGCC AGCAGGCAAG TGGATGGGTA CTTCGGATAC
GCTACCTCAG AAGGTGTAGC ACTTAAACAG CAGGGACTAG ATATAATTGT TACTTATTTC
AACGACCTTG GCTTTCCTAG CTATGCCAAC GTGCTCATAA CGCAGCCTTC TGCTGTTAAG
GACAATCAGG ATACGCTTGT CCGTTTTCTG CGAGCAAGCA TAAGGGGCTG GGAGTATTCG
CTGGCGCACC CGGAGGAGAT GGGCGAACTA GTAGCCAAGA AGTATGGACC CGAGGGACTA
GACGTTGAGA CAGAAATAGC AGTCCACAAA GCTCAGGCAC CGCTTATTAG AAGTCCTAAT
GGTCCTCTTT GGATTGATCG CGACAAAATG GAAGCCGTAA TCAAGGCTGC GGCTAATGCA
GGGTCTATAT CTAAGGTACT GCCGGTTGAC GAGGTTATGA CCACGGAGAT TTGGCAGAAG
GCTTCTGGCG GTTCTGGTGG GGAATAA
 
Protein sequence
MSNRLHSSTV TRRDFLKVFC GASVGFIGAG GLLTSCGSSS SGSKRATSVT HQLGWLKISQ 
FSGFFAGLEK GYYKDEGIAA KFNAGGPNII ASQVVASERA LVGDDDNTTV LQAIDKGQPI
VVYGTIFQKS PYAIMSYKDN PIRTLQDFAG KTIALSEATR PQLTPLLEKA GVDLKEVKYV
PAGPDPSQLA SRQVDGYFGY ATSEGVALKQ QGLDIIVTYF NDLGFPSYAN VLITQPSAVK
DNQDTLVRFL RASIRGWEYS LAHPEEMGEL VAKKYGPEGL DVETEIAVHK AQAPLIRSPN
GPLWIDRDKM EAVIKAAANA GSISKVLPVD EVMTTEIWQK ASGGSGGE