Gene Rxyl_3052 details


Gene Information

Locus tag: Rxyl_3052
Symbol:
ID: 4114852
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rubrobacter xylanophilus DSM 9941
Kingdom: Bacteria
Replicon accession: NC_008148
Strand:
Start bp: 3059844
End bp: 3061223
Gene Length: 1380 bp
Protein Length: 459 aa
Translation table: 11
GC content: 62%
IMG OID: 638037820
Product: permease for cytosine/purines, uracil, thiamine, allantoin
Protein accession: YP_645772
Protein GI: 108805835
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG1457] Purine-cytosine permease and related proteins
TIGRFAM ID: [TIGR00800] NCS1 nucleoside transporter family
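
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 3059844
END_BP = 3061223
GENE_LENGTH_BP = 1380
PROTEIN_LENGTH_AA = 459


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 3061223 - 3059844 + 1 gives 1380 bp, and 1380 / 3 gives 460 codons, i.e. 459 amino acids plus the stop codon.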


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.700557
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGGCAG AGACCCACCA CATAGACGTC ATTCCGGAGG ATGAGCGGCA CGGGAGGGCG 
CGCGATCTGT TTTTCGTGTG GTTTGCGGCG AACTTCAACA TCGGCAACGC AGTCTTCGGG
GCGGTGGCGG TTTTTCTGGG TAACGATCCG TTGTGGGCAA TGCTCGCGGT GATAGTGGGA
AACCTGCTTG GCGGGGTGTT CATGGCGTAC CACTCGGCGC AAGGCCCACA GCTCGGGGTC
CCACAGCTCA TCCAGAGCCG CGGCCAGTTC GGTTACTATG GGGCGCTCAT GCCCGTGGGG
CTGGCGGTGT TGTTGTACGG AGGCTTCTTT GTGCTCACGG CGGTCATAGC AGGGCAGGCG
CTCACGGCGG TATTTCCGGG TCTGAGCCTG GATCTCGCGA TAGTCATCGG GGCTACGCTC
AGCCTCGTGC TCGCGCTCTT CGGCTACAAC GCCATCCACA GGGCAGCGCA GATCGGCACC
TGGCCGCTCG CCATCCCGGT CGTGATGCTC ACCGTCGCCA CGCTGGGAGA GGGCACGCCA
GAACTCACGC CGTCCGGGTT CCAGATCGGA CCCTTCGCCC TCGCCGTGGC GCTCTCGGCG
ACCTTCCAGC TCACCTACGC GCCCTACGTC TCCGACTACT CGCGTTACCT CCCGAGCGAT
ACGAAGGTTT CTGCCACGTT CTGGTGGACC TTCCTCGGCG TCACCACGAG CGTCATATGG
ACCCAGCTCA TCGGGGTTCT CCTCGCTTTC CAGTTTGAGA ACCTTACCAC CTTCGACGCG
GCCAAGAAGC TCCTTGGGAC GAACGTCCTG ACCGCGGTGA TACTGCTTAT CAGCGGGGCC
GCCATCGCGG GCAACAACGC GCTCAATCTT TATGGCGGGA TGCTCAACTT GGTGCCGAGG
GGGTTCAAGA TGCGGGCGCT GCTCATACTG CCCACCTTCG TCGTCGGCAC CGCGCTCGCC
ATCCTCGCCT CCAGAGACTT CATCGCCACC CTCACCAACT TCCTCAGCCT CCTGCAGCTG
ACCTTCGTGC CGTGGGGCGC GATCAATCTC ACCGACTTCT ACCTCGTCAA AAAGGGGCGC
TACGATGTCG GCGCCTTCTT CGAGCCCCGC GGCCCCTACT ACAGGGACGA GGCCTCCTGG
ACTTTCCACG GCATCGCCTG GAAGGCCATA CTGTGTTACT TGGTTGGGAT AATCGTGCAG
GTGCCGTTCC TGAACAACGC TTGGTTCAAG GGCTGGCTGA CAGACCCTCT CGGCGGCGGC
GATTTCTCCT TCATCTTCGG CCTCGTCGTG CCCGCGGTCC TCTACTACGT GCTGATGCGC
CCGCGAAGAA CAAACGTACG GAACGCCACA GAATTAACCG AAGCGGCGGG AGGACCCTAG
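
The gene sequence above is printed in space-separated groups of 10 bases, so it has to be cleaned before any computation. The sketch below shows one way to do that and to compute a GC percentage that could be compared against the "GC content" field. It is illustrative only: the helper names are hypothetical, and how the database rounds its own figure is not stated in the record.

```python
# First line of the gene sequence, copied as printed (with grouping spaces).
FIRST_LINE = "ATGGCGGCAG AGACCCACCA CATAGACGTC ATTCCGGAGG ATGAGCGGCA CGGGAGGGCG"


def clean(seq_block: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq_block.split()).upper()


def gc_percent(seq_block: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean(seq_block)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    print(len(clean(FIRST_LINE)))           # number of bases on one printed line
    print(round(gc_percent(FIRST_LINE), 1))  # GC percentage of that line only
```

To check the whole gene, paste all sequence lines into one string and pass it to `gc_percent`; the cleaned length should then equal the gene length given in the record.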
 
Protein sequence
MAAETHHIDV IPEDERHGRA RDLFFVWFAA NFNIGNAVFG AVAVFLGNDP LWAMLAVIVG 
NLLGGVFMAY HSAQGPQLGV PQLIQSRGQF GYYGALMPVG LAVLLYGGFF VLTAVIAGQA
LTAVFPGLSL DLAIVIGATL SLVLALFGYN AIHRAAQIGT WPLAIPVVML TVATLGEGTP
ELTPSGFQIG PFALAVALSA TFQLTYAPYV SDYSRYLPSD TKVSATFWWT FLGVTTSVIW
TQLIGVLLAF QFENLTTFDA AKKLLGTNVL TAVILLISGA AIAGNNALNL YGGMLNLVPR
GFKMRALLIL PTFVVGTALA ILASRDFIAT LTNFLSLLQL TFVPWGAINL TDFYLVKKGR
YDVGAFFEPR GPYYRDEASW TFHGIAWKAI LCYLVGIIVQ VPFLNNAWFK GWLTDPLGGG
DFSFIFGLVV PAVLYYVLMR PRRTNVRNAT ELTEAAGGP
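
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a minimal, dependency-free illustration and not the database's own method. It builds the codon table from the NCBI amino-acid string for the standard code, on the understanding that translation table 11 assigns the same amino acids to codons and differs only in the permitted start codons. The `translate` helper and the fragment constants are hypothetical names. The fragments are the first 30 bases and the last printed line of the gene sequence.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Fragments copied from the gene sequence above; both start in frame.
GENE_START = "ATGGCGGCAG AGACCCACCA CATAGACGTC"
GENE_END = "CCGCGAAGAA CAAACGTACG GAACGCCACA GAATTAACCG AAGCGGCGGG AGGACCCTAG"


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    print(translate(GENE_START))
    print(translate(GENE_END))
```

The two translated fragments can be compared with the first 10 and last 19 residues of the protein sequence listed above.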