Gene Rxyl_2115 details

Gene Information

Locus tag: Rxyl_2115
Symbol:
ID: 4114711
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rubrobacter xylanophilus DSM 9941
Kingdom: Bacteria
Replicon accession: NC_008148
Strand:
Start bp: 2143995
End bp: 2145524
Gene Length: 1530 bp
Protein Length: 509 aa
Translation table: 11
GC content: 72%
IMG OID: 638036901
Product: anthranilate synthase, component I
Protein accession: YP_644871
Protein GI: 108804934
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID: [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages
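
The start, end, gene length and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2143995
END_BP = 2145524


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming the CDS ends with exactly one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))  # prints: 1530 509
```

Under these assumptions the computed values agree with the listed Gene Length (1530 bp) and Protein Length (509 aa).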


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.332687
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGACAGGCA CCGCTAGGCT GGAGCTGGTC CCCTCTTTGG GCGAGGCGCG GAGGCTCGCC 
CGCGCCCACG ACGTGGTCCC CGTGTACGCC GAGTTCATCG GGGACCTGGA GACCCCCATC
TCCGCGGTGT TGCGGTTCGC CGGCGAGGAG CACGTCTTTT TGCTGGAGAG CGCCGAGGCG
GCCGAGCGCT TCGGGCGCTA CTCCTTTCTC GGCTTCGACC CAAAGCGCAC CCTCTCCTAC
CGGCGGGGGA CCTACACCGT GGTGGACGCC GACGGGGTGC GGGAGCTCCC CGCGAAGGAC
CCCTTCCGGG GGCTCGCCGC CATCGTGGGG CGAAAGAGCG TCGCCCCGCT GCCCCACCTT
CCGGCCTTCG TCGGGGGGGC GGTGGGCTAC TTCGCCTACG ACGCTGTGCG CTACCTGGAG
CGGCTGCCGG AGGCCCCCCC GGACGACCTC GGCGTCCCGG AGGCGTACTT CGCCATCACC
GACACGCTGG TGGTCTTCGA CCACCTCAGG CACAAGGTGC TGGTGATCTC GCTGGTCGAC
GCCTCCAGGC TGCGCGACGT GCAGGGCGAG GGGTTCGCCG CGGCCTACCG CCGGGCCGCC
GACGACATCC GGCGGGTTGC CGAGCAGCTC GCGGCCCCGC TCGAGCGCGG GAGGGGCCTC
TCCTCCGGCC CGCCGGGGAG GCTCGAGATC TCCTCCAACT TCACCCGCGG GGCCTACGAG
GCGGCCGTCG AGCGGGCCAA GGAGTACATC CGGGCGGGGG ACGCCTTCCA GATAGTGCCC
TCCCAGCGCT TTGCGGCCGA GGTGGGCGAC CTGGACCCGC TGCTGCTCTA CCGGGGGCTC
AGGACGGTGA ACCCCTCCCC CTATATGACC TACCTGAAGT TCGGTGACCT GGCGCTGGTG
GGGGCCTCCC CGGAGCCGCT GGTGCGGGTC GAGGGGCGGC GGGTGATGAC CCGCCCCATA
GCGGGCACCC GGCGGCGCGG GGAGAGCCCG GAGGAGGACG CGGCGCTGGC CGGGGAGCTG
CTCGCCGACG CCAAGGAGCG GGCGGAGCAC GTGATGCTCG TGGACCTCGG GCGCAACGAT
CTGGGGCGGG TCTGCGAGGT CGGAAGCGTG GAGGTCACGA GCTTTATGGA GATAGAGCGC
TACTCGCACG TGATGCACAT CGTCTCCACG GTGGAGGGAA ACCTGCGGGA GAACCTCACG
GCGCTCGACG CCCTCGCCGC GGCCTTCCCC GCGGGGACCG TCTCGGGGGC CCCGAAGGTG
CGGGCGATGG AGATCATCGA CGAGCTCGAG CCAACCCGCC GCGGGCCCTA CGCGGGGGCC
ACCGGCTACT ACGGGGTGGA CGGGCGGCTG GACACCTGCA TCACCCTGCG CACGGCGCTG
CTGAAGGGCG GCCGCGCCTA CTTCCAGGCC GGCGGCGGGG TGGTCGCCGA CTCGGTCCCG
AAGCTGGAGT ACGAGGAGAC CCGCAACAAG GCGCGGGCGA TGGAGCGGGC GCTGGAGGTG
GCCAGGAGCC CGCGGCTCTG GCTGGGCTGA
 
Protein sequence
MTGTARLELV PSLGEARRLA RAHDVVPVYA EFIGDLETPI SAVLRFAGEE HVFLLESAEA 
AERFGRYSFL GFDPKRTLSY RRGTYTVVDA DGVRELPAKD PFRGLAAIVG RKSVAPLPHL
PAFVGGAVGY FAYDAVRYLE RLPEAPPDDL GVPEAYFAIT DTLVVFDHLR HKVLVISLVD
ASRLRDVQGE GFAAAYRRAA DDIRRVAEQL AAPLERGRGL SSGPPGRLEI SSNFTRGAYE
AAVERAKEYI RAGDAFQIVP SQRFAAEVGD LDPLLLYRGL RTVNPSPYMT YLKFGDLALV
GASPEPLVRV EGRRVMTRPI AGTRRRGESP EEDAALAGEL LADAKERAEH VMLVDLGRND
LGRVCEVGSV EVTSFMEIER YSHVMHIVST VEGNLRENLT ALDALAAAFP AGTVSGAPKV
RAMEIIDELE PTRRGPYAGA TGYYGVDGRL DTCITLRTAL LKGGRAYFQA GGGVVADSVP
KLEYEETRNK ARAMERALEV ARSPRLWLG
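
The gene sequence begins with GTG while the protein sequence begins with M, which is what translation table 11 (the bacterial, archaeal and plant plastid code) gives when GTG is used as a start codon. The sketch below shows one way to reproduce the translation and to compute GC content using only the Python standard library. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical. The codon table is built from the NCBI amino-acid string for table 11, and the start-codon set is the one NCBI lists for that table.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for NCBI translation table 11, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Codons that table 11 allows as initiators; an initiator is read as M.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS (or its 5' fragment) in frame 1 with table 11."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    protein = [CODON_TABLE[c] for c in codons]
    if codons and codons[0] in START_CODONS:
        protein[0] = "M"  # first codon is treated as the initiator
    if protein and protein[-1] == "*":
        protein.pop()  # drop the terminal stop codon
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    first_line = ("GTGACAGGCA CCGCTAGGCT GGAGCTGGTC "
                  "CCCTCTTTGG GCGAGGCGCG GAGGCTCGCC")
    print(translate(first_line))  # prints: MTGTARLELVPSLGEARRLA
```

The example translates only the first 60 bases of the gene sequence, which correspond to the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` allows the listed protein sequence and GC content to be compared against the computed values. Because `translate` always treats the first codon as a potential initiator, it is meant for sequences that start at the annotated start of the CDS.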