Gene Rcas_2153 details

Gene Information

Locus tag: Rcas_2153
Symbol: trpD
ID: 5539633
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 2764760
End bp: 2765782
Gene Length: 1023 bp
Protein Length: 340 aa
Translation table: 11
GC content: 67%
IMG OID: 640894286
Product: anthranilate phosphoribosyltransferase
Protein accession: YP_001432255
Protein GI: 156742126
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0547] Anthranilate phosphoribosyltransferase
TIGRFAM ID: [TIGR01245] anthranilate phosphoribosyltransferase
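
The coordinate and length fields above are mutually consistent, which a short sketch can check. It assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count of an unspliced CDS, assuming its last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2764760, 2765782)
print(length, protein_length_aa(length))  # prints: 1023 340
```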


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.286108
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00266226
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCCAATCC GTGATCAGAT TATCCAGATC GTTCGGGGTC ATGATCTCAC CGAAGAGCAG 
GCTGCCGAAG CGATGGAAGA AATTATGACC GGCGTGGCGA CCCCGGCGCA GGTCGCGGCG
CTGCTCACAG CGCTCCACCT GAAGGGCGAA ACCGACGCCG AGATCGCCGG CATGGCGCGG
GTTATGCGCG CCAAAGCCAT CCCCGTCCAC TTCGACGGTC CGCTGCTCGA CACATGCGGC
ACCGGCGGCG ACAGCGCCGG CACGTTCAAC ATTTCGACGA CCGCCGCGTT CATCGCGGCA
GGCGCCGGCG CAACGGTCGC CAAGCACGGC AACCGTGCCA TGTCGAGTGT CTGCGGCTCT
GCCGACGTGC TCGAAGGGCT GGGGGTCACC ATCGATCTCG ACGCCGCTGG CGTGGCGCGC
TGTCTCGAAC AGGCGGGCAT TGGGTTCATG TTCGCACAGA AGTTCCATCC GGCGATGCGC
TTTGTCGGAC CGGTGCGCCG TGAGATCGGC ATCCGCACCA TCTTCAACGC CCTCGGTCCG
TTGAGCAACC CGGCGCAGGC ACGCCACCAG ACGCTTGGTG TCGCCGATCC GGCGCTGGCG
GAGAAGATGG CGCGCGCGCT TTACCTTCTC GGCGCGCAGC ATGCTCTGGT CGTTCATGGG
CACGGCGGGC TGGATGAACT GACCCTGAGC GGACCGAACC TCGTCATCGA AGTGCGTGCC
GGTCACAAGC CGCGACGGTA TGAGGTCAGC GCCGGCGACC TGGGGCTGAC GCCTGCCCCG
CGCGAGGCGC TGCTCGGCGG CGATGTATCG ACGAACGTGG CGATTGTTCG CGCCATTCTC
AGCGGAGAAG AACGGGGAGC ACGGCGCGAC GTGGCGTTGC TGAACGCCGC CGCCGCTCTT
GTTGCCGCCG ACTACGCCGC CGACCTGCGC GAGGGGTTGC AGCAGGCGCG GCAGAGCCTT
GAGAGTGGCG CCGCCCTGGC GCGCCTGGAG CGGCTTATCA CGGTCAGTAG CATCAACCGT
TGA
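
A GC content figure such as the one reported for this gene is conventionally the percentage of G and C bases among all bases in the sequence. The sketch below shows that calculation under the assumption that spaces and line breaks are layout only and that only A, C, G and T are counted. The function name is hypothetical, and the exact rounding used by the source database is not known.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of a sequence."""
    bases = [base for base in sequence.upper() if base in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc_count = sum(base in "GC" for base in bases)
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence above; whitespace is ignored.
first_row = "ATGCCAATCC GTGATCAGAT TATCCAGATC GTTCGGGGTC ATGATCTCAC CGAAGAGCAG"
print(round(gc_percent(first_row), 1))
```

Passing the full gene sequence instead of one row gives the value for the whole gene.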
 
Protein sequence
MPIRDQIIQI VRGHDLTEEQ AAEAMEEIMT GVATPAQVAA LLTALHLKGE TDAEIAGMAR 
VMRAKAIPVH FDGPLLDTCG TGGDSAGTFN ISTTAAFIAA GAGATVAKHG NRAMSSVCGS
ADVLEGLGVT IDLDAAGVAR CLEQAGIGFM FAQKFHPAMR FVGPVRREIG IRTIFNALGP
LSNPAQARHQ TLGVADPALA EKMARALYLL GAQHALVVHG HGGLDELTLS GPNLVIEVRA
GHKPRRYEVS AGDLGLTPAP REALLGGDVS TNVAIVRAIL SGEERGARRD VALLNAAAAL
VAADYAADLR EGLQQARQSL ESGAALARLE RLITVSSINR
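
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a minimal translator, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons), and it assumes the sequence starts in frame and contains only A, C, G and T. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in the usual T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    clean = "".join(dna.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(clean) - 2, 3):
        amino_acid = CODON_TABLE[clean[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 33 bases of the gene sequence in this record.
print(translate("ATGCCAATCC GTGATCAGAT TATCCAGATC"))      # prints: MPIRDQIIQI
print(translate("CGGCTTATCA CGGTCAGTAG CATCAACCGT TGA"))  # prints: RLITVSSINR
```

Both fragments match the first and last ten residues of the protein sequence above.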