Gene RSP_2001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSP_2001 
SymboltrpD 
ID3719334 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides 2.4.1 
KingdomBacteria 
Replicon accessionNC_007493 
Strand
Start bp599260 
End bp600276 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content72% 
IMG OID640070164 
Productanthranilate phosphoribosyltransferase 
Protein accessionYP_352052 
Protein GI77462548 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0547] Anthranilate phosphoribosyltransferase 
TIGRFAM ID[TIGR01245] anthranilate phosphoribosyltransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.481302 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGACC GGCTGAAGCC CCTGATCGGC ACCGCGGCCA CCCGCCCCCT CAGCCGCGAG 
GAGGCCGAGT TCGCCTTCGA GTGCCTGTTC GAGGGCGAGG CCACGCCCGC GCAGATGGGG
GGCCTGCTGA TGGCGCTGCG GACCCGCGGC GAGACGGTGG ACGAATATGC CGCCGCCGCC
TCGGTCATGC GGGCCAAGTG CCACAAGGTG CGCGCCCCGC ACGGCGCCAT CGACATCGTG
GGCACCGGGG GCGACGGCAA GGGCACGCTG AACATCTCGA CCGCCACGGC CTTCGTGGTG
GCGGGGGCGG GCGTGCCGGT CGCCAAGCAC GGCAACCGCA ACCTCTCGTC GAAGTCCGGC
GCCGCCGATG CGCTTACCGA GATGGGCCTC AATGTCATGA TCGGCCCCGA ACAGGTCGAG
GCCTGCCTGC TGGAGGCCGG GATCGGCTTC ATGATGGCAC CGATGCACCA TCCGGCCATG
CGCCATGTCG GGCCGGTGCG GGCCGAGCTC GGGACGCGGA CGATCTTCAA CATCCTCGGG
CCGCTGACCA ATCCGGCGGG GGTGAAGCGC CAGCTGACCG GCGCCTTCTC GCCCGACCTC
ATCCGGCCGA TGGCCGAGGT GCTCTCCGCG CTCGGCTCCG AGAAGGCATG GCTCGTCCAT
GGCGGCGACG GGACGGACGA GCTCGCGATC TCGGCCGCCT CGAAGGTCGC GGCGCTCGAG
GGCGGGCAGA TCCGCGAATT CGAACTGCAT CCCGAGGAGG CGGGTCTGCC CGTCCATCCG
TTCGAGGAGA TCGTGGGCGG CACACCCGCC GAGAATGCGC AGGCCTTCCG CGCGCTGCTC
GACGGCGCGC CGGGCGCCTA CCGCGATGCG GTGCTGCTGA ATGCGGCGGC GGCGCTCGTG
GTGGCCGACC GCGCGGCGCA TCTGCGCGAA GGGGTGGAGA TCGCCACCGA CAGCATCCTG
TCCGGTGCCG CCAAGGCGAA GGTCGCCCTG CTGGCCCGGC TGACGAACGC CGCCTGA
 
Protein sequence
MSDRLKPLIG TAATRPLSRE EAEFAFECLF EGEATPAQMG GLLMALRTRG ETVDEYAAAA 
SVMRAKCHKV RAPHGAIDIV GTGGDGKGTL NISTATAFVV AGAGVPVAKH GNRNLSSKSG
AADALTEMGL NVMIGPEQVE ACLLEAGIGF MMAPMHHPAM RHVGPVRAEL GTRTIFNILG
PLTNPAGVKR QLTGAFSPDL IRPMAEVLSA LGSEKAWLVH GGDGTDELAI SAASKVAALE
GGQIREFELH PEEAGLPVHP FEEIVGGTPA ENAQAFRALL DGAPGAYRDA VLLNAAAALV
VADRAAHLRE GVEIATDSIL SGAAKAKVAL LARLTNAA