Gene Rpal_3235 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_3235 
SymboltrpD 
ID6410905 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp3480750 
End bp3481766 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content69% 
IMG OID642713111 
Productanthranilate phosphoribosyltransferase 
Protein accessionYP_001992212 
Protein GI192291607 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0547] Anthranilate phosphoribosyltransferase 
TIGRFAM ID[TIGR01245] anthranilate phosphoribosyltransferase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTGATT TCAAATCGAT TATCGCAAAG GTCGCGACCG GCGCGACGCT GACGCGCGAC 
GAAGCCACCG ACGCTTTCGA CGCAATGATG TCCGGCGACG CGACGCCGTC GCAGATGGGC
GCACTGCTGA TGGGCCTTCG GGTCCGCGGC GAAACCGTCG ACGAGATCAC CGGCGCGGTG
ACGACGATGC GCGCCAAGAT GCTGCCCGTC ACCGCGCCAC CGGACGCGGT CGACATCGTC
GGCACCGGCG GTGACGGCTC CGGCTCGGTC AACGTTTCGA CTTGCGCGTC GTTCGTGGTC
GCCGGCTGCG GCGTCACCGT CGCCAAGCAC GGCAACCGCG CGCTGTCGTC GAAATCCGGC
GCCGCCGACG TGCTCGCCGC GCTCGGCGTC AAGATCGACA TCACGCCCGA GCAGGTCGGC
CGCTGCGTCA ACGAAGCCGG CATCGGCTTC ATGTTCGCGC CGACGCATCA TCCGGCGATG
AAGAACGTCG GCCCCACCCG GGTCGAACTT GCGACCCGCA CCATCTTCAA TCTGCTCGGA
CCGCTGTCCA ACCCGGCCGG CGTCAAGCGC CAGATGATCG GCGTGTTCTC GCGGCAATGG
GTGCAGCCGC TCGCGCAGGT GCTGAAGAAC CTCGGCTCCG AAGCGGTCTG GGTGGTGCAC
GGTTCCGACG GCCTCGACGA AATCACGCTG TCCGGCACCA CCGCGGTCGC CGAGCTGAAG
AACGGCGAGA TCACCAGCTT CGAGATCAGC CCCGAGGACG CCGGCCTGCC CCGTGCGCCG
GCCGACGCGC TGAAGGGCGG CGACGCCCAG GCCAATGCGG TGGCGCTGCG CGCGGTGCTG
GAAGGCATGC CGGGGCCGTA TCGTGACGTC GCCCTGCTCA ACGCTGCCGC GACGCTGGTC
GTCGCCGGCA AGGCCCGCGA CCTGAAGGAA GGCGTCGCGC TCGGCACCCA GTCGATCGAC
AGCGGCGCCG CCGAAGCGCG GCTGAAGAAG CTGATCGCGG TGTCTGCGGC GGCCTAA
 
Protein sequence
MVDFKSIIAK VATGATLTRD EATDAFDAMM SGDATPSQMG ALLMGLRVRG ETVDEITGAV 
TTMRAKMLPV TAPPDAVDIV GTGGDGSGSV NVSTCASFVV AGCGVTVAKH GNRALSSKSG
AADVLAALGV KIDITPEQVG RCVNEAGIGF MFAPTHHPAM KNVGPTRVEL ATRTIFNLLG
PLSNPAGVKR QMIGVFSRQW VQPLAQVLKN LGSEAVWVVH GSDGLDEITL SGTTAVAELK
NGEITSFEIS PEDAGLPRAP ADALKGGDAQ ANAVALRAVL EGMPGPYRDV ALLNAAATLV
VAGKARDLKE GVALGTQSID SGAAEARLKK LIAVSAAA