Gene RPB_2797 details


Gene Information

Locus tag: RPB_2797
Symbol: trpD
ID: 3910590
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 3188470
End bp: 3189495
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 68%
IMG OID: 637884697
Product: anthranilate phosphoribosyltransferase
Protein accession: YP_486410
Protein GI: 86749914
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0547] Anthranilate phosphoribosyltransferase
TIGRFAM ID: [TIGR01245] anthranilate phosphoribosyltransferase
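
The Start bp, End bp, Gene Length and Protein Length fields are related by simple arithmetic, which makes a quick consistency check possible. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends with a stop codon. Neither assumption is stated in the record, though both agree with the reported values. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 3188470
END_BP = 3189495


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the final codon of the CDS is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(n, protein_length(n))  # prints: 1026 341
```

Under these assumptions the computed values match the Gene Length (1026 bp) and Protein Length (341 aa) fields.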


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.573917
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGGACCA TGATCGATTT CAAGGCAGTT ATCGCGAAGG TCGCGACGGG GGCGTCGCTG 
ACCAGGGACG AAGCGGCGAA CGCGTTCGAC GCGATGATGT CGGGCGACGC GACGCCGTCG
CAGATGGGTG CATTGCTGAT GGGCCTGCGA GTCCGCGGCG AGACCGTCGA CGAAATCACC
GGCGCGGTGA CGACGATGCG CGCCAAGATG CTGACGGTCG CAGCGCCGCC CGACGCGGTC
GACGTCGTCG GCACCGGCGG CGACGGCTCC GGCTCGGTCA ACGTCTCGAC CTGCACTTCG
TTTGTCGTGG CCGGCTGCGG CGTTCCGGTC GCCAAGCACG GCAACCGCGC GCTGTCGTCG
AAATCCGGCG CCGCCGACGT GCTCAATGCC CTCGGCGTCA AGATCGACAT CACCCCGGAC
CACGTCGGCC GCTGCGTGGC GGAGGCCGGC ATCGGCTTCA TGTTCGCGCC GACGCATCAT
CCGGCGATGA AGAACGTGGG TCCTACCCGC GTCGAGCTCG CTACCCGCAC GATCTTCAAT
CTGCTCGGGC CGCTGTCGAA TCCCGCCGGC GTCAAGCGCC AGATGATCGG CGTGTTCTCG
CGGCAATGGG TGCAGCCGCT GGCGCAGGTG CTGCAGAATC TCGGCTCAGA ATCGATCTGG
GTGGTGCACG GCTCCGACGG GCTCGACGAG ATCACCCTGT CCGGCCCGAC CGCCGTCGCC
GAATTGAAGA ACGGCGAGAT CAGGACCTTC GAGATCGGCC CCGAGGACGC CGGCCTGCCC
CGCGCGCCGG CCGACGCGCT GAAGGGCGGC GATGCCGAGG CCAATGCGGT GGCGCTGCGC
GCCGTGCTGG AAGGCATGCC GGGGCCGTAT CGCGACGTCG CGCTGCTCAA CGCCGCGGCG
ACGCTGATCG TCGCCGGCAA GGCGAAGGAT CTCAAGGAAG GCGTCGCGCT CGGCGCCCAA
TCGATCGACA GCGGCGCCGC CGAAGCACGT TTGAAAAAGC TGATCGCGGT ATCGGCGGCC
GCCTAA
 
Protein sequence
MGTMIDFKAV IAKVATGASL TRDEAANAFD AMMSGDATPS QMGALLMGLR VRGETVDEIT 
GAVTTMRAKM LTVAAPPDAV DVVGTGGDGS GSVNVSTCTS FVVAGCGVPV AKHGNRALSS
KSGAADVLNA LGVKIDITPD HVGRCVAEAG IGFMFAPTHH PAMKNVGPTR VELATRTIFN
LLGPLSNPAG VKRQMIGVFS RQWVQPLAQV LQNLGSESIW VVHGSDGLDE ITLSGPTAVA
ELKNGEIRTF EIGPEDAGLP RAPADALKGG DAEANAVALR AVLEGMPGPY RDVALLNAAA
TLIVAGKAKD LKEGVALGAQ SIDSGAAEAR LKKLIAVSAA A
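
The gene and protein sequences above can be cross-checked by translating the DNA with the codon assignments of translation table 11. The sketch below uses only the Python standard library. The helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the codon table is built from the standard amino-acid string in TCAG order. Table 11 also permits alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG. Only the first 60 bases of the gene are used as example input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the whitespace that groups the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    first_line = (
        "ATGGGGACCA TGATCGATTT CAAGGCAGTT "
        "ATCGCGAAGG TCGCGACGGG GGCGTCGCTG"
    )
    print(translate(first_line))  # prints: MGTMIDFKAVIAKVATGASL
    print(gc_content(first_line))
```

The translated fragment matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` allows comparison with the GC content field.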