Gene RPC_4388 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_4388 
Symbol 
ID3970442 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp4892572 
End bp4894734 
Gene Length2163 bp 
Protein Length720 aa 
Translation table11 
GC content67% 
IMG OID637927497 
Productanthranilate synthase 
Protein accessionYP_534230 
Protein GI90425860 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I
[COG0512] Anthranilate/para-aminobenzoate synthases component II 
TIGRFAM ID[TIGR00566] glutamine amidotransferase of anthranilate synthase or aminodeoxychorismate synthase
[TIGR01815] anthranilate synthase, alpha proteobacterial clade 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0144881 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCGAA CCGTATTCGC GCTGCCCGCG CGCAGCGATT ACGCCACCGC CGGCGGGCTT 
TCGGTGACGC GCAGCGTGCA GCAATTCACC GGCGGCGACG CGCTCGACAA TCTGATCGAC
CTGCTCGACC ACCGCCCCGG CGTGATGCTG TCGTCCGGCA CCACGGTGCC CGGCCGCTAC
GAGAGCTTCG ACCTCGGCTT CGCCGATCCG CCGCTGCGGC TGGTTTCCAC CGGCGACAAC
TTCGCGTTGA CCGCGCTGAA CGCCCGCGGC GAGGTGCTGC TGGCGTTCCT CGCCGCCACC
CTGCAGGAGC CCTGCGTGGT GATCGACAAA GTCGCCGGCC GCCAGATCGA CGGCCATATC
ATCCGCGGCG AAGCGCCGGT CGACGAGGAC CAGCGCACCA GGCGCGCCAG CATGATGTCG
CTGGTGCGGG CGCTGGTGGC GGCGTTCGCC TCGCCGGGCG ATCCGATGCT CGGGCTGTTC
GGCGCCTTCG CCTATGACCT GGTGTTTCAG TTCGAAGATC TGAAGCCAAA GCGCGCCCGC
GAAGCCGACC AGCGCGACAT CGTGCTGTAC GTCCCGGACC GGCTGTTGGC CTATGACCGC
GCCACCGGCC GCGGCGTGCA TCTGGCCTAC GAATTCTCCT GGAACGGCCG CTCCACCGAG
GGGCTGTCGC ACGACACCCC GGACAGCGTC TACGCCAAGA GCCCACGGCA GGGCTTTGCC
GATCACGCCC CCGGCGAATA TCAGGCCACG GTGGAGGTCG CCCGCGCGGC GTTCGCCCGC
GGCGATCTGT TCGAGGCGGT GCCCGGGCAA TTGTTCGCCG AGCCGTGCGA GCGCTCGCCG
GCGGAAGTAT TTCAGCGGCT CTGCCGGATC AATCCGTCGC CCTACGGCGC CTTGATGAAT
CTCGGCGACG GCGAATTTTT GGTGGCGGCG TCGCCGGAAA TGTTCGTGCG CTCGGACGGG
CGCCGCATCG AGACCTGCCC GATTTCCGGC ACCATCGCGC GCGGCGTCGA CGCCATCGGC
GACGCCGAGC AGATCCGGCA ATTGCTGAAT TCGGAGAAAG ACGAATTCGA GCTCAACATG
TGCACCGACG TCGACCGCAA CGACAAAGCG CGGGTCTGCG TGCCGGGCAC AATTAAAGTT
CTCGCGCGCC GCCAGATCGA AACCTATTCA AAACTGTTCC ACACCGTCGA CCACGTCGAG
GGCATGTTGC GGCCGGGCTT TGATGCGCTC GACGCCTTCC TGACCCACGC CTGGGCGGTG
ACGGTGACCG GGGCGCCGAA ATTATGGGCG ATGCAGTTCG TCGAGGACCA CGAGCGCTCG
AGCCGCCGCT GGTACGCCGG CGCGATCGGC TGCGTGAATT TCGACGGCAG CATCAACACC
GGGCTGACCA TCCGCACCAT CCGGATGAAG GACGGCCTCG CCGAAGTCCG CGTCGGCGCC
ACGCTATTGT TCGATTCCGA TCCGGTCGCC GAAGAGAAGG AATGTCAGAC CAAGGCCGCG
GCGCTGTTCC AGGCGCTGCG CGGCGATCCG CCGAAGCGGC TGTCGGCGCT GGCGCCGGAC
GCCTCGGGCT CCGGCAAGAA GGTGCTGCTG ATCGATCACG ACGACAGCTT CGTGCACATG
CTGGCGGATT ACTTCCGCCA GGTCGGCGCT CAGGTCACCG TGGTGCGCTA CATCCACGCG
CTGCCGATGC TGGCGAACAA CGACTACGAT CTGCTGGTGC TGTCGCCCGG CCCCGGCCGG
CCGGAGGACT TCAAGATCAA GGCGACGATC GACGCTGGGC TGCAGAAGAA CATGCCAATC
TTCGGGGTGT GCCTGGGCGT GCAGGCGATG GGCGAGTATT TCGGCGGCCA GCTCGGGCAA
TTGGCGCAGC CGGCGCATGG CCGGCCGTCG AAGATCCAGG TCCGCGGCGG CACGCTGATG
CGCGGCCTGC CGGACGAGAT CGTGATCGGC CGCTATCACT CGCTCTATGT CGAGCAGGAC
AGCATGCCCG AGGTGTTGGC CGTCACCGCC GCCACCGAGG ACGGCATCGC CATGGTGATC
GAGCACAAGA CCTTGCCGGT CGGCGGCGTG CAGTTTCATC CCGAATCGCT GATGTCGCTC
GGTGGCGAGG TCGGCCTGCG GATCGTCGAA AACGCCTTCC GGCTTGGTCT TCCGGCCAAT
TGA
 
Protein sequence
MNRTVFALPA RSDYATAGGL SVTRSVQQFT GGDALDNLID LLDHRPGVML SSGTTVPGRY 
ESFDLGFADP PLRLVSTGDN FALTALNARG EVLLAFLAAT LQEPCVVIDK VAGRQIDGHI
IRGEAPVDED QRTRRASMMS LVRALVAAFA SPGDPMLGLF GAFAYDLVFQ FEDLKPKRAR
EADQRDIVLY VPDRLLAYDR ATGRGVHLAY EFSWNGRSTE GLSHDTPDSV YAKSPRQGFA
DHAPGEYQAT VEVARAAFAR GDLFEAVPGQ LFAEPCERSP AEVFQRLCRI NPSPYGALMN
LGDGEFLVAA SPEMFVRSDG RRIETCPISG TIARGVDAIG DAEQIRQLLN SEKDEFELNM
CTDVDRNDKA RVCVPGTIKV LARRQIETYS KLFHTVDHVE GMLRPGFDAL DAFLTHAWAV
TVTGAPKLWA MQFVEDHERS SRRWYAGAIG CVNFDGSINT GLTIRTIRMK DGLAEVRVGA
TLLFDSDPVA EEKECQTKAA ALFQALRGDP PKRLSALAPD ASGSGKKVLL IDHDDSFVHM
LADYFRQVGA QVTVVRYIHA LPMLANNDYD LLVLSPGPGR PEDFKIKATI DAGLQKNMPI
FGVCLGVQAM GEYFGGQLGQ LAQPAHGRPS KIQVRGGTLM RGLPDEIVIG RYHSLYVEQD
SMPEVLAVTA ATEDGIAMVI EHKTLPVGGV QFHPESLMSL GGEVGLRIVE NAFRLGLPAN