Gene RPD_4213 details


Gene Information

Locus tag: RPD_4213
Symbol:
ID: 4024734
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 4678953
End bp: 4681115
Gene Length: 2163 bp
Protein Length: 720 aa
Translation table: 11
GC content: 67%
IMG OID: 637964419
Product: anthranilate synthase
Protein accession: YP_571331
Protein GI: 91978672
COG category: [E] Amino acid transport and metabolism
              [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
        [COG0512] Anthranilate/para-aminobenzoate synthases component II
TIGRFAM ID: [TIGR00566] glutamine amidotransferase of anthranilate synthase or aminodeoxychorismate synthase
            [TIGR01815] anthranilate synthase, alpha proteobacterial clade
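
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Expected amino-acid count for an unspliced CDS ending in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(4678953, 4681115)
print(length)                  # 2163
print(protein_length(length))  # 720
```

Both values agree with the Gene Length and Protein Length fields of this record.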


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.848003
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAGGA CCGTTTTCTC GCTACCCAAG CACAGCGACC ACGCCACCGC CGCGGGCCTT 
GGCGTGTCGC GCACGGTGGA GGCCTTCACC GGCGGCCAGG CGCTCGATGA CCTGATCGAC
CTGCTGGATC GTCGCCGCGG CGTGATGCTG TCGTCCGGCA CCACGGTGCC GGGGCGCTAC
GAGAGCTTCG ATTTCGGCTT CGCCGATCCG CCGCTGATGC TGACTACCCG AGCCGATCAG
TTTTCGATCG AGGCGCTGAA CGCACGCGGC CAGGTGCTGA TCGCGTTCCT GTCCGACCGG
CTCGAAGAGC CCTGCGTCGT GGTCGAGCAG GCCTGCGCGA CCAAGATCAG GGGCCACATC
GTCCGCGGCG AGGCGCCGGT GGACGAGGAC CAGCGCACCC GCCGCGCCAG CGCGATGTCG
CTGGTGCGCG CGCTGGTCGC GGCGTTCGCC TCGCCGGCCG ATCCGATGCT CGGGCTGTAC
GGCGCCTTCG CCTATGATCT GGTGTTTCAG ATCGAGGATC TGAAGCAGAA ACGTGCGCGC
GAGGCCGACC AGCGCGACAT CGTGCTGTAT GTGCCGGATC GCCTGCTGGC CTACGACCGC
GCCACCGGCC GCGGCGTCAA TATCGCCTAT GAATTCGCCT GGAAAGGCAA GTCCAGCTCG
GGCCTGTCGA CCGCGACCGC GGAGAGCGTC TACACCCAGA CCGGCCGGCA GGGTTTCGCC
GACCATGCGC CGGGCGAATA CGCCAAGGTG GTCGAGACGG CGCGCGAATC CTTCGCCCGC
GGCGACCTGT TCGAGGCGGT GCCCGGGCAG TTGTTCGGCG AGCCGTGCGA GCGCTCGCCG
GCCGAAGTGT TCAAGCGGCT GTGCCGGATC AACCCGTCTC CTTACGGTGG CCTGCTCAAT
CTCGGCGACG GCGAATTCCT GGTGTCGGCC TCGCCGGAAA TGTTCGTCCG CTCCGACGGC
CGCCGGGTCG AGACCTGCCC GATCTCCGGC ACCATCGCCC GCGGTGTCGA CGCCATCGCG
GACGCCGAGC AGATCCAGAA GCTCCTGAAC TCCGAGAAGG ACGAGTTCGA GCTGAACATG
TGCACCGATG TCGATCGCAA CGACAAGGCG CGGGTCTGCG TGCCGGGCAC GATCAAGGTT
CTGGCGCGAC GCCAGATCGA GACCTACTCG AAACTGTTCC ACACCGTCGA TCACGTCGAG
GGCATGCTGC GGCCCGGCTT CGACTCGCTC GACGCCTTCC TGACCCACGC CTGGGCGGTG
ACGGTGACCG GCGCGCCGAA ATTATGGGCG ATGCAGTTCG TCGAGGATCA CGAGCGGACG
CCGCGGCGCT GGTACGCCGG CGCATTCGGG GTCGTCGGCT TCGACGGTTC GATCAACACC
GGCCTGACCA TCCGCACCAT CCGGATGAAG GACGGCCTCG CCGAGGTCCG TGTCGGCGCG
ACCTGTCTGT TCGATTCCGA TCCGGTCGCC GAGGACAAGG AATGCCAGGT CAAGGCCGCG
GCGCTGTTCC AGGCGCTGCG CGGCGATCCG CCGAAGCCGC TGTCGGCGGT GGCGCCGGAC
GCCACCGGCT CCGGCAAGAA GGTGCTGCTG ATCGATCACG ACGACAGCTT CGTCCACATG
CTGGCGGATT ACTTCCGCCA GGTCGGCGCC CAGGTCAGCG TCGTGCGCCA CATCCATGCG
CAGAAGATGC TGGCCGACAA TGCCTATGAT CTGCTGGTGC TGTCGCCTGG CCCCGGCCGG
CCGGAGGACT TCAAGATCAA GTCCACGATC GACGCCGCGC TGGCGCGGAA GCTGCCGATC
TTCGGCGTCT GCCTCGGCGT TCAGGCGATC GGTGAGTACT TTGGCGGCCA TCTCGGCCAG
CTCGCGCAGC CGGCGCATGG CCGGCCGTCG CGGATTCAGG TGCGCGGCGG CACGCTGATG
AACGGCCTCC CGAACGAGAT CACGATCGGC CGCTATCACT CGCTGTTCGT CGAGATGCAG
GACATGCCCG ACACGCTCAA GGTCACCGCC TCGACCGAGG ACGGCATCGC GATGGCGATC
GAGCACAAAT CGCTTCCGGT GGGCGGCGTG CAGTTCCACC CGGAGTCGCT GATGTCGCTG
GGCGGCCAGG TGGGCCTGCG GATCGTCGAA AACGCCTTCC GCCTCGGCCT GACGCCGAAC
TGA
 
Protein sequence
MNRTVFSLPK HSDHATAAGL GVSRTVEAFT GGQALDDLID LLDRRRGVML SSGTTVPGRY 
ESFDFGFADP PLMLTTRADQ FSIEALNARG QVLIAFLSDR LEEPCVVVEQ ACATKIRGHI
VRGEAPVDED QRTRRASAMS LVRALVAAFA SPADPMLGLY GAFAYDLVFQ IEDLKQKRAR
EADQRDIVLY VPDRLLAYDR ATGRGVNIAY EFAWKGKSSS GLSTATAESV YTQTGRQGFA
DHAPGEYAKV VETARESFAR GDLFEAVPGQ LFGEPCERSP AEVFKRLCRI NPSPYGGLLN
LGDGEFLVSA SPEMFVRSDG RRVETCPISG TIARGVDAIA DAEQIQKLLN SEKDEFELNM
CTDVDRNDKA RVCVPGTIKV LARRQIETYS KLFHTVDHVE GMLRPGFDSL DAFLTHAWAV
TVTGAPKLWA MQFVEDHERT PRRWYAGAFG VVGFDGSINT GLTIRTIRMK DGLAEVRVGA
TCLFDSDPVA EDKECQVKAA ALFQALRGDP PKPLSAVAPD ATGSGKKVLL IDHDDSFVHM
LADYFRQVGA QVSVVRHIHA QKMLADNAYD LLVLSPGPGR PEDFKIKSTI DAALARKLPI
FGVCLGVQAI GEYFGGHLGQ LAQPAHGRPS RIQVRGGTLM NGLPNEITIG RYHSLFVEMQ
DMPDTLKVTA STEDGIAMAI EHKSLPVGGV QFHPESLMSL GGQVGLRIVE NAFRLGLTPN
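
The sequences above are laid out in blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to strip the layout, compute GC content, and translate a gene sequence. It assumes that translation table 11 assigns codons to amino acids exactly as the standard code does (the two differ in which start codons are permitted), and all function names are hypothetical helpers rather than an existing library interface.

```python
BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ...
# (assumed identical for the standard code and table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA sequence."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE.get(dna[pos:pos + 3], "X")  # X = unknown codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
fragment = clean_sequence("ATGAACAGGA CCGTTTTCTC GCTACCCAAG")
print(translate(fragment))  # MNRTVFSLPK
```

The translated fragment matches the first block of the protein sequence. Passing the full gene sequence through `clean_sequence` and `gc_content` gives a value to compare with the GC content field.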