Gene Rpal_4991 details


Gene Information

Locus tag: Rpal_4991
Symbol:
ID: 6412683
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 5372632
End bp: 5374791
Gene Length: 2160 bp
Protein Length: 719 aa
Translation table: 11
GC content: 67%
IMG OID: 642714874
Product: anthranilate synthase
Protein accession: YP_001993955
Protein GI: 192293350
COG category: [E] Amino acid transport and metabolism
              [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
        [COG0512] Anthranilate/para-aminobenzoate synthases component II
TIGRFAM ID: [TIGR00566] glutamine amidotransferase of anthranilate synthase or aminodeoxychorismate synthase
            [TIGR01815] anthranilate synthase, alpha proteobacterial clade
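
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 5372632
end_bp = 5374791
reported_gene_length = 2160    # bp
reported_protein_length = 719  # aa

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length == reported_gene_length)        # True
print(protein_length == reported_protein_length)  # True
```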


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACAGGA CCGTTTTCTC GCTTCCCGCG ACCAGCGACT ATAAGACCGC CGCGGGCCTC 
GCGGTGACGC GCAGCGCCCA GCCTTTTGCC GGCGGCCAGG CGCTCGACGA GCTGATCGAT
CTGCTCGACC ACCGCCGCGG CGTGATGCTG TCGTCCGGCA CAACCGTGCC GGGCCGCTAC
GAGAGCTTCG ACCTCGGCTT TGCCGATCCG CCGCTGGCGC TCACCACTAG GGCCGAAAAA
TTCACCATCG AGGCGCTCAA TCCGCGCGGC CGGGTGCTGA TCGCGTTCCT GTCCGACAAG
CTTGAAGAGC CCTGCGTCGT CGTCGAGCAG GCCTGCGCCA CCAAGATCAG GGGCCACATC
GTCCGCGGCG AGGCCCCGGT CGACGAAGAA CAACGCACCC GCCGCGCCAG CGCGATCTCT
CTGGTGCGCG CGGTGATTGC TGCCTTCGCC TCGCCGGCCG ATCCGATGCT CGGACTGTAC
GGCGCCTTCG CCTACGACCT GGTGTTCCAG TTCGAGGATC TGAAGCAGAA GCGTGCCCGC
GAAGCCGACC AGCGCGACAT CGTGCTGTAC GTGCCGGATC GCCTGCTGGC CTATGACCGC
GCCACCGGCC GCGGCGTCGA CATTTCCTAC GAATTCGCCT GGAAGGGCCA TTCCACCGCC
GGCCTGCCGA ACGAAACCGC CGAGAGCGTC TACACCCAGA CCGGCCGGCA GGGTTTCGCC
GACCACGCCC CGGGCGACTA TCCCAAGGTG GTCGAGAAGG CCCGCGCGGC GTTCGCCCGT
GGCGACCTGT TCGAGGCGGT GCCGGGCCAG TTGTTCGGTG AGCCGTGCGA GCGGTCGCCG
GCCGAAGTGT TCAAGCGGTT GTGCCGGATC AACCCGTCGC CCTATGGCGG CCTGCTCAAT
CTCGGCGCCG GCGAATTCCT GGTGTCGGCC TCGCCGGAAA TGTTCGTCCG CTCGGACGGC
CGCCGGATCG AGACCTGCCC GATCTCCGGC ACCATCGCCC GCGGCGTCGA TGCGATCAGC
GATGCCGAGC AGATCCAGAA GCTCTTGAAC TCCGAGAAAG ACGAGTTCGA GCTGAATATG
TGCACCGACG TCGACCGCAA CGACAAGGCG AGGGTCTGCG TGCCGGGCAC GATCAAGGTG
CTGGCGCGCC GCCAGATCGA GACCTACTCG AAACTGTTCC ACACCGTCGA CCATGTCGAA
GGCATGCTGC GGCCGGGCTT CGACGCGCTC GACGCCTTCC TCACCCACGC CTGGGCGGTC
ACGGTCACCG GCGCGCCGAA GCTGTGGGCG ATGCAGTTCG TCGAGGATCA CGAGCGCAGC
CCGCGACGCT GGTATGCCGG CGCGTTCGGC GTGGTCGGCT TCGATGGCTC GATCAACACC
GGCCTCACCA TCCGCACCAT CCGGATGAAG GACGGCCTCG CCGAAGTTCG CGTCGGCGCC
ACCTGCCTGT TCGACAGCGA TCCGGTCGCC GAAGACAAGG AATGCCAGGT CAAGGCCGCG
GCGCTGTTCC AGGCGCTGCG CGGCGATCCG GCCAAGCCGC TGTCGGCGGT GGCGCCGGAC
GCCACTGGCT CGGGCAAGAA GGTGCTGCTG GTCGACCACG ACGACAGCTT CGTGCACATG
CTGGCGGACT ATTTCCGTCA GGTCGGCGCC CAGGTCACTG TGGTGCGCTA CGTTCACGGC
CTGAAGATGC TGGCCGAAAA CAGCTATGAT CTTCTGGTGC TGTCGCCCGG TCCCGGCCGG
CCGGAGGACT TCAAGATCAA GGATACGATC GACGCCGCGC TCGCCAAGAA GCTGCCGATC
TTCGGCGTCT GCCTCGGCGT CCAGGCGATG GGCGAATATT TTGGCGGTAC GCTCGGCCAG
CTCGCGCAGC CGGCTCACGG CCGCCCGTCG CGGATCCAGG TGCGCGGCGG CGCGCTGATG
CGCGGTCTCC CGAACGAGGT CACCATCGGC CGCTACCACT CGCTCTATGT CGACATGCGC
GACATGCCGA AGGAGCTGAC CGTCACCGCC TCCACCGATG ACGGCATCGC GATGGCGATC
GAGCACAAGA CCCTGCCGGT CGGCGGCGTG CAGTTCCACC CCGAGTCGCT GATGTCGCTC
GGCGGCGAGG TCGGGCTGCG GATCGTCGAA AACGCATTCC GGCTCGGCCA GGCGGCCTAA
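
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before its length or GC content can be computed. The following is a minimal sketch of that clean-up. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input. Running it on the full sequence should give a value close to the reported 67% if the record is internally consistent.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used here as sample input.
first_line = "ATGAACAGGA CCGTTTTCTC GCTTCCCGCG ACCAGCGACT ATAAGACCGC CGCGGGCCTC"
seq = clean_sequence(first_line)

print(len(seq))                        # number of bases on the first line
print(round(100 * gc_fraction(seq)))   # GC percentage of the first line only
```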
 
Protein sequence
MNRTVFSLPA TSDYKTAAGL AVTRSAQPFA GGQALDELID LLDHRRGVML SSGTTVPGRY 
ESFDLGFADP PLALTTRAEK FTIEALNPRG RVLIAFLSDK LEEPCVVVEQ ACATKIRGHI
VRGEAPVDEE QRTRRASAIS LVRAVIAAFA SPADPMLGLY GAFAYDLVFQ FEDLKQKRAR
EADQRDIVLY VPDRLLAYDR ATGRGVDISY EFAWKGHSTA GLPNETAESV YTQTGRQGFA
DHAPGDYPKV VEKARAAFAR GDLFEAVPGQ LFGEPCERSP AEVFKRLCRI NPSPYGGLLN
LGAGEFLVSA SPEMFVRSDG RRIETCPISG TIARGVDAIS DAEQIQKLLN SEKDEFELNM
CTDVDRNDKA RVCVPGTIKV LARRQIETYS KLFHTVDHVE GMLRPGFDAL DAFLTHAWAV
TVTGAPKLWA MQFVEDHERS PRRWYAGAFG VVGFDGSINT GLTIRTIRMK DGLAEVRVGA
TCLFDSDPVA EDKECQVKAA ALFQALRGDP AKPLSAVAPD ATGSGKKVLL VDHDDSFVHM
LADYFRQVGA QVTVVRYVHG LKMLAENSYD LLVLSPGPGR PEDFKIKDTI DAALAKKLPI
FGVCLGVQAM GEYFGGTLGQ LAQPAHGRPS RIQVRGGALM RGLPNEVTIG RYHSLYVDMR
DMPKELTVTA STDDGIAMAI EHKTLPVGGV QFHPESLMSL GGEVGLRIVE NAFRLGQAA
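
The protein sequence can be checked against the gene sequence by translating the DNA codon by codon. The sketch below is one way to do this with no external libraries. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which start codons are allowed). The function name `translate` and the sample variables are hypothetical, and only the first line of each sequence is compared. Codons containing ambiguous bases such as N are not handled.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first, second, third base cycling through TCAG).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate block-formatted DNA, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence and the matching start of the protein sequence.
gene_start = "ATGAACAGGA CCGTTTTCTC GCTTCCCGCG ACCAGCGACT ATAAGACCGC CGCGGGCCTC"
protein_start = "MNRTVFSLPA TSDYKTAAGL"

print(translate(gene_start) == protein_start.replace(" ", ""))  # True
```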