Gene Rpal_3481 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_3481 
Symbol 
ID6411155 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp3727848 
End bp3729104 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content65% 
IMG OID642713360 
Productaminodeoxychorismate lyase 
Protein accessionYP_001992457 
Protein GI192291852 
COG category[R] General function prediction only 
COG ID[COG1559] Predicted periplasmic solute-binding protein 
TIGRFAM ID[TIGR00247] conserved hypothetical protein, YceG family 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.114938 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGAAA GGCCGCCGAT TTCGCCGAGA AGCCCGCGTG CGGCGCTAGA GCCGGAACAG 
CTTCCGCCGC CGCCGAAGCG GTCCGATCAC GCCCGCAATC CGCTGGTCAT CATCGGCAAC
GCGATCATCA CTTTCATTGT GGTTGTGATG ATCGGCGCCG GCGGCTTGTA CGTGTACGGC
AAGAACAAGC TCGAAGCGCC GGGACCGCTC GCGCAGGACA AGACTGTCAA TATTCCGCAG
CGTGCTGGCC TCGACGACAT CGCGCAGATC CTGAAGCGCG AAGGCGTCAT CGAAGACGGT
TGGCTGGTGT TCGCAGGCGG CGTGATGGCA CTGCGCGCCC GCACCGAGCT CAAGCCGGGC
GAGTATCTGT TTCAGAAGAA TGCCAGCCTG CGCGACGTGA TCGGAACCAT CGTCGAAGGC
AAGGTGGTGC AGCACGCGGT GACGATTCCC GAAGGACTGA CTTCGGAGCA GATCGTCGAG
CGCCTGTCCG ACAATCCTAT CTTCACCGGA AGCATCCGCG AAATTCCGCG CGAAGGAACA
TTGCTGCCGG AGACCTACAA GTTTCCGCGC GGGACGCCGC GCGAGCAGGT GATCCACCGC
TTGCAGCAGG CGCAGAAGCG GGTGCTCAGC GAGATCTGGG AGCGTCGCAG TCCCGACCTG
CCGATCAAGA CTCCGGAGCA ACTGGTGACG CTGGCTTCGC TGGTTGAGAA AGAGACCGGC
AAGCCGGACG AGCGCACACG CGTCGCCGCC GTATTCGTCA ATCGGCTGCA GAAGAAGATG
CGGCTGCAGT CCGATCCGAC GATCATCTAT GGCCTCGTCG GCGGCAAGGG CACGCTCGGC
CGCCCGATCA AGCGAAGCGA GATCACGCAG CCGTCCCCGT ACAACACCTA TGTGATCGAC
GGTTTGCCGC CCGGGCCGAT CGCCAATCCG GGGCGCGCGT CGCTGGAGGC TGCGGCCAAT
CCGGCGCGCA CCCGCGATCT GTACTTCGTC GCCGATGGCA GCGGTGGGCA CGCCTTCAGC
GACAATTACG AGGTGCACCA GAAGAACGTC GGCAAGCTGC GGGCACAGGA AAAGCAGCTC
CAGAACGACA CCGTCGAGCC GCCGGAGGAA ACGCCGCCGA CCACAGCCGC TCCGGCGGCA
GAGCCGGCGG GCGATCCTGC GGCAGCCGCG CCGGCCGGGG CGCCGAAGGC CGCCGGCAAG
AACGGTGCGC AGAAGCGTCG CGCTCGCAAT GCCACGCCGA ACGGTGCGAC CGAGTAA
 
Protein sequence
MSERPPISPR SPRAALEPEQ LPPPPKRSDH ARNPLVIIGN AIITFIVVVM IGAGGLYVYG 
KNKLEAPGPL AQDKTVNIPQ RAGLDDIAQI LKREGVIEDG WLVFAGGVMA LRARTELKPG
EYLFQKNASL RDVIGTIVEG KVVQHAVTIP EGLTSEQIVE RLSDNPIFTG SIREIPREGT
LLPETYKFPR GTPREQVIHR LQQAQKRVLS EIWERRSPDL PIKTPEQLVT LASLVEKETG
KPDERTRVAA VFVNRLQKKM RLQSDPTIIY GLVGGKGTLG RPIKRSEITQ PSPYNTYVID
GLPPGPIANP GRASLEAAAN PARTRDLYFV ADGSGGHAFS DNYEVHQKNV GKLRAQEKQL
QNDTVEPPEE TPPTTAAPAA EPAGDPAAAA PAGAPKAAGK NGAQKRRARN ATPNGATE