Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_3481 |
Symbol | |
ID | 6411155 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 3727848 |
End bp | 3729104 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 642713360 |
Product | aminodeoxychorismate lyase |
Protein accession | YP_001992457 |
Protein GI | 192291852 |
COG category | [R] General function prediction only |
COG ID | [COG1559] Predicted periplasmic solute-binding protein |
TIGRFAM ID | [TIGR00247] conserved hypothetical protein, YceG family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.114938 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGAAA GGCCGCCGAT TTCGCCGAGA AGCCCGCGTG CGGCGCTAGA GCCGGAACAG CTTCCGCCGC CGCCGAAGCG GTCCGATCAC GCCCGCAATC CGCTGGTCAT CATCGGCAAC GCGATCATCA CTTTCATTGT GGTTGTGATG ATCGGCGCCG GCGGCTTGTA CGTGTACGGC AAGAACAAGC TCGAAGCGCC GGGACCGCTC GCGCAGGACA AGACTGTCAA TATTCCGCAG CGTGCTGGCC TCGACGACAT CGCGCAGATC CTGAAGCGCG AAGGCGTCAT CGAAGACGGT TGGCTGGTGT TCGCAGGCGG CGTGATGGCA CTGCGCGCCC GCACCGAGCT CAAGCCGGGC GAGTATCTGT TTCAGAAGAA TGCCAGCCTG CGCGACGTGA TCGGAACCAT CGTCGAAGGC AAGGTGGTGC AGCACGCGGT GACGATTCCC GAAGGACTGA CTTCGGAGCA GATCGTCGAG CGCCTGTCCG ACAATCCTAT CTTCACCGGA AGCATCCGCG AAATTCCGCG CGAAGGAACA TTGCTGCCGG AGACCTACAA GTTTCCGCGC GGGACGCCGC GCGAGCAGGT GATCCACCGC TTGCAGCAGG CGCAGAAGCG GGTGCTCAGC GAGATCTGGG AGCGTCGCAG TCCCGACCTG CCGATCAAGA CTCCGGAGCA ACTGGTGACG CTGGCTTCGC TGGTTGAGAA AGAGACCGGC AAGCCGGACG AGCGCACACG CGTCGCCGCC GTATTCGTCA ATCGGCTGCA GAAGAAGATG CGGCTGCAGT CCGATCCGAC GATCATCTAT GGCCTCGTCG GCGGCAAGGG CACGCTCGGC CGCCCGATCA AGCGAAGCGA GATCACGCAG CCGTCCCCGT ACAACACCTA TGTGATCGAC GGTTTGCCGC CCGGGCCGAT CGCCAATCCG GGGCGCGCGT CGCTGGAGGC TGCGGCCAAT CCGGCGCGCA CCCGCGATCT GTACTTCGTC GCCGATGGCA GCGGTGGGCA CGCCTTCAGC GACAATTACG AGGTGCACCA GAAGAACGTC GGCAAGCTGC GGGCACAGGA AAAGCAGCTC CAGAACGACA CCGTCGAGCC GCCGGAGGAA ACGCCGCCGA CCACAGCCGC TCCGGCGGCA GAGCCGGCGG GCGATCCTGC GGCAGCCGCG CCGGCCGGGG CGCCGAAGGC CGCCGGCAAG AACGGTGCGC AGAAGCGTCG CGCTCGCAAT GCCACGCCGA ACGGTGCGAC CGAGTAA
|
Protein sequence | MSERPPISPR SPRAALEPEQ LPPPPKRSDH ARNPLVIIGN AIITFIVVVM IGAGGLYVYG KNKLEAPGPL AQDKTVNIPQ RAGLDDIAQI LKREGVIEDG WLVFAGGVMA LRARTELKPG EYLFQKNASL RDVIGTIVEG KVVQHAVTIP EGLTSEQIVE RLSDNPIFTG SIREIPREGT LLPETYKFPR GTPREQVIHR LQQAQKRVLS EIWERRSPDL PIKTPEQLVT LASLVEKETG KPDERTRVAA VFVNRLQKKM RLQSDPTIIY GLVGGKGTLG RPIKRSEITQ PSPYNTYVID GLPPGPIANP GRASLEAAAN PARTRDLYFV ADGSGGHAFS DNYEVHQKNV GKLRAQEKQL QNDTVEPPEE TPPTTAAPAA EPAGDPAAAA PAGAPKAAGK NGAQKRRARN ATPNGATE
|
| |