Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_2667 |
Symbol | |
ID | 3910460 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 3049965 |
End bp | 3051245 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637884567 |
Product | O-acetylhomoserine aminocarboxypropyltransferase |
Protein accession | YP_486280 |
Protein GI | 86749784 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2873] O-acetylhomoserine sulfhydrylase |
TIGRFAM ID | [TIGR01326] OAH/OAS sulfhydrylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.338224 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.137932 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGAAC GCAGCCCGGG ATTTGCCACG CTCGCGGTTC ACGCCGGCGC GCAGCCCGAT CCCACCACCG GGGCGCGGGC GACGCCGATC TACCAGACCA CCTCCTTCGT GTTCAACGAC GCCGACCACG CCGCCTCGCT GTTCGGCCTG CAGGCGTTCG GCAACATCTA CACCCGCATC ACCAATCCGA CGACGGCGGT GCTCGAGGAG CGCGTCGCCG CGCTCGAAGG CGGCACCGCG GCGCTCGCGA CCGCGTCCGG CCACGCCGCG CAGCTCGTGG TGATGCAGCA ATTGCTGATG CCCGGCGACG AATTCATCGC CGCGCGAAAA CTCTACGGCG GCTCGATCAA CCAGTTCACC CACGCCTTCA AGAGCTTCGG CTGGAACGTG GTGTGGGCCG ATCCCGACGA CATCGACAGC TTCCAGCGCG CGGTGACGCC GAAGACCAAG GCGATCTTCA TCGAATCGAT CGCCAATCCG GCGGGCTCCA TCACCGATAT CGAGGCGATC GCCGAAGTCG CGCGCAGTGC CGGCGTGCCG CTGATCGTCG ACAACACCCT GGCGACGCCC TATCTGATCC GCCCGATCGA CCACGGCGCC GACATCGTCG TGCATTCGCT GACGAAATTT CTCGGCGGCC ACGGCAATTC GCTCGGCGGC ATCATCGTCG ACGCCGGCAC CTTCGACTGG TCGAAGGGCG GCAAATATCC GATGCTGAGC GAGCCGCGGC CGGAATATCA CGGGCTGAAG CTGCAGGAGA CGTTCGGCAA TTTCGCCTTC GCGATCGCCT GCCGCGTGCT CGGCCTGCGC GACCTCGGCC CGGCGCTGTC GCCGTTCAAC GCCTTCCTGC TGATGACCGG CATCGAGACG CTGCCGCTGC GGATGCAGAA GCATTGCGAG AACGCCAAGG CGATCGCCGA ATTCCTGGCG ACCCACAAGG CGGTGTCGGC GGTGAACTAT TCCGGCCTGG CGTCGAGCAA GTACAATGCG CTGGCCCGCA AATACGCGCC GAAGGGCGCC GGCGCGGTGT TCACCTTCAG CCTCAAGGGT GGCTACCAGG CCGGCGTCGA TCTGGTCTCC AATGTGAAGC TGTTCTCGCA TCTCGCCAAT GTCGGCGACA CCCGTTCGCT GATCATCCAT CCGGCCTCGA CCACCCACAG CCAGCTCGAC GACGCGCAGA AGACGGCGGC CGGCGCCGCG CCGGACATGG TGCGGGTGTC GATCGGCATC GAGGACAAGG AAGATCTGAT CGCGGATCTC GACGGAGCGC TCGGCGGCTG A
|
Protein sequence | MTERSPGFAT LAVHAGAQPD PTTGARATPI YQTTSFVFND ADHAASLFGL QAFGNIYTRI TNPTTAVLEE RVAALEGGTA ALATASGHAA QLVVMQQLLM PGDEFIAARK LYGGSINQFT HAFKSFGWNV VWADPDDIDS FQRAVTPKTK AIFIESIANP AGSITDIEAI AEVARSAGVP LIVDNTLATP YLIRPIDHGA DIVVHSLTKF LGGHGNSLGG IIVDAGTFDW SKGGKYPMLS EPRPEYHGLK LQETFGNFAF AIACRVLGLR DLGPALSPFN AFLLMTGIET LPLRMQKHCE NAKAIAEFLA THKAVSAVNY SGLASSKYNA LARKYAPKGA GAVFTFSLKG GYQAGVDLVS NVKLFSHLAN VGDTRSLIIH PASTTHSQLD DAQKTAAGAA PDMVRVSIGI EDKEDLIADL DGALGG
|
| |