Gene Information |
Locus tag | RPB_3892 |
Symbol | |
ID | 3911696 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 4447347 |
End bp | 4448492 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637885793 |
Product | extracellular ligand-binding receptor |
Protein accession | YP_487496 |
Protein GI | 86751000 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
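The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the record above.
start_bp = 4447347
end_bp = 4448492

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and ends with one stop codon,
# which does not contribute a residue to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1146
print(protein_length)   # 381
```

Under those assumptions the result agrees with the Gene Length (1146 bp) and Protein Length (381 aa) fields.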
|
|
Plasmid Coverage Information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAAAC TCAAGGCGCT CCTGGCCGCG GCCGTGCTCG CGTCGACCGC CGGCGCCGCA TCGGCGCAGG TCAAGGTCGG CGTGATCGCG TCGTCGACCG GTCCGATCTC GGTGGTCGGG CTGCAGCAGA AGAACACCGT GGCGCTGTTG CCGAAGACCA TCGGCAATCT CACGGTCGAC TACATCTACA TGGACGACAA CAGCGATCCC ACCCAGGCGA CCAAGAACGT CCAGAAATTC CTGATCGAGG ACAAGGTCGA CGCCATCATC GGCCCGTCCG GATCGCCGAA CGCGATGGCC GTGCTGTCCT TCATCGCCGA CGCCAAGACG GTGATGCTGG CGCCGGTCGG CACCACCGCG GTGGTGCTGC CGATGGACGA GAAAAAGAAG TGGGTGTTCA AGACCACCCA GAACGACAAT CTCGTGATGG ACGCGCTGGT CGCGCACATG AAGACCAACG GCGTCAAGAC GGTCGCCTTC ATCGGCACCA ACGATCCGCT CGGCGCCAAT TTCGCCAAGG CCTTCAAGGC CGTGATCGAC AAGGAAGGCA TCAAGCTGGT CGCCGAAGAA ACCTTCAGCC GCGCCGATAC GTCGGTGGCC GGCCAGAGCC TGAAGATTCT GTCGGCCGCG CCGGACGCGG TGCTGGTCGG CGCGGCGGGC GCCACCACCG TGCTGCCCGA AGTGACGCTG GTCGATCAGG GCTACAAGGG CAAGATCTAC CAGACCCACG GCGCCGCGAC GCCGGAATTC ATCAAGCTGG GCGGCAAGAA GGTCGAGGGC ACCATTCTCG CCGCGAGCCC GATGCTGGTG CAGGCGCAGA TCGGCGACGA TGTGCCGTCG AAGGCCTCGG GACAAGCCTA TATCGACGCC TACACCAAGG TGTACGGCGT AGCGCCCGGC ACCTTCGGCG CCAATGTCTG GGACGCCGGC CTGCTGCTGG AACGCACCAT TCCGATCGCG GCCGCCAAGG CCAAGCCCGG CACGCCGGAA TTCCGCGCCG AGCTGCGCGA CGCGCTGGAG AACGTCAAGA ATCTCACCGG CGCGCAGGGC GTCTACAACA TGACTGCGGC CGATCATTCC GGCTTCGACG CCCGCAGCAT CGTCACCATC GCGGTGAAGG ACGGCGCCTG GAAGCTGCTG AAGTAA
|
Protein sequence | MNKLKALLAA AVLASTAGAA SAQVKVGVIA SSTGPISVVG LQQKNTVALL PKTIGNLTVD YIYMDDNSDP TQATKNVQKF LIEDKVDAII GPSGSPNAMA VLSFIADAKT VMLAPVGTTA VVLPMDEKKK WVFKTTQNDN LVMDALVAHM KTNGVKTVAF IGTNDPLGAN FAKAFKAVID KEGIKLVAEE TFSRADTSVA GQSLKILSAA PDAVLVGAAG ATTVLPEVTL VDQGYKGKIY QTHGAATPEF IKLGGKKVEG TILAASPMLV QAQIGDDVPS KASGQAYIDA YTKVYGVAPG TFGANVWDAG LLLERTIPIA AAKAKPGTPE FRAELRDALE NVKNLTGAQG VYNMTAADHS GFDARSIVTI AVKDGAWKLL K
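The relationship between the Gene sequence, Protein sequence and GC content fields can be sketched in a few lines of Python. This is a minimal illustration, not the database's own method. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TAA even though the gene lies on the minus strand) and that the spaces are only display grouping. The helper names `clean`, `gc_fraction` and `translate` are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard table and differs in which start codons are allowed, so the standard codon layout is used here.

```python
# Codon table in the NCBI layout: bases ordered T, C, A, G, third base varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq):
    """Remove the whitespace used for display grouping and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence shown above.
prefix = "ATGAACAAAC TCAAGGCGCT CCTGGCCGCG"
print(translate(prefix))  # MNKLKALLAA
```

Passing the full Gene sequence value to `translate` and `gc_fraction` in the same way should allow a comparison against the Protein sequence and GC content fields.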
|
| |