Gene Information |
Locus tag | RPB_1663 |
Symbol | |
ID | 3908650 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 1892458 |
End bp | 1893654 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637883557 |
Product | extracellular ligand-binding receptor |
Protein accession | YP_485282 |
Protein GI | 86748786 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
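The length fields in the table above follow from the coordinates by simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names (`span_length`, `protein_length`, `gc_fraction`) are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 1892458
END_BP = 1893654


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; whitespace between 10-base blocks is ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in cleaned) / len(cleaned)


gene_len = span_length(START_BP, END_BP)
print(gene_len)                  # 1197
print(protein_length(gene_len))  # 398
```

Under these assumptions the computed values agree with the Gene Length (1197 bp) and Protein Length (398 aa) fields. `gc_fraction` can be applied to the gene sequence given in the Sequence section to compare against the reported GC content.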
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.596073 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAAACACG TCGCACGACT GGCTTTGACG GCACTGCTTT TCGCCTCCGG CGCTGCCTAT GCGCAGCAAG GCGAGATCAA GGTGGGAGAG ATCAACTCCT ATTCGGCACT CCCCGGCTTC ACCGAGCCAT ACCGCAAGGG CATGGAACTG GCGCTGAAGC AGATCAACGA TGCCGGCGGC ATCAAGGGCA AGAAGCTCGT CGTCATCACC AAGGACGACG GCGGCAAGCC GGGCGACGCG CTGACCGCCG CCAACGAGCT GGTGTCGCGC GACGGCGTGG TGATGATCGC CGGCGGCTTC CTGTCGAATA TCGGCCTGGC GCTGTCCGAC TTCGCCAAGC AGAAGAAGGT GCTCTACGTC GCCGCCGAGC CGCTGACCGA CGCCATCGTC TGGTCGAAGG GCAACGACTA TACTTTCCGG CTGCGCACAT CGAACTATAT GCAGGCCTCG ATGCTGGCGG AGGAAGCCGC CAAGCTGCCG GCCAAGAAGT GGGCGACGAT TGCGCCCAAC TACGAATTCG GCCAGTCGTT CGTCGCCGCG TTCAAGGAGA TTCTCAGCAA GAAGCGTCCC GACGTCGAGT TCGTCGCCGA GCAATGGCCG CCGCTGAACA AGATCGACGC CGGCCCGGTG CTGCAGGCGA TCGACGCCGC CAAGCCCGAC GCCATCCTCA ACGCCACCTT CGCCGGCGAC CTGGTCAAGC TGGTGCGCGA GGGCAATACC CGCGGCGTGT TCAAGGATCG CGCGGTGGTG AGCTATCTCA CCGGCGAGCC GGAATATCTC GACCCGCTGA AGACCGAAAC GCCGGAGGGC TGGATCGTCA CCGGCTATCC CTGGTACGCG ATCAGCACGC CGGAGCATCA GGCTTTCCTC GACGCTTACC AACAGCTCAA CAAGGACTAT CCGCGGCTCG GCTCGGTGGT CGGCTACGCC ACCGTGAAGA CCATCGCCGC CGTGCTGACC GCCACCGACG ATCACTCCAC CGACGGCCTG GTCAAGGCGA TGAAGAACCT GAAGGTCGAC ACCCCGTTCG GCGCGGTCGT CTACCGCGCC GGCGATCATC AGTCGACGAT GGGCGCCTAT GTCGGCAAGA CCACGCAGAA GGACGGCAAG GGCATCATGA CCGACATCAA GTTCAAGAAG GGCGCCGACT ATCTACCGCC CGAGGCCGAG GCCGCCAAGC TGCGTCCGGC GAACTGA
Protein sequence | MKHVARLALT ALLFASGAAY AQQGEIKVGE INSYSALPGF TEPYRKGMEL ALKQINDAGG IKGKKLVVIT KDDGGKPGDA LTAANELVSR DGVVMIAGGF LSNIGLALSD FAKQKKVLYV AAEPLTDAIV WSKGNDYTFR LRTSNYMQAS MLAEEAAKLP AKKWATIAPN YEFGQSFVAA FKEILSKKRP DVEFVAEQWP PLNKIDAGPV LQAIDAAKPD AILNATFAGD LVKLVREGNT RGVFKDRAVV SYLTGEPEYL DPLKTETPEG WIVTGYPWYA ISTPEHQAFL DAYQQLNKDY PRLGSVVGYA TVKTIAAVLT ATDDHSTDGL VKAMKNLKVD TPFGAVVYRA GDHQSTMGAY VGKTTQKDGK GIMTDIKFKK GADYLPPEAE AAKLRPAN
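The relationship between the two sequences can be checked by translating the gene sequence codon by codon. The following is a minimal sketch, not the database's own pipeline. It relies on translation table 11 using the same amino-acid assignments as the standard genetic code (the tables differ in which start codons are permitted), and it does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are introduced for illustration.

```python
from itertools import product

# Amino-acid assignments in NCBI codon order (bases T, C, A, G).
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str, to_stop: bool = True) -> str:
    """Translate a coding sequence; spaces between 10-base blocks are ignored."""
    cleaned = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*" and to_stop:
            break  # stop codon: end of the protein
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence listed above.
print(translate("ATGAAACACG TCGCACGACT GGCTTTGACG"))  # MKHVARLALT
# Last six codons of the gene sequence, ending in the TGA stop codon.
print(translate("CTGCGTCCGGCGAACTGA"))                # LRPAN
```

The two fragments reproduce the first ten and last five residues of the listed protein sequence. Passing the full gene sequence to `translate` gives the complete translation for comparison.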