Gene Information |
Locus tag | RPB_3339 |
Symbol | |
ID | 3911141 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 3819862 |
End bp | 3821094 |
Gene Length | 1233 bp |
Protein Length | 410 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637885242 |
Product | extracellular ligand-binding receptor |
Protein accession | YP_486946 |
Protein GI | 86750450 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
|
|
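The coordinate and length fields in the record above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it agrees with the stated gene length) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical, chosen for illustration only.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


# Values taken from the record for locus tag RPB_3339.
gene_length = feature_length(3819862, 3821094)
protein_length = expected_protein_length(gene_length)

print(gene_length)     # 1233, matching "Gene Length"
print(protein_length)  # 410, matching "Protein Length"
```

Because the gene is on the minus strand, the Start bp and End bp values still describe the span on the replicon; the length calculation is the same on either strand.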
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.154352 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.135284 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTATGA CGACGACTTC CGTGGCCGCG TTCGCGGCTG CGATCGCGAT GCTGGCCGCG AGCCCGGCAG CGGCCCAGAA GAAATACGGC CCCGGCGCCA GCGACACCGA GATCAAGCTC GGCAACACAG TGCCCTATAG CGGCCCAGCC TCGGCCTACG GCATTCTCGG CAAGACCTAT GCCGCGTATT TCGCAAAGAT CAACGAGGAA GGCGGCATCA ACGGCCGCAA GATCGTCCTG ATCTCTTATG ACGACGCCTA TTCGCCGCCG AAGACCGTGG AACAGACCCG CAAGCTGGTC GAAAGCGACG AGGTGCTGGC GATCGTCGGC AATGTCGGTA CCGCCTCCAA CATCGCGATC CAGAAATATC TGAACGCCAA GAAGACCCCG CAATTGTTTC TCGCCACCGG CGCGACGCGC TGGAACGATC CGAAGCAGTT TCCGTGGACC ATGGGCTGGC TGCCGAGCTA CCAGGCCGAG GCCACGGCCT ATGCGAAATA TCTGCTGAAG GAGAAGCCCG ACGCCAAGAT CGGCGTGTTC TACCAGAACG ACGATTTCGG CAAGGACTAC GTGCGCGGCC TGAAGGAGGG GCTCGGCGAC AAGGCGGCGA CGATGATCGT CGCCGAATCC AGCTACGAGG TCTCCGAGCC GACGGTGGAT TCCCACATCG TCAAGCTGAA GGCGGCCGGC GCCGACACGC TGCTCACCTT CGCGACCGGC AAGTTCGCCG CGCAGGCGAT CAAGAAGGTC GCCGAACTCG GCTGGAAGCC GCTGCACATC GTGCCCAACG CCAGTTCGTC GCTCGGCAGC GTGCTGCGCC CGGCCGGCCT CGACAATGCG CAGGACCTGG TGTCCGCGAC CTTCGCCAAG GACCCGACCG ATCCGCAGTG GAACGAGGAT CCGGGGATGA AGAAATTCCA CGCCTTCGTC GAGAAATACA TTCCCGAAGG CAAGGCGATG GAGAGCACCG TGCTGTCCGG CTACAGCATC GCCCAGACCA TGGCGGAGGC GCTGCGGATG TGCGGCGATG ATCTGACCCG CGACAACCTG ATGAAGCAGG CGGCGAACAT GAAGGACGTC AAGCTCGACG GCCTCTTGCC GGGCGTCACC GTCAACACCA GCGCCACCGA CTTCGCGCCG ATCGACCAGT TTCAGATGAT GGTGTTCAAG GGCGAGCGCT GGGGGCGGTT CGGCGACGTC ATCAAGGGCG AACTGGCCGT GGCCGGACGG TAA
|
Protein sequence | MRMTTTSVAA FAAAIAMLAA SPAAAQKKYG PGASDTEIKL GNTVPYSGPA SAYGILGKTY AAYFAKINEE GGINGRKIVL ISYDDAYSPP KTVEQTRKLV ESDEVLAIVG NVGTASNIAI QKYLNAKKTP QLFLATGATR WNDPKQFPWT MGWLPSYQAE ATAYAKYLLK EKPDAKIGVF YQNDDFGKDY VRGLKEGLGD KAATMIVAES SYEVSEPTVD SHIVKLKAAG ADTLLTFATG KFAAQAIKKV AELGWKPLHI VPNASSSLGS VLRPAGLDNA QDLVSATFAK DPTDPQWNED PGMKKFHAFV EKYIPEGKAM ESTVLSGYSI AQTMAEALRM CGDDLTRDNL MKQAANMKDV KLDGLLPGVT VNTSATDFAP IDQFQMMVFK GERWGRFGDV IKGELAVAGR
|
| |
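As a rough cross-check of the Sequence rows, the gene sequence can be translated and its GC fraction computed directly. The following is a minimal sketch, not the database's own method: it strips the spaces that separate the 10-base blocks, uses the standard codon-to-amino-acid assignments (translation table 11 shares these with the standard code and differs in which start codons it permits), and stops at the first stop codon. The helper names `clean`, `gc_fraction` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments, also used by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base in frame, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "ATGCGTATGA CGACGACTTC CGTGGCCGCG"
print(translate(first_blocks))  # MRMTTTSVAA, the first block of the protein sequence
```

Passing the full gene sequence to `translate` and `gc_fraction` should allow a comparison against the Protein sequence and GC content fields. The gene sequence is shown starting with ATG, so it appears to be given in coding orientation already and no reverse complement is taken here.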