Gene Information |
Locus tag | RPB_4103 |
Symbol | |
ID | 3911910 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 4672497 |
End bp | 4673702 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637886007 |
Product | extracellular ligand-binding receptor |
Protein accession | YP_487707 |
Protein GI | 86751211 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
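The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not say so explicitly, but the listed gene length is consistent with that reading) and that the final codon of the CDS is an untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 4672497
end_bp = 4673702

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the two computed values should agree with the Gene Length and Protein Length fields.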
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.742112 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0857946 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAGC TGATGGCAGC GGCGTGCGCC GCCGCGTGCA TCGTCGCGTC CGCGCCTGCG TATGCCGGGA ATTACGGACC GGGCGTGACC GACACCGAGG TGAAGATCGG GCAGACCATG CCGTATAGCG GCCCGGCCTC GAGCTTCGCC GCGATCGGGC GCGCGATGAC GGCGTATTTC GAGAAGCTGA ATGCCGAGGG CGGTGTCAAC GGCCGCAAGA TCAACCTGGT CTCGCTCGAC GACGGCTACA GTCCGTCGAA GGCGGTGGAG CAGACCCGCC GGCTAGTGGA GAGCGACGAG GTGCTGGCGA TCGTCGGCAC CTTCGGTTCG CCGACCAATT TTGCGATTCA GAAATATCTC AACACCAGGA AGGTGCCGGG TCTGTTCCTC GGCACCGGGG CGAACCGCGT CTCCGAGCCG CAGACCTATC CGTGGTCGAT GGGCTGGCAG CCGAACAACC ACGCCAAGGG CGTGATCTAC GCCAAATATC TTCTCAAGGA ACGCCCCAAC GCCAAGGTCG CGGTGCTGTA TCAGAACGAC GATTTCGGCC GCGACTATGC CAAGGGCTTT CGCGACGGGC TCGGCGACAA GGCCGGCAGC ATGATCGTCA AGGAGCTCAG CTACGAGATC ACCGAGCCGA CGGTCGACTC CCAGATCTTG CTGTTGAAGT CGACCGGTGC CGATGTGTTC CTGAACATCT CGACGCCGAA GTTCTCCGCG CAGGCGATCA AAAAGATGAC CGAGACCAAG TGGGAGGCGC TGCACATGCT GAGCGACGCC GCCGGTTCGA TCTCGAGCAC GCTGGTGCCG GCAGGGCTGG AGAACTCCAA GGGCGTGATC ACGGTCGCGT TCCGCAAGGA CCCGAACGAT CCGGCCTGGG CCGAGGATCC GGGGATGAAA CAGTATCTGG CATTCATGAA GCAATACATG CCGAACGCCG ATCCGTCCGA GACGTATTAC GTGTTTGGCT ATGCCACCGC GCAGACCTTC GAGCATGTGT TGAAGGCCTG CGGCGACGAA CTGACCCGCG AGAACCTGAT GAAGCAGGCG GCCAGCATCA AGGATCTGGA ACTGCCGATC CTGCTGCCCG GCATCAAGCT CAACACCAGC GCGACTCACT TCACGCCGAT GAGCCAGGAG CAGTTGATGC AGTTCGACGG CACCAGGTGG AAGCCGATCG GCGTGGTGAT CGACGCCGCA AAATAG
|
Protein sequence | MKKLMAAACA AACIVASAPA YAGNYGPGVT DTEVKIGQTM PYSGPASSFA AIGRAMTAYF EKLNAEGGVN GRKINLVSLD DGYSPSKAVE QTRRLVESDE VLAIVGTFGS PTNFAIQKYL NTRKVPGLFL GTGANRVSEP QTYPWSMGWQ PNNHAKGVIY AKYLLKERPN AKVAVLYQND DFGRDYAKGF RDGLGDKAGS MIVKELSYEI TEPTVDSQIL LLKSTGADVF LNISTPKFSA QAIKKMTETK WEALHMLSDA AGSISSTLVP AGLENSKGVI TVAFRKDPND PAWAEDPGMK QYLAFMKQYM PNADPSETYY VFGYATAQTF EHVLKACGDE LTRENLMKQA ASIKDLELPI LLPGIKLNTS ATHFTPMSQE QLMQFDGTRW KPIGVVIDAA K
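The gene sequence is displayed in space-separated 10-letter blocks, and it begins with ATG and ends with TAG, so it appears to be given in coding orientation even though the gene lies on the minus strand. The sketch below shows one way to strip the spacing, compute a GC fraction, and translate the DNA. It assumes that translation table 11 assigns codons to amino acids exactly as the standard code does (the two differ in permitted start codons); the helper names `clean`, `gc_fraction`, and `translate` are hypothetical and not part of any database tooling.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: translation table 11 shares these codon-to-amino-acid
# assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the whitespace that splits the sequence into blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(dna):
    """Fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record above.
prefix = clean("ATGAAAAAGC TGATGGCAGC GGCGTGCGCC")
print(translate(prefix))  # MKKLMAAACA
```

The translated prefix matches the first block of the protein sequence in the record. Passing the full gene sequence to `clean` and then to `gc_fraction` and `translate` would allow the same comparison against the GC content and protein sequence fields.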