Gene Information |
Locus tag | RPB_3575 |
Symbol | |
ID | 3911377 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 4099019 |
End bp | 4100176 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637885477 |
Product | extracellular ligand-binding receptor |
Protein accession | YP_487181 |
Protein GI | 86750685 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
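The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both are assumptions about this database's conventions, not statements made by the record itself. The variable names are illustrative.

```python
# Values copied from the record above.
start_bp = 4099019
end_bp = 4100176

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is excluded
# from the reported protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1158 385
```

Under these assumptions the computed values agree with the Gene Length (1158 bp) and Protein Length (385 aa) fields.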
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.807298 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.720427 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGGGT TCAAACTATC TGCTGCTGCG TTCGCCGTGG CGATCGCGCT GCCGGCGATG TCCGGCGCCG CGCTCGCCGA GACCAATGAA ATCACCGTGG GCATCACCGT CACCACGACG GGCCCGGCCG CCGCGCTCGG CATTCCGGAG CGCAACGCGC TGGAATTCGT GGCCAAGGAA ATCGGCGGTC ACCCGATCAA GATGATCGTG CTCGACGACG GCGGCGACCC GACCGCGGCG ACCACCAACG CGCGGCGTTT CGTCACGGAG TCGAAGGCCG ACGTGATCAT GGGTTCGTCG GTGACGCCGC CGACCGTGGC GGTCTCGAAC GTCGCCAACG AGGCGCAGGT GCCGCATATC GCGCTGGCGC CGCTGCCGGT CACGCCGGAG CGGGCGAAGT GGTCCGTGGT GATGCCGCAG CCGATCCCGA TCATGGGCAA GGTGCTCTAC GAGCACATGA AGAAGAACAA CATCAAGACC GTCGGCTACA TCGGCTATTC CGACAGCTAC GGCGATCTGT GGTTCAACGA TCTCAAGAAG CAGGGCGAGG CGATGGGCCT CAAGATCGTC GCCGAGGAAC GCTTCGCGCG CCCCGACACC TCGGTCGCGG GTCAGGTGCT GAAGCTCGTT GCCGCCAATC CCGACGCCAT CCTGGTCGGG GCGTCCGGCA CCGCCGCGGC GCTGCCGCAG ACCGCGCTGC GCGAGCGCGG CTACAACGGG CTGATCTATC AGACCCACGG CGCCGCCTCG ATGGACTTCA TCCGCATCGC CGGCAAGTCC GCCGAGGGCG TGCTGATGGC GTCCGGCCCG GTGATGGATC CGGAAGGCCA GAACGACAGC GCGCTGACCA AGAAGCCCGG CCTCGAACTC AACACGGCCT ATGAAACCAA GTACGGCCCG AACAGCCGCA GCCAGTTCGC CGGCCACTCC TTCGACGCCT TCAAGGTACT CGAGCGCGTG ATTCCGGTGG CGCTGAAGAC CGCCAAGCCC GGCACGCAGG AATTCCGCGA AGCGATCCGT AAGGCGCTGC TCACCGAAAA GGACATCGCG GCGAGCCAGG GCGTCTACAG CTTCACCGAG ACGGATCGCT ATGGTCTCGA CGATCGTTCG CGCATCCTGC TGACGGTGAA GAACGGCAAA TACGTCATCG TCAAGTAA
|
Protein sequence | MNGFKLSAAA FAVAIALPAM SGAALAETNE ITVGITVTTT GPAAALGIPE RNALEFVAKE IGGHPIKMIV LDDGGDPTAA TTNARRFVTE SKADVIMGSS VTPPTVAVSN VANEAQVPHI ALAPLPVTPE RAKWSVVMPQ PIPIMGKVLY EHMKKNNIKT VGYIGYSDSY GDLWFNDLKK QGEAMGLKIV AEERFARPDT SVAGQVLKLV AANPDAILVG ASGTAAALPQ TALRERGYNG LIYQTHGAAS MDFIRIAGKS AEGVLMASGP VMDPEGQNDS ALTKKPGLEL NTAYETKYGP NSRSQFAGHS FDAFKVLERV IPVALKTAKP GTQEFREAIR KALLTEKDIA ASQGVYSFTE TDRYGLDDRS RILLTVKNGK YVIVK
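The sequence fields above are shown in space-separated blocks of ten characters. The following is a minimal sketch of how such a field might be normalised and sanity-checked before further use. The function names are hypothetical, the start-codon check is deliberately simplified to `ATG` (translation table 11 also permits alternative start codons), and the demo uses only the first two blocks of the gene sequence rather than the full record.

```python
# Stop codons of translation table 11 (bacterial, archaeal and plant plastid code).
TABLE_11_STOPS = {"TAA", "TAG", "TGA"}


def clean_sequence(field: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(field.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_cds(seq: str) -> bool:
    """Rough check: whole codons, ATG start, table 11 stop codon at the end.

    Simplification: alternative start codons are not accepted here.
    """
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in TABLE_11_STOPS
    )


# Demo on the first two blocks of the gene sequence only.
first_blocks = clean_sequence("ATGAACGGGT TCAAACTATC")
print(len(first_blocks), gc_fraction(first_blocks))  # prints: 20 0.4
```

Passing the complete Gene sequence field through `clean_sequence` and then `gc_fraction` would allow a comparison with the GC content field, which is reported as a rounded percentage.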