Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_1474 |
Symbol | |
ID | 3908787 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 1663203 |
End bp | 1664624 |
Gene Length | 1422 bp |
Protein Length | 473 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637883369 |
Product | ABC transporter substrate-binding protein |
Protein accession | YP_485095 |
Protein GI | 86748599 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0903726 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTTCGTA CTTCTGGACA TCGTCTGATT TTCGCCGCCT TCACCGCCGC GATGATGGCG AGCACCCCGG TCGCCGCCCA GACCACCGTC ACGGTCGGCA TCGGCACCCA GGACACCACC ACCAACACCG CGACCACCGG CGTCGTCATT CGGCAGCTGA AGCTGCTGGA GAAGTATCTT CCCAAGGACG GCAAATACGC GAACGTCAAA TTCGAGTTCG ATTGGCAGAA TTTCACCTCC GGCCCACCCG TCACCAACGG CATGATGGCC AACAAGCTGC AATTCGGCGG CATGGGTGAC TATCCGCTGG TGGTGAACGG CTTCACCTTC CAGAGCAACC CCGAGAGCAA GAGCCGCCTG ATCGCGGTCG CGGCCTACAG CCTCGACGGC TCCGGCAACG GCCTGGTGGT TCACAAGGAC TCCCCGTACT ATCAGCTGTC CGATCTCAAG GGCAAATTGG TGAGCGTGCC GTTCGGCTCC GCCGCGCACG GCATGATCCT AAAGGCGATG CAGGATCGCG GCTGGCCCGC GGACTATTGG CAACTGGTGA GCCAGAGCCC GGAAGTCGGC TCGACCAATC TCCAGGAGAA GAAGATCGAC GCCCACGCCG ATTTCGTCCC GTTCGCCGAA CTACTGCCGT TCCGCGGTTT CGCCCGCAAG ATCTTCGACG GCGTCGAGAC CAATCTGCCG ACCTGGCACG GCGTGGTGGT GCGTACCGAC TTCGCCGAGA AATATCCCGA AGTCGTGGTC GCCTATGTCA AGGCGATCAT CGCGGCCAAT GCCTGGCTGC GCGCCGATCC GAAGCTCGCC GCCGAAAAGA TCCAGGAATG GACCGGCATC AACAAGGAAG TGGTCTACAT CTTCCTGGGA CCGGGCGGCA ACATGACCAC CGATCCGACG ATCAAGCCGC AGCTGATCGA GGCCGCCGCG GTCGACGTCA AGGTGCTGCA GAATCTCGGC CGCATGAAGG AATTCGATCC GAAGAGCTGG GTCGACGACA GCTACATCCG CAAGGCCTAT GCCGAACTGA AGCTCGACTA CGACGCCGAG CTGAAGAGCA CCAAGAACTA CGAAATCAGC GGCGAGGACA AATTCTGCAA GAAGCCGATC ACCGAGCCGC GCAAGGCCGG CGAGGTCTGG GTCGACGGCG ACGGCATCGA GCCGTTCAGC AGCGCGGCCT GCACGCTCGC CGCTTATGCG GACATCAAGG CCAAGGGCAA GAAGATCAAC ATGGCCTATG TGTTCGACTC CGCCCGGGGC ATCAAGCTGT TCGCCGACCA GGCCTACTAC ACGGTCGGCG CCGACAAGGC GCAGTTGTCG CCGTTCCTGC TCAAGAAGGA CGCCGAAGCG CATGCCGCCA AGATCAACGG CAAGGTGCTG AATTTCGACG AAGCCCTCAA GTCAGCGGTC AGCGGAGGTT GA
|
Protein sequence | MVRTSGHRLI FAAFTAAMMA STPVAAQTTV TVGIGTQDTT TNTATTGVVI RQLKLLEKYL PKDGKYANVK FEFDWQNFTS GPPVTNGMMA NKLQFGGMGD YPLVVNGFTF QSNPESKSRL IAVAAYSLDG SGNGLVVHKD SPYYQLSDLK GKLVSVPFGS AAHGMILKAM QDRGWPADYW QLVSQSPEVG STNLQEKKID AHADFVPFAE LLPFRGFARK IFDGVETNLP TWHGVVVRTD FAEKYPEVVV AYVKAIIAAN AWLRADPKLA AEKIQEWTGI NKEVVYIFLG PGGNMTTDPT IKPQLIEAAA VDVKVLQNLG RMKEFDPKSW VDDSYIRKAY AELKLDYDAE LKSTKNYEIS GEDKFCKKPI TEPRKAGEVW VDGDGIEPFS SAACTLAAYA DIKAKGKKIN MAYVFDSARG IKLFADQAYY TVGADKAQLS PFLLKKDAEA HAAKINGKVL NFDEALKSAV SGG
|
| |