Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_1041 |
Symbol | |
ID | 3909165 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 1196131 |
End bp | 1197747 |
Gene Length | 1617 bp |
Protein Length | 538 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637882934 |
Product | extracellular solute-binding protein |
Protein accession | YP_484662 |
Protein GI | 86748166 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCTTT CGCGCTTCGA GATCAACCGC CGGACCGTCC TGCTGACGTC GGCCGCCATC GCCGCCAATG TGATCAATCC GATGCGGGCG TTCGCGCAGG AAACGCCGCG CAAGGGCGGC GTGTTCAACG TTCATTACGG CGCCGAGCAG CGCCAGCTCA ACCCCAGCTT GCAGGCCTCG ACCGGCGTCT ACATCATCGG CGGCAAGATC CAGGAGCCGC TGGTCGATCT CGACGCCGCC GGCAATCCGG TCGGCGTGCT GGCCGAGAGC TGGGACTCGA CGCCCGACGG CAAGACCATC ACCTTCCGGC TGCGCAAGGG CGTCACCTGG CACGACGGCA AGCCGTTCAC CTCCGAGGAC GTCGCCTTCA CCGCGATGAA CATGTGGAAG AAGATCCTCA ATTACGGATC GACGCTGCAG CTGTTCCTGA CCGCGGTCGA CACCCCCGAT CCGCAGACCG CGATCTTCCG CTACGAGCGG CCGATGCCGC TGAACCTGCT GCTGCGCGCG CTGCCCGACC TCGGCTACAT TTCGCCGAAG CACATCTACG AGACCGGCGA CATCCGCCAG AACCCGGTCA ATCTCGCGCC GATCGGCACC GGCCCGTTCA AGTTCAACAA ATACGAGCGC GGCCAGTACA TCATCGCCGA CCGCAACGAC AATTACTGGC GGCCGAACGC GCCCTATCTC GACCGCATCG TCTGGCGGGT GATCACCGAC CGCGCCGCGG CGGCGGCCCA GCTCGAAGCC GGCAGCCTGC ATCTCAGCCC GTTCTCGGGT CTCACGATCT CCGACATGGC GCGGCTCGGC AAGGACAAGC GTTTCGTCGT CTCGACCAAG GGCAACGAGG GCAACGCCCG CACCAACACG CTGGAATTCA ACTTCCGCCG CAAGGAGCTG TCGGACATCC GCGTCCGCAA GGCGATCGCG CACGCGATCA ACGTGCCGTT CTTCATCGAG AACTTCCTCG GCGATTTCGC CAAGCTCGGC ACCGGGCCGA TCCCCTCGAC CTCGACCGAC TTCTATCCGG GCCCGAACAC GCCGCAATAT CCGTACGACA AGAAGAAGGC GATCGCGCTG CTCGACGAGG CCGGGCTGAA GCCGGCCGCC GGCGGCAACC GCCTCACGTT GCGGCTCTTG CCGGCGCCGT GGGGCGAGGA CATCTCGCTG TGGGCGACCT TCATCCAGCA GTCGCTGGGC GAAGTCGGCA TCCAGGTCGA GGTGGTGCGC AACGACGGCG GCGGCTTCCT CAAGCAGGTC TATGACGAAC ACGCCTTCGA CCTCGCCACC GGCTGGCATC AATATCGCAA CGACCCCGCG GTCTCGACCA CGGTGTGGTA TCGCTCCGGC CAGCCCAAGG GCGCGCCCTG GACCAATCAA TGGGGCTGGA AGGACGAGGC GATCGACAAG ATCATCGACG ACGCCGCCAC CGAGGTCGAT CCGGTCAAGC GCAAGGCGCT GTATGCCGAC TTCGTCACCC GCGCCAATGG CGAGCTGCCG CTGTGGATGC CGATTGAGCA ATTGTTCGTC ACGGTGATCA GCGCGAAGGC ACGCAATGCC TCCAATAATC CACGCTGGGC GTCGTCGACC TGGCACGATC TTTGGCTGGC CGAATAG
|
Protein sequence | MALSRFEINR RTVLLTSAAI AANVINPMRA FAQETPRKGG VFNVHYGAEQ RQLNPSLQAS TGVYIIGGKI QEPLVDLDAA GNPVGVLAES WDSTPDGKTI TFRLRKGVTW HDGKPFTSED VAFTAMNMWK KILNYGSTLQ LFLTAVDTPD PQTAIFRYER PMPLNLLLRA LPDLGYISPK HIYETGDIRQ NPVNLAPIGT GPFKFNKYER GQYIIADRND NYWRPNAPYL DRIVWRVITD RAAAAAQLEA GSLHLSPFSG LTISDMARLG KDKRFVVSTK GNEGNARTNT LEFNFRRKEL SDIRVRKAIA HAINVPFFIE NFLGDFAKLG TGPIPSTSTD FYPGPNTPQY PYDKKKAIAL LDEAGLKPAA GGNRLTLRLL PAPWGEDISL WATFIQQSLG EVGIQVEVVR NDGGGFLKQV YDEHAFDLAT GWHQYRNDPA VSTTVWYRSG QPKGAPWTNQ WGWKDEAIDK IIDDAATEVD PVKRKALYAD FVTRANGELP LWMPIEQLFV TVISAKARNA SNNPRWASST WHDLWLAE
|
| |