Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_1152 |
Symbol | |
ID | 4021628 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 1309995 |
End bp | 1311611 |
Gene Length | 1617 bp |
Protein Length | 538 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637961344 |
Product | extracellular solute-binding protein |
Protein accession | YP_568291 |
Protein GI | 91975632 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.296162 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCTTGT CGCGCTTCGA GATCAACCGC CGGACCGTCC TGCTGACGTC GGCCGCCATC GCCGCCAATG TACTCAATCC GATGCGGGCG TTCGCGCAGG AGACGCCGCG CAAGGGCGGG GTGTTCAACG TGCATTACGG CGCGGAGCAA CGCCAGCTCA ACCCCAGCTT GCAGGCATCG ACCGGCGTGT ACATCATCGG CGGCAAGATC CAGGAGCCGC TGGTCGATCT CGACGCCGCC GGCAATCCGG TCGGCGTGCT GGCGGAGAGC TGGGAATCGA CGCCGGACGG CAAGACGATC ACTTTCAAGC TGCGCAAGGG CGTCGTCTGG CACGACGGCA AGCCGTTCAC CTCCGAGGAC GTCGCCTTCA CCGCGCTGAA CATGTGGAAG AAGATCCTCA ACTACGGATC GACGCTGCAG CTGTTCCTCA CCGCGGTCGA CACCCCCGAT CCGCAGACTG CGATCTTCCG TTACGAGCGG CCGATGCCGC TCAATTTGCT GCTGCGCGCG CTGCCGGACC TCGGTTACGT CTCGCCCAAG CACATCTACG AGACCGGCGA CATCCGCCAG AACCCGGTCA ATCTCGCGCC GATCGGCACC GGCCCGTTCA AGTTCAACAA ATACGAGCGC GGCCAGTACA TCATCGCCGA CCGCAACGAC AATTACTGGC GGCCGAATGC GCCCTATCTC GACCGCATCG TCTGGCGGGT AATCACCGAC CGCGCCGCGG CGGCGGCGCA GCTCGAAGCC GGCAGCCTGC ATCTCAGCCC GTTCTCGGGC CTGACGATTT CCGACATGGC GCGGCTCGGC AAGGACAAGC GCTTCATCGT CTCGACCAAG GGCAACGAGG GCAACGCCCG CACCAACACG CTGGAGTTCA ACTTCCGCCG CAAGGAGCTG TCGGACATCC GCGTCCGCCA GGCGATCGCG CACGCGATCA ACGTGCCGTT CTTCATCGAG AACTTCCTTG GCGACTTCGC CAGGCTCGGC ACCGGGCCGA TCCCCTCGAC CTCGGCCGAT TTCTATCCCG GCCCGAACAC GCCGCAATAC GCTTACGACA AGAAGAAGGC GATCGCGCTG CTCGACGAGG CCGGGCTGAA GCCCGCCGGC GGCGGCACCC GCCTCTCGCT GCGGCTGTTG CCGGCGCCGT GGGGCGAGGA CATCTCGCTG TGGGCGACCT TCATCCAGCA ATCCCTGTCG GAGATCGGCG TCCAGGTCGA GATCGTGCGC AACGATGGCG GCGGCTTCCT CAAGCAGGTC TATGACGAAC ACGCCTTCGA CCTCGCCACC GGCTGGCACC AGTATCGCAA CGATCCCGCG GTCTCGACCA CGGTGTGGTA TCGCTCCGGC CAGCCCAAGG GCGCGCCGTG GACTAATCAG TGGGGCTGGG AAGACGCCAC CACCGACAAG ATCATCGATA ACGCCGCCAC CGAGGTCGAT CCCGTCAAGC GCAAGGCGCT GTATGCCGAT TTCGTCACCC GCGCCAACAC CGAACTGCCG ATCTGGATGC CGATCGAGCA ATTGTTCGTC ACGGTGATCT CCGCCAAGGC GCGCAATCAC TCCAACAATC CGCGCTGGGC GTCATCGACC TGGCATGATC TTTGGCTGGC CGAATAG
|
Protein sequence | MALSRFEINR RTVLLTSAAI AANVLNPMRA FAQETPRKGG VFNVHYGAEQ RQLNPSLQAS TGVYIIGGKI QEPLVDLDAA GNPVGVLAES WESTPDGKTI TFKLRKGVVW HDGKPFTSED VAFTALNMWK KILNYGSTLQ LFLTAVDTPD PQTAIFRYER PMPLNLLLRA LPDLGYVSPK HIYETGDIRQ NPVNLAPIGT GPFKFNKYER GQYIIADRND NYWRPNAPYL DRIVWRVITD RAAAAAQLEA GSLHLSPFSG LTISDMARLG KDKRFIVSTK GNEGNARTNT LEFNFRRKEL SDIRVRQAIA HAINVPFFIE NFLGDFARLG TGPIPSTSAD FYPGPNTPQY AYDKKKAIAL LDEAGLKPAG GGTRLSLRLL PAPWGEDISL WATFIQQSLS EIGVQVEIVR NDGGGFLKQV YDEHAFDLAT GWHQYRNDPA VSTTVWYRSG QPKGAPWTNQ WGWEDATTDK IIDNAATEVD PVKRKALYAD FVTRANTELP IWMPIEQLFV TVISAKARNH SNNPRWASST WHDLWLAE
|
| |