Gene Information |
Locus tag | Rpal_2918 |
Symbol | |
ID | 6410587 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 3185033 |
End bp | 3186649 |
Gene Length | 1617 bp |
Protein Length | 538 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 642712798 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_001991901 |
Protein GI | 192291296 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
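The coordinate and length fields in the table above can be cross-checked against each other. The sketch below is illustrative only. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length counts one terminal stop codon, neither of which the record states explicitly. The function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 3185033
END_BP = 3186649
GENE_LENGTH_BP = 1617
PROTEIN_LENGTH_AA = 538


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive ends."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one (assumed) terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Compare the derived values with the values reported in the table.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold for this record: the interval spans 1617 bp, and 1617 / 3 = 539 codons, one of which is the stop codon, leaving 538 amino acids.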
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.258086 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCCTCT CGCGGATCGA GATTAGCCGC CGCACTGCGC TACTGACGTC GGCCGCGATC GCCGCCAATG TGATCAACCC AATGCGCGCC TTCGCACAGG AGACGCCGCG CAAGGGCGGC GTGTTCAACG TGCACTACGG CGCCGAACAG CGGCAGCTCA ATCCGAGCCT GCAGGCCTCG ACCGGCGTCT ACATCATCGG CGGCAAGATC CAGGAACCGC TGGTCGACCT CGACGCAGCC GGCAACCCGG TCGGCGTGCT GGCGGAAAGC TGGGAATCCA CGCCTGACGG CAAGACCATC ACCTTCAAGC TGCGCAAGGG CGTGACCTGG CACGACGGCA AGCCGTTCAC CTCCGAGGAC GTCGCCTTCA CGGCGATGAA CATGTGGAAG AAGATCCTCA ACTACGGATC GACGCTGCAG CTCTTCCTCA CCACGGTCGA CACGCCCGAT CCGCAGACCG CGATCTTCCG CTACGAGCGG CCGATGCCGC TGAACCTGCT GCTGCGCGCG CTGCCGGATC TCGGCTACGT CTCGGCCAAG CACATCTACG AGACCGGTGA CATCCGCCAA AATCCGGCCA ACCTGGCGCC GATCGGCACC GGCCCGTTCA AGTTCAACAA ATACGAGCGC GGCCAGTACA TCATCGCCGA CCGCAACGAC AATTACTGGC GGCCGAACGC CCCCTATCTC GACCGCATCG TCTGGAAGGT GATCACCGAC CGTGCCGCGG CTGCGGCCCA GCTCGAAGCC GGCGGACTGC AGCTCAGCCC GTTCTCTGGT CTGACGATTT CCGACATGGC TCGGCTCGGC AAGGACAAGC GCTTCATCGT CTCCACCAAG GGCAACGAAG GCAACGCCCG CACCAACACC ATCGAATTCA ACTTCCGCCG CAAGGAGCTG TCGGACATCC GTGTCCGCCG CGCCATCGCG CACGCCATCA ACGTGCCGTT CTTCATCGAG AACTTCCTCG GCGACTTCGC CAAGCTCGGC ACCGGCCCGA TCCCTTCGAC CTCGACGGAC TTCTATCCGG GCCCGAACAC GCCGCAATAT CCCTATGACA AGCAGAAAGC GATCGCCCTG CTCGACGAGG CCGGCCTGAA GCCCGGCGCC GGCGGCAACC GGCTGTCGCT GCGGCTGCTC CCCGCGCCGT GGGGCGAGGA CATCTCGCTT TGGGCGACCT TCATCCAGCA GTCGCTGGGT GAGGTCGGCA TTCAGGTCGA AGTGGTGCGC AACGACGGCG GCGGCTTTCT CAAGCAGGTC TATGACGAGC ACGCGTTCGA TCTCGCCACC GGCTGGCACC AGTATCGCAA CGATCCCGCG GTCTCGACCA CGGTGTGGTA TCGCTCCGGT CAGCCGAAGG GCGCGCCCTG GACCAACCAG TGGGGCTGGG AAGACCCGGC GATCGACAAG ATCATCGACG ACGCCGCGAC CGAAGTCGAT CCCGCCAAGC GCAAGGCGCT GTATGCCGAC TTCGTCACCC GAGCCAATAC CGAGCTGCCG ATCTGGATGC CAATTGAGCA ATTATTCGTC ACGGTGATCA CTGCGAAGGC GCGCAATCAC TCCAATACGC CACGCTGGGC GTCATCGACC TGGCACGATC TTTGGCTGGC CGAATAG
|
Protein sequence | MPLSRIEISR RTALLTSAAI AANVINPMRA FAQETPRKGG VFNVHYGAEQ RQLNPSLQAS TGVYIIGGKI QEPLVDLDAA GNPVGVLAES WESTPDGKTI TFKLRKGVTW HDGKPFTSED VAFTAMNMWK KILNYGSTLQ LFLTTVDTPD PQTAIFRYER PMPLNLLLRA LPDLGYVSAK HIYETGDIRQ NPANLAPIGT GPFKFNKYER GQYIIADRND NYWRPNAPYL DRIVWKVITD RAAAAAQLEA GGLQLSPFSG LTISDMARLG KDKRFIVSTK GNEGNARTNT IEFNFRRKEL SDIRVRRAIA HAINVPFFIE NFLGDFAKLG TGPIPSTSTD FYPGPNTPQY PYDKQKAIAL LDEAGLKPGA GGNRLSLRLL PAPWGEDISL WATFIQQSLG EVGIQVEVVR NDGGGFLKQV YDEHAFDLAT GWHQYRNDPA VSTTVWYRSG QPKGAPWTNQ WGWEDPAIDK IIDDAATEVD PAKRKALYAD FVTRANTELP IWMPIEQLFV TVITAKARNH SNTPRWASST WHDLWLAE
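The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how one might clean such a string and recompute simple properties such as GC content. The helper names are hypothetical, and the start codon set is only a commonly used subset of the initiators that translation table 11 permits.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def looks_like_complete_cds(
    seq: str,
    starts=("ATG", "GTG", "TTG"),   # assumed subset of table 11 start codons
    stops=("TAA", "TAG", "TGA"),    # stop codons in table 11
) -> bool:
    """Rough check: length divisible by 3, known start codon, terminal stop codon."""
    return len(seq) % 3 == 0 and seq[:3] in starts and seq[-3:] in stops


# Toy input for illustration; paste the full Gene sequence field here instead.
toy = clean_sequence("ATG CCC TAG")
print(len(toy), round(gc_percent(toy), 1), looks_like_complete_cds(toy))
```

Applied to the full Gene sequence field, `len` and `gc_percent` can be compared with the Gene Length and GC content values reported in the Gene Information table. The gene sequence as listed begins with ATG and ends with TAG, which suggests it is given in the coding orientation even though the gene lies on the minus strand.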