Gene Rpal_4214 details


Gene Information

Locus tag: Rpal_4214
Symbol:
ID: 6411898
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 4519001
End bp: 4520896
Gene Length: 1896 bp
Protein Length: 631 aa
Translation table: 11
GC content: 62%
IMG OID: 642714096
Product: extracellular solute-binding protein family 5
Protein accession: YP_001993185
Protein GI: 192292580
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4166] ABC-type oligopeptide transport system, periplasmic component
TIGRFAM ID:
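
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function name `cds_lengths` is hypothetical and not part of the record.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a CDS that ends in a stop codon.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon, so it does not contribute an amino acid.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


gene_len, protein_len = cds_lengths(4519001, 4520896)
print(gene_len, protein_len)  # 1896 631
```

Both values agree with the Gene Length and Protein Length fields listed in the record.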


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.159532
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGACGTTGA CCCGACGACA TCTTCTGCAG GGCGGCTTGT TGGCCGCAGC GACCCCGGCT 
TTGAGCTTCA GCCCCGGCCT GTTTGGGGCG AGCGCGGCGC GCGCCGAAAC CGCGGTCGAT
GGGGCGGCAT GGCGCCACGG TCTGTCGCTG TTCGGGGAGC TGAAATATCC GGCCGGCTTC
GCGCAGTTCG ATTATGTGAA TCCGAAAGCA CCGAAGGGCG GTGCCGCGCG GCAGATCGCG
CTAGGCACCT TCGACAATTT CAATCTCGCG GTCGCCGGCG TGAAAGGTAA CATCGCCGGG
CCGGTGGGGT ATCTCTACGA AACCCTGATG ACGCCGTCGC AGGACGAGGT CGGCACCGAA
TACGGCTTGC TCGCCGAGGG CGCTGCGCAT CCCGACGATT TTTCCTGGGT GATCTATCGC
GTGCGCAAGG AAGCGCGCTG GAACGACGGC AAGCCGGTCA CGGCCGACGA CGTCGTCTTT
TCGTTCGATG CGCTGAAGAA ATACAGCCCA CGCTACGCCT CGTATTATCG TCACGTCGTC
AAGGCCGAGA AGGTCGGCGA GCGCGACGTC CGCTTCACCT TCGATGCGCC GGGCAACCGC
GAACTGCCGA CCATCGTCGG CGAATTGATG GTGCTGCCGA AGCATTGGTG GGAGGGCACT
GACGCCCAGG GGCGCAAGCG CGACGTCTCG GCAACAACGC TGGAGCCTCC GCTCGGCTCG
GCGCCCTACA AGATCAAGGA CTTCGTCGCC GGCCGTTCGA TCGTGCTGGA GCGCGTGAAG
GACTATTGGG GCGAGAAGCT GCCGGTGCGG ATCGGCCAGA ACAATTTCGA CGAGCTGCGG
TTCGAGTACT TCCGTGACAA CACCGTCGCA CTGGAGGCCT TCAAGGCCGA CCAGGCCGAC
TGGATCATGG AGAACTCCGC CAAGCAGTGG GCGACTGCCT ACGACTTTCC CGCGGTGAAC
GACAAGCGTG TTGTCAAAGA AGAATTCCCG ATCAACGATT CGGGACGGAT GCAGGCGTTC
GTGCTGAATA CCCGCCGCGA GATGTTCAAG GATCCGCGGG TGCGGCGCGC GTTCAACTAC
GCGTTCGATT TCGAAGAGAT GAACAAGCAG CTGTTCTATG GACAGTACAA GCGGATCGCG
AGCTTCTTCG AAGGCACCGA GCTCGCCTCC AGCGGACTAC CTGAAGGGCA GGAACTGGCG
CTGCTCGAAA CCGTGCGCGA CAAGGTGCCG GCCGAGCTGT TCACGCAGCC CTATACCAAT
CCAGTCGGCG GCAACCCGGA GGCGGTACGC GCCAATCTCC GTGAGGCGAT CAAGCTGGTG
AAAGAGGCTG GCTTCGACAT CAAGGATCGC AAACTGGTCG ATCCGTCCGG CAAGCCGGTC
GCTGTCGAGA TCCTGGTGCA GGACCCGTCG TCGGAGCGGA TTGCGCTGTT CTACAAGCCT
TCGCTGGAGC GGCTCGGCGT CACCGTCTCG ATCCGCGTGG TCGACGACGC GCAGTATCAG
AACCGGATTC GCGCGTTCGA TTTCGACATC ATCACCGACC TGTGGGGCCA GTCGCTGTCG
CCCGGTAATG AACAGCGCGA TTATTGGGGA TCACAGGCGG CCAATGAGCA GGGCTCGCAC
AACACCATCG GCATCAAGAA TCCGGCCGTC GATGAGCTGA TCGAAAAGGT GATCTACGCC
AAGGACCGGC CCTCGTTGAT TGCGGCGACG CGAGCGCTCG ACCGCGTGCT GCTGTGGAAC
TTCTATGTCG TCCCACAATT CACCTACGGC TTCATGCGCT ACGCGCGCTG GGACCGGTTT
GGGCACGCGC CGCTGCCGAA ATACGCTCGC TCTGGTCTGC CGGCGTTGTG GTGGTACGAC
GCCGACAAGG CCGCCAATCT CGGCAAGCGC TCTTGA
 
Protein sequence
MTLTRRHLLQ GGLLAAATPA LSFSPGLFGA SAARAETAVD GAAWRHGLSL FGELKYPAGF 
AQFDYVNPKA PKGGAARQIA LGTFDNFNLA VAGVKGNIAG PVGYLYETLM TPSQDEVGTE
YGLLAEGAAH PDDFSWVIYR VRKEARWNDG KPVTADDVVF SFDALKKYSP RYASYYRHVV
KAEKVGERDV RFTFDAPGNR ELPTIVGELM VLPKHWWEGT DAQGRKRDVS ATTLEPPLGS
APYKIKDFVA GRSIVLERVK DYWGEKLPVR IGQNNFDELR FEYFRDNTVA LEAFKADQAD
WIMENSAKQW ATAYDFPAVN DKRVVKEEFP INDSGRMQAF VLNTRREMFK DPRVRRAFNY
AFDFEEMNKQ LFYGQYKRIA SFFEGTELAS SGLPEGQELA LLETVRDKVP AELFTQPYTN
PVGGNPEAVR ANLREAIKLV KEAGFDIKDR KLVDPSGKPV AVEILVQDPS SERIALFYKP
SLERLGVTVS IRVVDDAQYQ NRIRAFDFDI ITDLWGQSLS PGNEQRDYWG SQAANEQGSH
NTIGIKNPAV DELIEKVIYA KDRPSLIAAT RALDRVLLWN FYVVPQFTYG FMRYARWDRF
GHAPLPKYAR SGLPALWWYD ADKAANLGKR S
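
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch under stated assumptions, not the method used to produce the record: it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, and it assumes the input starts at the annotated start codon, so the first codon (here GTG) is read as methionine. The names `CODON_TABLE` and `translate_cds` are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; table 11 shares these amino acid assignments
# with the standard code and differs only in which codons may initiate.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate_cds(dna):
    """Translate a CDS given as text, ignoring spaces and line breaks."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    protein = [CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3)]
    if protein:
        # Assumption: the sequence begins at the start codon, which is
        # translated as Met even when it is GTG or TTG.
        protein[0] = "M"
    # Drop the trailing stop codon symbol, if present.
    return "".join(protein).rstrip("*")


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGACGTTGA CCCGACGACA TCTTCTGCAG"))  # MTLTRRHLLQ
```

The output matches the first ten residues of the protein sequence listed above.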