Gene Rpal_5294 details

Gene Information

Locus tag: Rpal_5294
Symbol:
ID: 6412995
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 5711582
End bp: 5712802
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 63%
IMG OID: 642715183
Product: putative ABC-type branched-chain amino acid transporter, periplasmic binding protein
Protein accession: YP_001994255
Protein GI: 192293650
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
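
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon while the protein length does not; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 5711582
END_BP = 5712802


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1221
print(protein_length(length_bp))  # 406
```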


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
TTGTCCATTC GCACCTCGCT CCTCCTGGCC AGCGCCGTCG CGCTCGCTGC AGCTCAGCCC 
GCGTTCGCCC AGAAGCAATA CGGCCCCGGC GTCACCGACA CCGAGATCAA GATCGGCCAG
ACCATGCCGT ATAGCGGCCC GGCTTCGGCC TATGGCGTGC AGGGCCACGT CGAAGACGCG
TACTACACGA TGGTCAACGC CAAAGGCGGC GTGAACGGCC GCAAGATCAA GCTGATCAGC
CTCGACGACG CCTATTCGCC ACCCAAGACG GTCGAGCAGA CCCGCAAGCT GGTCGAACAG
GAGGAGGTGC TGGCGATCAT CGGCACCATC GGCACCCCGA CCAACACGGC GATCCAGAAG
TATCTCAACG CCAAGAAAGT GCCGCAGATC TTCATCTCGA CCGGCGCGGC GAAGTGGGAC
GATCCGAAGA ACTTTCCGTG GACCACGCAG CTCTACCCGC CCTACCAGAT GGAGGGCATG
ATTTTCGCCA AGTACCTGCT GAAGAACAAA CCGGACGCCA AGCTCGGCGT ATTCTCGCAG
AATGACGACG CCGGCAAGGA CTACGTCAAA GGCCTGAAGG ACGGGCTCGG CGACAAGGCC
AAGACCATGA TCGTCAAGGA AGTGACCTAT GAGGTCACCG ATCCGACGGT CGACTCGCAG
ATCGTGGCGC TGAAGGCGTC CGGCGCCGAC ACGCTGTTCA CGATGGCGAC GCCGAAGTTC
GGCGCGCAGG CGATCCGCAA GGTGCACGAA CTGAACTGGA AGCCGCTGAA CTTCGTCGTC
AGCGTGTCGA GCTCGATCAA GGGTGTGCTC GAGCCCGCCG GCAAGGAAGC CTCGACCGGC
CTCCTGACCG CACTGGCGGC GAAGACGCCG ACCGACCCGC GGTTCGAGAA CGATGCCGAC
GTCAAGGAGT TCAAGGACTT CCTCGCCAAG TGGTATCCGA AAGGCGACAT CGCCGACGGC
TCCACCGTCA CCGGCTACAT CTCGGCCTAT ATGACCGTGA AGGTGCTGGA AGCCTGCGGC
GACAACCTGA CCCGCGACAA CCTGCTGAAG CAGGCGACCA ACATCAAGCC GACCGCGGCG
CCGCTGCTGC TGCCGGGCAT CAAGATCTCG ACCCGCCCCG ACCGCTACGC GCCCTACACC
CAGATGCAGA TCGCGCGGTT CGACGGAAAG AGCTGGGTGC CGGAAGGCGA AGTGTTCAAC
ACCGATGCGG CGAGCCAATA A
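
A GC content figure like the one listed in the Gene Information fields can be recomputed from the gene sequence. The sketch below is a minimal illustration: `gc_percent` is a hypothetical helper, it ignores the spaces used to group the bases, and whether it reproduces the listed 63% exactly depends on how the source database rounds.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters in seq."""
    bases = [b for b in seq.upper() if b in "ACGT"]  # skips spaces and newlines
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence above; paste the full sequence to check the gene.
first_row = "TTGTCCATTC GCACCTCGCT CCTCCTGGCC AGCGCCGTCG CGCTCGCTGC AGCTCAGCCC"
print(round(gc_percent(first_row), 1))
```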
 
Protein sequence
MSIRTSLLLA SAVALAAAQP AFAQKQYGPG VTDTEIKIGQ TMPYSGPASA YGVQGHVEDA 
YYTMVNAKGG VNGRKIKLIS LDDAYSPPKT VEQTRKLVEQ EEVLAIIGTI GTPTNTAIQK
YLNAKKVPQI FISTGAAKWD DPKNFPWTTQ LYPPYQMEGM IFAKYLLKNK PDAKLGVFSQ
NDDAGKDYVK GLKDGLGDKA KTMIVKEVTY EVTDPTVDSQ IVALKASGAD TLFTMATPKF
GAQAIRKVHE LNWKPLNFVV SVSSSIKGVL EPAGKEASTG LLTALAAKTP TDPRFENDAD
VKEFKDFLAK WYPKGDIADG STVTGYISAY MTVKVLEACG DNLTRDNLLK QATNIKPTAA
PLLLPGIKIS TRPDRYAPYT QMQIARFDGK SWVPEGEVFN TDAASQ
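
The protein sequence corresponds to translating the gene sequence under translation table 11 (bacterial, archaeal and plant plastid code). The sketch below is a minimal pure-Python illustration, not the pipeline used by the source database: `translate_cds` is a hypothetical helper, and it assumes the first codon is an initiation codon, so the leading TTG is read as methionine rather than leucine.

```python
from itertools import product

# Table 11 shares the standard amino-acid assignments; codons are listed
# in TCAG order with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons permitted by table 11; each is translated as M at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str) -> str:
    """Translate a complete CDS, stopping at the first stop codon."""
    dna = "".join(seq.upper().split())  # drop the spaces between 10-base groups
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGTCCATTC GCACCTCGCT CCTCCTGGCC"))  # MSIRTSLLLA
```

Passing the full gene sequence to `translate_cds` is one way to compare the result against the listed protein sequence.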