Gene Rpal_4650 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4650 
Symbol 
ID6412336 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp5012356 
End bp5013558 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content61% 
IMG OID642714529 
Productputative ABC transporter (substrate-binding protein); putative branched-chain amino acid transporter 
Protein accessionYP_001993616 
Protein GI192293011 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCATGA CGACCGCGCG GGCATTGCTC GCCGTATCAC TCGGTCTGAT CGGCACGGCC 
GCATCGGCCG AGGATCAACC GGGGATCACC CAGACCGAAA TCCGCATCGG GCAGACCATG
CCTTATAGCG GGCCGGTTTC GGCATTCGGG ATTCTCGGCA AGGGCGAACT CGCTTACTTC
AAGATGGTCA ATGATCGCGG CGGCATCAAC GGCCGCAAGA TCAACCTGAT CTCGCTCGAC
GACGGCTACG TGCCGCCGAA GACGGTGGAG CAGACCAGAC GACTGGTGGA AAGCGACGAA
GTCTCGTTCA TCTTCTCCAC CATGGGCACC GCGCACAACA CCGCGATCGC CAAATATCTG
CAAAACAAGA AGGTGCCGCA GCTGTTCGTC GCTTCCGGCG CCTCCAAATT CGGCGACATC
TCGCAGTACC CGCTCGCCAT CATGGGCATC ATGGCGCCGT TCCGCAACGA AGCGAGAATG
TACGCCCGCT ACGCCCTGGA GAAGAAGCCG GACGCCACCT TTGCGGTGAT CGCACAGAAC
GACGATTTCG GCCGCGACTA TCTTGCCGGG CTGCGCGACG TGCTCGGCGA GCGCTACGAC
AAGGCGGTGA CCGCAAGCAT GTACGAAGTC ACCGACCCGA CCATCGACTC GCAGATCGTC
AGCCTGAAAG CCAGCGGCGC CGATGCGCTG ATCATCGCCG CGACACCAAA GTTCGCCGCG
CAGGCGATCC GCAAGACGTT CGAGATCGGC TGGAAGCCGA TGAGATTCCT GTCCAACGTC
TCGGTGTGGA TGTCGTCGGT GATGGAGCCG GCCGGCGTCG ATGCCGGCGT CGGCATCATC
TCGACTGCCT ACGTCAAAGA TCCGCTCGAT CCCGCCTGGG CCAACGATCC CGGCGTGAAG
GATTGGCGAG CCTACATGCA GAAGTACATC CCGGACGGAG ACTTGCGCGA TTCCAACTAC
GTCAACGGCT ACAACAACGG CATGGTTCTC GAACATGTGC TGAAGGCGGC CGGCAACGAT
CTCAGCCGCG ACAACATCAT GAAGCAGGCG CTCTCGATCA AAGAGCTGGA GTTGCCGATG
CTGCTGCCGG GCATCAAGGT TCAGACTGCG GCCGACGACC ACCTTCCGAT CGAGCAGGTC
CAGTTCATGC GCTTCACCGG CAAGCAATGG GAACGGTTCG GAGAGGTGCG CTCGACCAAG
TAA
 
Protein sequence
MRMTTARALL AVSLGLIGTA ASAEDQPGIT QTEIRIGQTM PYSGPVSAFG ILGKGELAYF 
KMVNDRGGIN GRKINLISLD DGYVPPKTVE QTRRLVESDE VSFIFSTMGT AHNTAIAKYL
QNKKVPQLFV ASGASKFGDI SQYPLAIMGI MAPFRNEARM YARYALEKKP DATFAVIAQN
DDFGRDYLAG LRDVLGERYD KAVTASMYEV TDPTIDSQIV SLKASGADAL IIAATPKFAA
QAIRKTFEIG WKPMRFLSNV SVWMSSVMEP AGVDAGVGII STAYVKDPLD PAWANDPGVK
DWRAYMQKYI PDGDLRDSNY VNGYNNGMVL EHVLKAAGND LSRDNIMKQA LSIKELELPM
LLPGIKVQTA ADDHLPIEQV QFMRFTGKQW ERFGEVRSTK