Gene Rpal_2001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_2001 
Symbol 
ID6409661 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp2164920 
End bp2166155 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content63% 
IMG OID642711887 
ProductExtracellular ligand-binding receptor 
Protein accessionYP_001990999 
Protein GI192290394 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTTCCC TGTCCCGTTC GATCGCCGCC GTCGCGGCCG TCGCGCTGCT GTCCGCCACC 
GGCGGGCAAG CGCTGGCGCA AAAGAAATAC GGCCCCGGTG CGAGCGACAC CGAAGTCAAG
ATCGGCAACA TCGTGCCGTA TTCCGGCCCC GCCTCGGCCT ATGGCAGCGT CGGCCGGGCG
CAGGAAGCCT ACTTCAAGAT GATCAATGAC AAGGGCGGCA TCAACGGCCG CAAGATCGTC
TACGTGTCAT ATGACGACGC CTATTCGCCG CCGAAGGCCG TCGAGCAGAC CCGCAAGCTG
GTCGAGAGCG ACGAAGTGCT GTTCATGTTC TCGCCGCTCG GAACGCCGTC CAACACCGCG
ATCCAGAAGT ATCTCAACGC CAAGAAGGTG CCGCATCTGT TCCTGGCGTC GGGCGCCACC
AAGTGGAACG ATCCGAAGCA CTTCCCGTGG ACGATGGGCT GGCTGCCGAG CTACCAGAGC
GAAGGTCGAA TCTACGCCAA GTACATCTTG AAGGAGAAGC CGGACGCCAA GATCGCCGTG
CTGTATCAGG GCGACGACTT CGGCAAGGAC TATCTGAAGG GCCTCAAGGA CGGTCTCGGT
GCCAAGGTGT CGCAGGTCGT GATCGAGGAC AGCTACGAGC TGACCGAGCC GACCGTCGAC
TCCCACATCG TCAAGATCAA GGCCGCGAAC CCGGACGTGC TGGTGATCTT CGCCACGCCG
AAGTTCGCCG CGCAGACCAT CAAGAAGGTC GCCGAGCTCG CCTGGAAGCC GATGATGATC
GTGCCAAACG TCTCGGCCTC CACCGGCAGC GTGATGAAGC CGGCCGGCTT CGAAAACGCG
CAGGGCATCG TCTCGGCCGC CTACGCCAAG GACGCCACTG ACAAGCAGTG GGAAAATGAC
CCGGGCATGA AGGCGTATTA CGAGTTCATG GCCAAGTACG CGCCGGACGC CAGCCGCGCA
GACAGCTCGT TCACCACCGG CTACAACATC GCCGAGACCG TGGCGGTGCT GATCAAGCAG
TGCGGCGACG ACCTCAGCCG TGAGAACGTG ATGAAGCAGG CGGCGAACCT CAAGGGCGTG
CAGCTCGGCG GCCTGCTGCC GGGCGTAACG CTGAACACCT CGCCTACCGA CTTCGCGCCG
ATCGAGCAGC TGCAGATGAT GCGGTTCGAA GGCGAAAACT GGAAGCTGTT CGGCGACGTG
ATCGAAGGCG AAGTCGCCGC GCCGAGCGGC GGCTAG
 
Protein sequence
MSSLSRSIAA VAAVALLSAT GGQALAQKKY GPGASDTEVK IGNIVPYSGP ASAYGSVGRA 
QEAYFKMIND KGGINGRKIV YVSYDDAYSP PKAVEQTRKL VESDEVLFMF SPLGTPSNTA
IQKYLNAKKV PHLFLASGAT KWNDPKHFPW TMGWLPSYQS EGRIYAKYIL KEKPDAKIAV
LYQGDDFGKD YLKGLKDGLG AKVSQVVIED SYELTEPTVD SHIVKIKAAN PDVLVIFATP
KFAAQTIKKV AELAWKPMMI VPNVSASTGS VMKPAGFENA QGIVSAAYAK DATDKQWEND
PGMKAYYEFM AKYAPDASRA DSSFTTGYNI AETVAVLIKQ CGDDLSRENV MKQAANLKGV
QLGGLLPGVT LNTSPTDFAP IEQLQMMRFE GENWKLFGDV IEGEVAAPSG G