Gene Rpal_1991 details

Gene Information

Locus tag: Rpal_1991
Symbol:
ID: 6409651
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 2152522
End bp: 2153679
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 64%
IMG OID: 642711877
Product: Extracellular ligand-binding receptor
Protein accession: YP_001990989
Protein GI: 192290384
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
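
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon. The function names are illustrative and not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 2152522
END_BP = 2153679
GENE_LENGTH_BP = 1158
PROTEIN_LENGTH_AA = 385


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 2153679 - 2152522 + 1 gives 1158 bp, and 1158 / 3 - 1 gives 385 aa, which agrees with the listed values.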


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.179885
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCCCAAGT TCAAGCTATC CGCCACGGCG ATCGCCGTGG CCCTCGCGTT GCCGGGGCTT 
TCCGGGGCCG CGCTTGCCGA AACTAACGAA ATCACCATCG GTATCACCGT CACCACCACC
GGTCCGGCGG CGGCACTCGG CATTCCGGAG CGCAATGCTC TAGAATTCGT GGCTAAGGAA
ATCGGCGGTC ATCCGCTCAA GTTGATCGTG CTCGACGACG GCGGCGATCC CACCGCGGCC
ACCACCAATG CGCGGCGTTT CGTCACGGAG TCGAAGGCCG ACGTGATCAT GGGCTCGTCG
GTGACGCCGC CAACCGTGGC GGTCTCCAAC GTCGCCAACG AAGCGCAGGT GCCGCACATC
GCGTTGGCGC CACTGCCGAT CACGCCGGAG CGCGCCAAGT GGTCGGTGGC GATGCCGCAG
CCGATCCCGA TCATGGGCAA GGTGCTCTAC GAGCACATGA AGAAAAACAA CATCAAGACC
GTCGGCTACA TCGGCTATTC GGATTCCTAC GGCGATCTGT GGTTCAACGA CCTGAAGAAG
CAGGGCGAGG CTATGGGTTT GAAGATCGTC GCCGAAGAGC GCTTCGCGCG GCCGGACACG
TCGGTGGCAG GTCAGGTGCT GAAGCTGGTC GCCGCCAATC CGGATGCGAT CCTGGTCGGT
GCGTCCGGCA CCGCGGCAGC GCTGCCGCAG ACCAGTCTGC GCGAGCGCGG TTACAAGGGC
CTGATCTATC AGACCCATGG CGCCGCCTCG ATGGACTTCA TCCGTATCGC CGGCAAGTCG
GCCGAGGGCG TGCTGATGGC GTCGGGCCCG GTGATGGATC CGGAAGGTCA GGACGACAGC
GCGTTGACCA AGAAGCCTGG CCTCGAACTC AACACCGCCT ATGAAGCCAA GTACGGCCCG
AACAGCCGCA GCCAGTTCGC CGCGCATTCG TTCGACGCCT TCAAGGTGCT GGAGCGGGTG
GTGCCGGTGG CGCTGAAGAC CGCCAAGCCG GGCACGCAGG AATTCCGCGA GGCGATCCGC
AAGGCGCTGG TCAGCGAAAA GGACATCGCG GCGAGCCAGG GCGTCTACAG CTTCACTGAA
ACCGATCGCT ACGGCCTCGA CGACCGTTCG CGCATCCTGC TGACGGTGAA GGATGGCAAA
TACGTGATGG TGAAGTAA
 
Protein sequence
MPKFKLSATA IAVALALPGL SGAALAETNE ITIGITVTTT GPAAALGIPE RNALEFVAKE 
IGGHPLKLIV LDDGGDPTAA TTNARRFVTE SKADVIMGSS VTPPTVAVSN VANEAQVPHI
ALAPLPITPE RAKWSVAMPQ PIPIMGKVLY EHMKKNNIKT VGYIGYSDSY GDLWFNDLKK
QGEAMGLKIV AEERFARPDT SVAGQVLKLV AANPDAILVG ASGTAAALPQ TSLRERGYKG
LIYQTHGAAS MDFIRIAGKS AEGVLMASGP VMDPEGQDDS ALTKKPGLEL NTAYEAKYGP
NSRSQFAAHS FDAFKVLERV VPVALKTAKP GTQEFREAIR KALVSEKDIA ASQGVYSFTE
TDRYGLDDRS RILLTVKDGK YVMVK
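
The gene sequence begins with GTG while the protein sequence begins with M. This is consistent with translation table 11, in which GTG is one of the alternative start codons and the initiating codon is read as methionine. The following is a minimal sketch of how the translation and the GC content could be recomputed from the space-separated sequence blocks. The helper names (`clean`, `translate_cds`, `gc_percent`) are hypothetical, and the start-codon set is taken from the NCBI definition of table 11 rather than from this record.

```python
import itertools

# Standard codon-to-residue assignments in NCBI "TCAG" order.
# Table 11 uses the same assignments and differs only in its start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): residue
    for codon, residue in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed for NCBI translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the blanks and line breaks used in the listing; uppercase."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading the start codon as M and stopping at '*'."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError("first codon is not a table 11 start codon")
    protein = ["M"]  # the initiating codon is read as methionine
    for codon in codons[1:]:
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence above.
    print(translate_cds("GTGCCCAAGT TCAAGCTATC CGCCACGGCG"))  # MPKFKLSATA
```

Passing the full gene sequence to `translate_cds` and `gc_percent` would allow the result to be compared with the listed protein sequence and GC content. The sketch only handles the unambiguous bases A, C, G and T.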