Gene Rpal_5300 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_5300 
Symbol 
ID6413001 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp5719391 
End bp5720584 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content63% 
IMG OID642715189 
Productputative periplasmic substrate binding protein 
Protein accessionYP_001994261 
Protein GI192293656 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.401613 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACTCA AACTTCTTTC CGCCGCGCTT GCGGCTACTG CCCTCGTTGC CATCACGCCG 
GCTTCGGCGC AGGGCGTCAA GATCGGCATC CTCAACGACC AGTCCGGCGT GTACGCGGAC
TACGGCGGCA AGTGGTCGCT CGAAGCCGCC AAGATGGCGG TGGAGGATTT CGGCGGCGAA
GTGCTGGGGC AGAAGATCGA ACTCATCTCG GCCGATCATC AGAACAAGCC CGACAACGCG
GTCGCGATCG CGCGCAAGTG GTACGACGTC GACGGCGTCG ACATGATTAC CGAACTCACC
ACCTCGTCGG TGGCGCTGGC GATCCAGGAT CTGTCGAAGG AAAAGAAGAA GATCGATATC
GTGGTCGGCG CCGCGACCTC GCGGATCACC GGCGATGCCT GCCAGCCTTA CGGGTTCCAT
TGGGCGTTCG ATACTCATGC CCTCGCGGTC GGAACCGGCG GCGCCCTGGT GAAGGCCGGC
GGCGACACCT GGTTCTTCCT GACCGCGGAC TACGCGTTCG GCTACGCGCT GGAGAAGGAC
ACCAGTGAGG TCGTGCTAGC CAGCGGCGGC AAGGTGCTCG GCTCGGTACG GCATCCGCTG
AATTCGTCGG ACTTCTCCTC GTTCCTGCTG CAGGCGCAGG CCTCGAAGGC CAAGGTCGTC
GGCCTTGCCA ATGCCGGCCT CGACACCGCC AACTCGATCA AGCAGGCCTC TGAATTCGGC
ATCGTCGCCG GCGGCCAGAA GCTTGCGGGC CTCTTGATGA CGCTGGCCGA AGTGAACGGT
CTCGGCCTCA AGGCGGCGCA GGGCTTGGTG CTGACCGAAG CCTATTATTG GGATCGCGAC
GACAAATCGC GTGAGTTCGC CGAGCGTTTC TTCAAGCGCA CCAGCCGGAT GCCGAGCATG
ATCCAGGCCG GGACCTATTC GGCGACGCTG TCCTATTTGA AGGCCGTCAA GGCCGCCGGA
ACCAAGGACA CCGATGCGGT GGCCAAGAAG CTGAAGGAGC TGCCAGTCGA CGACGCGTTT
GCATCCGGCA AGGTGCTGGC CAACGGCCGC TTCGTCCACG ACATGTATCT GTTCGAGGTG
AAGAAGCCGG AAGAATCGAA GAAGCCGTGG GATTACTACA AGCTGCTCGC CACCGTGCCG
GGCGACAAGG CGTTCCCGAC TGCGGCAGAG AGCGGCTGCC CGCTGACCAA GTAA
 
Protein sequence
MKLKLLSAAL AATALVAITP ASAQGVKIGI LNDQSGVYAD YGGKWSLEAA KMAVEDFGGE 
VLGQKIELIS ADHQNKPDNA VAIARKWYDV DGVDMITELT TSSVALAIQD LSKEKKKIDI
VVGAATSRIT GDACQPYGFH WAFDTHALAV GTGGALVKAG GDTWFFLTAD YAFGYALEKD
TSEVVLASGG KVLGSVRHPL NSSDFSSFLL QAQASKAKVV GLANAGLDTA NSIKQASEFG
IVAGGQKLAG LLMTLAEVNG LGLKAAQGLV LTEAYYWDRD DKSREFAERF FKRTSRMPSM
IQAGTYSATL SYLKAVKAAG TKDTDAVAKK LKELPVDDAF ASGKVLANGR FVHDMYLFEV
KKPEESKKPW DYYKLLATVP GDKAFPTAAE SGCPLTK