Gene Rpal_3503 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_3503 
Symbol 
ID6411177 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp3751039 
End bp3752265 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content63% 
IMG OID642713382 
Productputative ABC transporter, substrate binding protein 
Protein accessionYP_001992479 
Protein GI192291874 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.10294 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAGGAG TGTTCTCGCA CGCCATCGCT GCCGCGTTGG TGAGTGCGGC GTTGATCCTG 
CCGGCGTCGG CCCAGTCCGG CGACAAGACC GCCAAGATCG GCGTACTTAA CGACATGTCG
AGCCTGTACG CCGACATCGG CGGACCGAAC TCGGTCGTCG CCGCCAAGAT GGCGATCGCC
GATTCCGGGC TCGAAGCAAA AGGCTGGAAG ATCGAACTGG TCTCCGGCGA TCACCAGAAC
AAACCGGACA TCGGCGTCAA TCTGGCGCGG CAGTGGATCG ATGTCGACAA GGTCGACCTG
ATCACCGACA CGCCGAACTC CGGCGTCGCG CTGGCGATCA GCAATCTGGT CAAAGAGAAG
AACAGCATCC TGATGAATTC AGGAGGCGCC AGCGCCGATC TGACCGGCAA GGCGTGCAAC
GCCAACACCA TCTCGATGAC TTACGACACC TACATGCTGG CGCACGGCAC TGGTCAGGCC
CTGACCAAGG CCGGCGGTGA TACTTGGTTC TTCCTCACCG CCGACTACGC GTTCGGCGCC
GCGCTCGAGC GCGACACCAC CGCGGTCGTC AAGGCCAATG GCGGCAAGGT CATCGGCAGC
GTCAAGCATC CGCTGAATAC ACCGGACTTC TCGTCGTTCC TGCTGCAGGC GCAGGCGTCG
AAGGCCAAGG TGATCGGCCT CGCCAATGCC GGCGGCGACA CCACCAACTC GATCAAGCAG
GCCGCCGAGT TCGGCATCAC CGCGGGCGGC CAGAAGCTGG CCGCGCTGCT GCTGTTCATC
AACGACGTGC ACTCGCTCGG GCTGAAGACG GCTCAAGGCC TGACCTTTAC CGAATCCTAC
TATTGGGACC TCAACGACAA CACGCGCGCG TTCGCGGACC GCTTCCAGAA GCAGGCTAAG
AACAACGCCA AGCCGTCGAT GACCCAGGCC GGCGTGTACG CCGCGGTGCT GCACTATCTG
AAGACTCTTG AAGCGATGGG CGGCAATCCG CATGACGGCG CCAAGGTCGT CGCCAAGATG
AAGGAGATCC CGGCGGACGA TCTGCCGTTC GGCAAGTCGG TGATCCGCGC CGATGGACGT
CGCTTGGTGC CGGCGTTCCT GTTCGAAGTG AAGTCGCCGG CCGAATCCAA GGGCCCGTGG
GACTACTACA AGAAGATCGC CGACATCTCC GCCGAAGACG CTGCGCGTCC GCTGGCGGAC
AGCGAGTGCC CGCTGATCAA GAAGTAA
 
Protein sequence
MRGVFSHAIA AALVSAALIL PASAQSGDKT AKIGVLNDMS SLYADIGGPN SVVAAKMAIA 
DSGLEAKGWK IELVSGDHQN KPDIGVNLAR QWIDVDKVDL ITDTPNSGVA LAISNLVKEK
NSILMNSGGA SADLTGKACN ANTISMTYDT YMLAHGTGQA LTKAGGDTWF FLTADYAFGA
ALERDTTAVV KANGGKVIGS VKHPLNTPDF SSFLLQAQAS KAKVIGLANA GGDTTNSIKQ
AAEFGITAGG QKLAALLLFI NDVHSLGLKT AQGLTFTESY YWDLNDNTRA FADRFQKQAK
NNAKPSMTQA GVYAAVLHYL KTLEAMGGNP HDGAKVVAKM KEIPADDLPF GKSVIRADGR
RLVPAFLFEV KSPAESKGPW DYYKKIADIS AEDAARPLAD SECPLIKK