Gene Rpal_1850 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_1850 
Symbol 
ID6409509 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp1987372 
End bp1988580 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content64% 
IMG OID642711738 
Productputative ABC transporter, substrate binding protein 
Protein accessionYP_001990851 
Protein GI192290246 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.164156 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCGCCA GATCGATCGC GGCGGCGATG ATGACGGCTG CGCTGCTGTC ATCGTCCATC 
GCCCGCGCGC AAGTCTCCGA CGACGTCGTC AAGCTCGGCG TCCTCACCGA CATGAACGGC
CCCGCCTCGA CGCCCACCGG CCAGGGCTCC GTCACCGCCG CGCAGATGGC GGTGGACGAT
TTCGGCGGCA CCGTGCTCGG CAAGCCGATC CAGCTGATCG TCGGCGATCA TCAGCTCAAG
CCCGATATTG GCGCCACGCT GGCGCGGCGC TGGTACGACG TCGAGCAGGT CGACCTGATC
CTCGACGTGC CGGTCTCCGC CGTCGGCCTC GCGGTGCAGA ACATCGCCAA TGAAAAGAAG
CGGCTGTTCA TCACCCAGTC GACCGGCGCC GCAGATTTTC ACGGCAAGTT CTGCAGCCCT
TACGCAATGC AATGGGTATT CGACACGCAT GCCCTGGCAG TCGGCACCGC GCAGGAGGTG
GTGAAGCGCG GCGGCGATAC CTGGTTCTTC ATCACCGATG ACTATGCCTT CGGCCAGTCG
CTGGAGAAGG ACGCCGCCGC GATGGTCACC AAGACCGGCG GCAAGGTGCT CGGCTCGGTG
CGACCGCCGT TCGCGACCTC GGACGTGTCG TCGTTCGTGC TACAGGCGCA GGCCTCGAAG
GCCAAGATCA TCGGTATTGC GGCCGGTCCG CCCAACAACG TCAACGAGAT CAAGACCGGC
GGCGAATTCG GCATCTTCAA GGGTGGCCAG CAGATGGCGG CACTGCTGGC GCTGATCACC
GACGTGCATT CGCTCGGCCT CCCCACCGCG CAGGGCCTGC TGCTGACGAC GTCGTTCTAT
TGGGACATGG ACGACAAGAC CCGGGAATGG TCGAAGCGCT ATTTCGCCAA GATGAACCGG
ATGCCGACGA TGTGGCAGGC CGGCGTGTAT TCGGCGACCA TGCATTACCT GCAGGCGATC
AAGGATGCCG GTACCGACGA TCCCCTGAAG GTCGCGGCCA AGATGCGGGA GAAACCGGTC
AACGACTTCT TTGCCCGCGG CGGCAAACTG CGCGAGGATG GACTGATGGT CCACGACCTG
ATGCTGGTGC AGGTGAAGAC CCCGGAGGAG TCGAAATATC CGTGGGACTA CTATAAGATT
CTGGCCCACA CCCCCGGCGA TGCCGCCTTC GGCCCGCCCG ATCCGGCATG TTCGCTGGTG
AAGAAGTAA
 
Protein sequence
MIARSIAAAM MTAALLSSSI ARAQVSDDVV KLGVLTDMNG PASTPTGQGS VTAAQMAVDD 
FGGTVLGKPI QLIVGDHQLK PDIGATLARR WYDVEQVDLI LDVPVSAVGL AVQNIANEKK
RLFITQSTGA ADFHGKFCSP YAMQWVFDTH ALAVGTAQEV VKRGGDTWFF ITDDYAFGQS
LEKDAAAMVT KTGGKVLGSV RPPFATSDVS SFVLQAQASK AKIIGIAAGP PNNVNEIKTG
GEFGIFKGGQ QMAALLALIT DVHSLGLPTA QGLLLTTSFY WDMDDKTREW SKRYFAKMNR
MPTMWQAGVY SATMHYLQAI KDAGTDDPLK VAAKMREKPV NDFFARGGKL REDGLMVHDL
MLVQVKTPEE SKYPWDYYKI LAHTPGDAAF GPPDPACSLV KK