Gene Rpal_4210 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4210 
Symbol 
ID6411894 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp4513189 
End bp4514826 
Gene Length1638 bp 
Protein Length545 aa 
Translation table11 
GC content65% 
IMG OID642714092 
ProductABC transporter related 
Protein accessionYP_001993181 
Protein GI192292576 
COG category[R] General function prediction only 
COG ID[COG4172] ABC-type uncharacterized transport system, duplicated ATPase component 
TIGRFAM ID[TIGR02323] phosphonate C-P lyase system protein PhnK 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.355372 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGCGA TCAACCAGCC TCTCCTGGAC GTCTCCGACC TGTCGGTGGC GTTTCACCAG 
CCCAGCGGGA CGACGACTGC CGTCGATCAT GTGTCGTTCC GGATCAAGCG CGGCGAGTGC
GTCGCACTGG TCGGCGAATC CGGCTCGGGC AAGTCGGTCA GCGCGCTGTC GATCCTGAAG
CTGCTGCCGT ATCCGAGCGC GTCGCATCCG TCCGGCCATA TCCGGTTCAA GGGGCACGAA
CTGCTCGGGA TGTCCGAGCG CGAGATCCGC GGCATTCGTG GCAACGAGAT CTCGATCGTG
TTTCAGGAGC CGATGACCTC GCTCAATCCG CTGCACACGA TCGAGGCCCA GATCGGCGAA
ATCCTGCAGT TGCACGGCGG CGTGCGCGGC GCCAAGGCAC GGGCGCGGAT CATCGAGCTG
CTGACCCAGG TTGGCATCCC CGAGCCGGAG ACACGTCTCG CCAGCTATCC GCATCAATTG
TCCGGCGGGC AGCGCCAGCG CGTGATGATC GCGATGGCGC TCGCGAACGA GCCGGATCTG
CTGATTGCCG ACGAGCCGAC CACGGCGCTC GACGTCACCG TGCAGGCGCA GATCTTGGCG
CTGCTCGCCG ACATCCGCGC GCGGCTCGGG ATGAGCATGC TGTTCATCAC CCACGATCTC
GGCATCGTCC GTCGTATCGC CGATACGGTG TGCGTGATGC ACACCGGCAA GATCGTCGAG
CAGGGGCCGG TCGAACAAGT ATTCACCGAT CCGCAGCATC CCTACACCAA GGCGCTGCTT
GCCGCCGAGC CGAAGCCCGA CCCGGCGCCG CCGTGTCCCG ATGCGCCGGT GGTGATCTCG
ACGAACGATC TCAAGGTCTG GTTTCCGATC CGCCGTGGCC TGCTGCGCAA GACCGTCGGC
CATATCAAGG CGGTCGACGG CGTGACGCTG GCGATCCGCA AAGGCGAGAC GCTCGGCGTG
GTGGGCGAGT CCGGTTCGGG CAAGACAACG CTGGGACTGG CGCTGCTGCG GTTGATCTCG
TCCGACGGGC CGATCGTGTT TCTCGGCAAG GATGTTCAGG GTCTGAAGTT CAAGCAGATG
CTGCCGTTTC GCCGCGATAT GCAGATCGTG TTTCAAGACC CGTTCGGAGC GCTCAGCCCG
CGCATGTCGG TCGGCGACAT CATTGCCGAG GGGCTGAGCG TGCATCAGCC GCAACTCGGC
GAGTCCGAGC GCGAGGCGCG GGTGATCAAG GCGCTGAAGG ATGTCGGCCT CGATCCGGCG
ACGCGCTTCC GCTATCCGCA CGAGTTCTCC GGCGGCCAGC GCCAGCGGAT TTCGATCGCG
CGCGCGGTGG TGCTGGAGCC GAACTTCGTC GTGCTCGACG AGCCGACCAG CGCGCTCGAC
ATGCTGTTTC AAGCCCAGAT GGTCGATCTG CTTCGCGAAC TGCAGCGCAA GCGCGACCTG
ACCTACATGT TCATCTCCCA CGATCTGCGG GTGGTCGCCT CGCTCGCCAG TCATTTGATC
GTCATGAAAC AAGGCAAAGT GGTCGAGGAA GGCCCCGCAG GCGAGCTATT CAAGTCTCCG
AAGACCGATT ATACGCGGGC GCTGTTTGCC GCTGCGTTCC GGCTCGAGAC CGCGCCCGGC
GGCGCTGCGG CTCAATAG
 
Protein sequence
MDAINQPLLD VSDLSVAFHQ PSGTTTAVDH VSFRIKRGEC VALVGESGSG KSVSALSILK 
LLPYPSASHP SGHIRFKGHE LLGMSEREIR GIRGNEISIV FQEPMTSLNP LHTIEAQIGE
ILQLHGGVRG AKARARIIEL LTQVGIPEPE TRLASYPHQL SGGQRQRVMI AMALANEPDL
LIADEPTTAL DVTVQAQILA LLADIRARLG MSMLFITHDL GIVRRIADTV CVMHTGKIVE
QGPVEQVFTD PQHPYTKALL AAEPKPDPAP PCPDAPVVIS TNDLKVWFPI RRGLLRKTVG
HIKAVDGVTL AIRKGETLGV VGESGSGKTT LGLALLRLIS SDGPIVFLGK DVQGLKFKQM
LPFRRDMQIV FQDPFGALSP RMSVGDIIAE GLSVHQPQLG ESEREARVIK ALKDVGLDPA
TRFRYPHEFS GGQRQRISIA RAVVLEPNFV VLDEPTSALD MLFQAQMVDL LRELQRKRDL
TYMFISHDLR VVASLASHLI VMKQGKVVEE GPAGELFKSP KTDYTRALFA AAFRLETAPG
GAAAQ