Gene RPB_3010 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3010 
Symbol 
ID3910809 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3432990 
End bp3434327 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content63% 
IMG OID637884916 
ProductSodium:dicarboxylate symporter 
Protein accessionYP_486623 
Protein GI86750127 
COG category[C] Energy production and conversion 
COG ID[COG1301] Na+/H+-dicarboxylate symporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0666603 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.647042 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGACGA TGACCGATGT CGGCGTTCCT GAAACGTCGC GACCGTCCAA CGCCAAGCCT 
TGGTACAAGG TGCTCTATAT CCAGGTCTTG ATCGCGATCG TGCTCGGCGT GCTGGTCGGC
TGGCTGTCTC CGCATCTGGC GACCAATCCG TGGATCAAGG CGCTCGGCGA CGGATTCGTC
AAACTGATCA AGATGGTGAT AGCGCCGATC ATCTTCTGCA CGGTCGTCTC CGGCATCGCG
CATATCCAGG ACGCCCGCAA GGTCGGCCGG GTGGGCATCA AGGCACTGGT GTATTTCGAA
GTGGTGTCGT CGTTCGCGCT GATCCTCGGT CTCGTCGTCG GCAATCTTCT GCCGGTCGGG
CATGGGCTCG CAGCCAAGCC GGACGCCGGA GCCGTGGCGA AGTACGTCGA CCAGGCCAGC
CACATGCACG CGGTCGACTT CTTTCTCAAC ATCATTCCCG AGAGCGTCGT CGGCGCGTTC
GCGAAGGGCG ACATCCTGCA GGTGCTGCTG TTCGCCATCC TGTTCGGCTT CGCGCTGATG
GCGCTCGGTG AGCGCGGGCA TCGGCTGCGC GACGTGATCG ACGACACCGC TCATGCGGTG
TTCGGCGTGA TCGCGATCGT GATGAAGGCC GCGCCGGTCG GTGCCTTCGG CGCGATGGCC
TTCACCATCG GCAAATACGG CCCGGCCGCG CTCGGCAATC TGATCGGCCT GGTCGCGCTG
TTCTATGCGA CCGCGGCGTT GTTCGTGTTC GTGGTGCTGG GGGTGATCGC CAAATTCGTC
GGCTTCAACA TCTTCAAGTT CCTCGGCTAC ATCAAGGACG AGCTGTTGAT CGTGCTCGGC
ACCTCGTCGT CCGAGAGCGC GCTGCCGCAA CTGATGGAGA AGCTCGAGCG GCTGGGCTGC
TCGAAGTCGG TTGTGGGCCT GGTGGTGCCG ACCGGATACT CGTTCAATCT CGACGGCACC
AACATCTACA TGACGCTGGC GACGCTGTTC ATCGCGCAGG CGCTCGGCAT CGAGCTGTCG
TTCTCCGAAC AGGTCACGAT CCTGCTGGTT GCGATGCTGA CCTCGAAGGG CGCCAGCGGC
GTCACCGGCG CTGGTTTCGT CACGCTGGCG GGGACGCTCG CCGCGGTCAA TCCGGCTCTG
GTGCCGGGCA TGGCGATCGT ATTCTCGATC GACAAGTTCA TGAGCGAGGT GCGCGCGCTC
ACCAACATCA CCGGCAACGG CGTCGCCACC GTGTTCGTGT CGTGGTGGGA GGGCGAGCTC
GACCACGATC GGCTGCACGC CAATCTCGAC AAGACGATCG ACCCGTCGGA CGTCGAGACT
GCGGTCACCA CCGGCTGA
 
Protein sequence
MSTMTDVGVP ETSRPSNAKP WYKVLYIQVL IAIVLGVLVG WLSPHLATNP WIKALGDGFV 
KLIKMVIAPI IFCTVVSGIA HIQDARKVGR VGIKALVYFE VVSSFALILG LVVGNLLPVG
HGLAAKPDAG AVAKYVDQAS HMHAVDFFLN IIPESVVGAF AKGDILQVLL FAILFGFALM
ALGERGHRLR DVIDDTAHAV FGVIAIVMKA APVGAFGAMA FTIGKYGPAA LGNLIGLVAL
FYATAALFVF VVLGVIAKFV GFNIFKFLGY IKDELLIVLG TSSSESALPQ LMEKLERLGC
SKSVVGLVVP TGYSFNLDGT NIYMTLATLF IAQALGIELS FSEQVTILLV AMLTSKGASG
VTGAGFVTLA GTLAAVNPAL VPGMAIVFSI DKFMSEVRAL TNITGNGVAT VFVSWWEGEL
DHDRLHANLD KTIDPSDVET AVTTG