Gene RPD_2441 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_2441 
Symbol 
ID4022932 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp2719311 
End bp2720648 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content63% 
IMG OID637962634 
Productsodium:dicarboxylate symporter 
Protein accessionYP_569572 
Protein GI91976913 
COG category[C] Energy production and conversion 
COG ID[COG1301] Na+/H+-dicarboxylate symporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.990408 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.705886 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGACGA TGACCGATAT CGGTGCTTCC GGAGCGTTGC ATCGCACCAG ATCCAAGCCC 
TGGTACAAGG TGCTCTATGT CCAGGTTCTG ATCGCGATCG TGCTCGGCGT GCTGCTCGGC
TGGGTGTCGC CGCATCTGGC GACCAACCCG TGGATCAAGG CGCTCGGCGA TGGCTTCGTC
AAGCTGATCA AGATGGTGAT CGCGCCGATC ATCTTCTGCA CCGTCGTCTC CGGCATCGCG
CATATCCAGG ACGCGCGAAA GGTCGGGCGG GTCGGCGTCA AGGCGCTGCT GTATTTCGAA
GTGGTGTCGT CCTTCGCGCT GATCCTCGGC CTCATCGTCG GCAATCTGGT GCCGGTCGGG
CACGGGCTTG CGGCCAAGCC GGACGCCGGC GCGGTGGCGA AATACGTCGA CCAGGCAAGC
CACATGAGCT CGGTCGATTT CTTCCTGAAC ATCATTCCGG AGAGTGTCGT CGGCGCGTTC
GCGAAAGGCG ACATCCTGCA GGTGCTGCTG TTCGCGATCC TGTTCGGCTT CGCACTGATG
GCGCTCGGCG AGCGCGGCCA CCGACTGCGC GACGTGATCG ACGACACCGC CCACGCGGTG
TTCGGCGTGA TCGCGATCGT GATGAAGGCC GCCCCGGTCG GTGCATTCGG CGCGATGGCC
TTCACCATCG GCAAATACGG CCCCGCCGCG CTGGGCAATC TGATCGGCCT GGTCGCGTTG
TTCTACGCGA CCGCGGCGCT GTTCGTGTTC GTGGTGCTGG GGCTGATCGC GAAATTCGTC
GGCTTCAACA TTTTCAGGTT CGTCGGCTAC ATCAAGGACG AGCTGCTGAT CGTGCTCGGC
ACCTCGTCGT CCGAAAGCGC GCTGCCGCAA CTGATGGAAA AGCTCGAACG GCTCGGCTGC
TCGAAATCGG TGGTCGGTCT GGTGGTGCCG ACCGGCTATT CGTTCAACCT CGACGGCACC
AACATCTACA TGACGCTGGC GACCTTGTTC ATCGCCCAGG CGCTCGGGAT CGAGCTCACC
TTCACCGAAC AGGTCACCAT CCTGCTGGTG GCGATGCTGA CTTCGAAGGG TGCGAGCGGC
GTCACCGGGG CGGGCTTCGT CACCCTCGCC GGAACGCTCG CGGCGGTGAA CCCGGCGCTG
GTGCCGGGGA TGGCAATCGT GTTTTCGATC GACAAGTTCA TGAGCGAGGT GCGCGCGCTC
ACCAACATCA CCGGCAACGG CGTCGCCACG GTGTTCGTGT CGTGGTGGGA GGGTGAACTC
GACCACGACC GGCTGCAGGC CAATCTCAAC CGGACGATCG ATCCGTCCGA CGTCGAGACC
GCAATCACCA CCGGCTGA
 
Protein sequence
MSTMTDIGAS GALHRTRSKP WYKVLYVQVL IAIVLGVLLG WVSPHLATNP WIKALGDGFV 
KLIKMVIAPI IFCTVVSGIA HIQDARKVGR VGVKALLYFE VVSSFALILG LIVGNLVPVG
HGLAAKPDAG AVAKYVDQAS HMSSVDFFLN IIPESVVGAF AKGDILQVLL FAILFGFALM
ALGERGHRLR DVIDDTAHAV FGVIAIVMKA APVGAFGAMA FTIGKYGPAA LGNLIGLVAL
FYATAALFVF VVLGLIAKFV GFNIFRFVGY IKDELLIVLG TSSSESALPQ LMEKLERLGC
SKSVVGLVVP TGYSFNLDGT NIYMTLATLF IAQALGIELT FTEQVTILLV AMLTSKGASG
VTGAGFVTLA GTLAAVNPAL VPGMAIVFSI DKFMSEVRAL TNITGNGVAT VFVSWWEGEL
DHDRLQANLN RTIDPSDVET AITTG