Gene Information |
Locus tag | RPB_3010 |
Symbol | |
ID | 3910809 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 3432990 |
End bp | 3434327 |
Gene Length | 1338 bp |
Protein Length | 445 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637884916 |
Product | Sodium:dicarboxylate symporter |
Protein accession | YP_486623 |
Protein GI | 86750127 |
COG category | [C] Energy production and conversion |
COG ID | [COG1301] Na+/H+-dicarboxylate symporters |
TIGRFAM ID | |
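The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length counts one terminal stop codon that is not translated; neither convention is stated in the record. The function name `check_cds_lengths` is hypothetical.

```python
def check_cds_lengths(start_bp: int, end_bp: int,
                      gene_length_bp: int, protein_length_aa: int) -> bool:
    """Return True if coordinates, gene length and protein length agree.

    Assumes 1-based inclusive coordinates and a single stop codon
    included in the gene length but not in the protein length.
    """
    span = end_bp - start_bp + 1          # inclusive span in base pairs
    if span != gene_length_bp:
        return False
    if span % 3 != 0:                     # a CDS should be whole codons
        return False
    codons = span // 3
    return codons - 1 == protein_length_aa  # drop the stop codon


# Values copied from the record for RPB_3010.
consistent = check_cds_lengths(3432990, 3434327, 1338, 445)
print(consistent)
```

Under these assumptions the span is 3434327 - 3432990 + 1 = 1338 bp, which is 446 codons, or 445 amino acids plus one stop codon.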
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0666603 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.647042 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGACGA TGACCGATGT CGGCGTTCCT GAAACGTCGC GACCGTCCAA CGCCAAGCCT TGGTACAAGG TGCTCTATAT CCAGGTCTTG ATCGCGATCG TGCTCGGCGT GCTGGTCGGC TGGCTGTCTC CGCATCTGGC GACCAATCCG TGGATCAAGG CGCTCGGCGA CGGATTCGTC AAACTGATCA AGATGGTGAT AGCGCCGATC ATCTTCTGCA CGGTCGTCTC CGGCATCGCG CATATCCAGG ACGCCCGCAA GGTCGGCCGG GTGGGCATCA AGGCACTGGT GTATTTCGAA GTGGTGTCGT CGTTCGCGCT GATCCTCGGT CTCGTCGTCG GCAATCTTCT GCCGGTCGGG CATGGGCTCG CAGCCAAGCC GGACGCCGGA GCCGTGGCGA AGTACGTCGA CCAGGCCAGC CACATGCACG CGGTCGACTT CTTTCTCAAC ATCATTCCCG AGAGCGTCGT CGGCGCGTTC GCGAAGGGCG ACATCCTGCA GGTGCTGCTG TTCGCCATCC TGTTCGGCTT CGCGCTGATG GCGCTCGGTG AGCGCGGGCA TCGGCTGCGC GACGTGATCG ACGACACCGC TCATGCGGTG TTCGGCGTGA TCGCGATCGT GATGAAGGCC GCGCCGGTCG GTGCCTTCGG CGCGATGGCC TTCACCATCG GCAAATACGG CCCGGCCGCG CTCGGCAATC TGATCGGCCT GGTCGCGCTG TTCTATGCGA CCGCGGCGTT GTTCGTGTTC GTGGTGCTGG GGGTGATCGC CAAATTCGTC GGCTTCAACA TCTTCAAGTT CCTCGGCTAC ATCAAGGACG AGCTGTTGAT CGTGCTCGGC ACCTCGTCGT CCGAGAGCGC GCTGCCGCAA CTGATGGAGA AGCTCGAGCG GCTGGGCTGC TCGAAGTCGG TTGTGGGCCT GGTGGTGCCG ACCGGATACT CGTTCAATCT CGACGGCACC AACATCTACA TGACGCTGGC GACGCTGTTC ATCGCGCAGG CGCTCGGCAT CGAGCTGTCG TTCTCCGAAC AGGTCACGAT CCTGCTGGTT GCGATGCTGA CCTCGAAGGG CGCCAGCGGC GTCACCGGCG CTGGTTTCGT CACGCTGGCG GGGACGCTCG CCGCGGTCAA TCCGGCTCTG GTGCCGGGCA TGGCGATCGT ATTCTCGATC GACAAGTTCA TGAGCGAGGT GCGCGCGCTC ACCAACATCA CCGGCAACGG CGTCGCCACC GTGTTCGTGT CGTGGTGGGA GGGCGAGCTC GACCACGATC GGCTGCACGC CAATCTCGAC AAGACGATCG ACCCGTCGGA CGTCGAGACT GCGGTCACCA CCGGCTGA
|
Protein sequence | MSTMTDVGVP ETSRPSNAKP WYKVLYIQVL IAIVLGVLVG WLSPHLATNP WIKALGDGFV KLIKMVIAPI IFCTVVSGIA HIQDARKVGR VGIKALVYFE VVSSFALILG LVVGNLLPVG HGLAAKPDAG AVAKYVDQAS HMHAVDFFLN IIPESVVGAF AKGDILQVLL FAILFGFALM ALGERGHRLR DVIDDTAHAV FGVIAIVMKA APVGAFGAMA FTIGKYGPAA LGNLIGLVAL FYATAALFVF VVLGVIAKFV GFNIFKFLGY IKDELLIVLG TSSSESALPQ LMEKLERLGC SKSVVGLVVP TGYSFNLDGT NIYMTLATLF IAQALGIELS FSEQVTILLV AMLTSKGASG VTGAGFVTLA GTLAAVNPAL VPGMAIVFSI DKFMSEVRAL TNITGNGVAT VFVSWWEGEL DHDRLHANLD KTIDPSDVET AVTTG
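The sequences above are displayed in space-separated blocks of ten characters. As a minimal sketch of how such a field might be turned back into a contiguous string and its GC fraction computed, the helpers below strip the whitespace and count G and C bases. The names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first two blocks of the gene sequence, so its result is not the GC content reported for the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C (0.0 for empty input)."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence shown in the record.
fragment = clean_sequence("ATGTCGACGA TGACCGATGT")
print(len(fragment), gc_fraction(fragment))
```

Pasting the full gene sequence field into `clean_sequence` would allow the same calculation over all 1338 bases for comparison with the GC content listed in the record.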