Gene Rsph17029_2229 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_2229 
Symbol 
ID4895649 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp2361312 
End bp2363129 
Gene Length1818 bp 
Protein Length605 aa 
Translation table11 
GC content66% 
IMG OID640112823 
ProductNa+/solute symporter 
Protein accessionYP_001044104 
Protein GI126462990 
COG category[R] General function prediction only 
COG ID[COG4147] Predicted symporter 
TIGRFAM ID[TIGR03648] probable sodium:solute symporter, VC_2705 subfamily 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.164942 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCAAT TCACTCTCAA CCTGATCGTG GTCGGCATCA CCTTTGCCAT CTACATCGGG 
ATTGCGATCT GGGCCCGGGC GGGTTCGACC GCAGAGTTCT ATGCCGCCGG TCAGGGCGTG
CACCCGGTCA TGAACGGCAT GGCCACGGGC GCGGACTGGA TGTCGGCGGC CTCCTTCATC
TCGATGGCGG GCATCATCGC GCTCGCCCCG GCCGGCGGCT ACACCCAGTC GGCCTTCCTC
ATGGGCTGGA CCGGCGGCTA TGTGCTTCTG GCCCTGCTGC TCGCGCCCTA CCTGCGCAAG
TTCGGCAAGT TCACCGTGCC GGAATTCATC GGGGACCGCT TCTATTCGGG CGGCGCCCGG
CTTCTGGCGG TGGTCTGCCT CATCATCATC TCGATCACCT ATGTGATCGG GCAGATGCGC
GGCGTGGGCG TGACCTTCGG GCGCTTCCTC GAGATCTCGA CCGACATGGG TCTCTACATC
GGCGCGGCGA TCGTCTTCGC CTATGCCGTG TTCGGCGGCA TGAAGGGCAT CACCTACACG
CAGGTGGCCC AGTATGTCGT GCTCATCATC GCCTACACGA TCCCCGCGAT CTTCATCTCG
CTCGCGCTGA CGGGGGCCTT CCTGCCGCAG CTCGGCCTGA TCGGCGACTA TGCGCCCGCG
GGCGGCGACC AGGCGTTCCT CGCCAAGCTC GATCAGGTGG TCACGGAACT GGGCTTTACC
GCCTATACCG CCGACACGCC CTCGATGCTG AACATGTTCC TCTTCACCAT GTCGCTGATG
ATCGGCACCG CGGGCCTGCC GCACGTCATC ATCCGCTTCT TCACCGTGCC GAAGGTGGCC
GACGCCCGTT CCTCGGCCGG CTGGGCGCTC GTCTTCATCG CGCTCCTCTA TACCGTCGCT
CCGGCCGTGG GCTCGATGGC GCGCCTGAAC CTGACCACCA CCATGTGGCC CGGCGCCGTG
ACGGGCAATG CGCTGGATGC CGATGCGATC TCGGTCGAGG CGATCCAGAC CGATCCGCGC
CTGAACTGGG TGAAGACCTG GGAACAGACC GGCCTCCTCG TGATGGAAGA CCTGAACGGC
GACGGCAAGA TCCAGTACTA CAACGACAAG AACCCGGCGA TGGCCGAGGT GGCCGCCGAG
CGCGGCTGGG CCGGCAACGA AGTCACCAAG ATCGACAACG ACATCATGGT GCTCGCCAAC
CCCGAGATCG CGGCCCTTCC GGGCTGGGTG ATCGCGCTCG TGGCCGCCGG TGCGCTGGCT
GCCGCCCTCT CGACCGCCGC GGGCCTGCTG CTCGCCATCT CGTCGGCGAT CTCGCACGAC
CTCATCAAGG GGTCGATCAA CCCGAACATC TCGGAGAAGG GCGAGCTTCT GGCCGCGCGG
ATCTCGATGA CCGGCGCGAT CGCGATCGCC ACCTGGCTCG GCCTCAACCC GCCCGGCTTT
GCCGCGCAGG TGGTGGCACT GGCCTTCGGT CTGGCCGCCG CGACGATCTT CCCGGCGCTG
ATGATGGGGA TCTTCTCGAA GCGCGTGAAC AAGGAAGGCG CGATCCTCGG GATGCTCGTC
GGCCTGATCT TCACCGCGGT CTACATCTTC CTCTACAAGG GCTGGTTCTT CATCCCCGGC
ACCGCGATGT ATGAGGACGT TCCCGCCAAC TACTTCTTCG GCATCTCGCC GCTGTCGATC
GGCACCATCG GCGCCATCCT GAACTTCGCC GTGGCCTTCG GGGTCTCGGC GGTGACGAAG
GCTCCGCCGC GTCAGGTGGT CGATCTGGTC GAGAGCATCC GCATTCCGCG CGGTGCCGGC
AAGGCCACGG CCCACTGA
 
Protein sequence
MDQFTLNLIV VGITFAIYIG IAIWARAGST AEFYAAGQGV HPVMNGMATG ADWMSAASFI 
SMAGIIALAP AGGYTQSAFL MGWTGGYVLL ALLLAPYLRK FGKFTVPEFI GDRFYSGGAR
LLAVVCLIII SITYVIGQMR GVGVTFGRFL EISTDMGLYI GAAIVFAYAV FGGMKGITYT
QVAQYVVLII AYTIPAIFIS LALTGAFLPQ LGLIGDYAPA GGDQAFLAKL DQVVTELGFT
AYTADTPSML NMFLFTMSLM IGTAGLPHVI IRFFTVPKVA DARSSAGWAL VFIALLYTVA
PAVGSMARLN LTTTMWPGAV TGNALDADAI SVEAIQTDPR LNWVKTWEQT GLLVMEDLNG
DGKIQYYNDK NPAMAEVAAE RGWAGNEVTK IDNDIMVLAN PEIAALPGWV IALVAAGALA
AALSTAAGLL LAISSAISHD LIKGSINPNI SEKGELLAAR ISMTGAIAIA TWLGLNPPGF
AAQVVALAFG LAAATIFPAL MMGIFSKRVN KEGAILGMLV GLIFTAVYIF LYKGWFFIPG
TAMYEDVPAN YFFGISPLSI GTIGAILNFA VAFGVSAVTK APPRQVVDLV ESIRIPRGAG
KATAH