Gene Rru_A1449 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRru_A1449 
Symbol 
ID3834864 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodospirillum rubrum ATCC 11170 
KingdomBacteria 
Replicon accessionNC_007643 
Strand
Start bp1713883 
End bp1714959 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content64% 
IMG OID637825539 
Productbile acid:sodium symporter 
Protein accessionYP_426537 
Protein GI83592785 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0798] Arsenite efflux pump ACR3 and related permeases 
TIGRFAM ID[TIGR00832] arsenical-resistance protein 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.939169 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGTCCG TTCACAGCGA TCCCCCCGCC GCCAAGGTCT CTGGCAAGCC GATGGGGTTT 
TTCGAGCGTA CCCTGACCCT GTGGGTCGGG CTGTGCATCG TGGTCGGGGT GACCTTGGGC
CATGTCGCGC CCGGCCCCTT TCAGGCCATC GCCGGGCTGG AAATCGCCCA GGTCAATCTG
CCCGTCGCCC TGCTGATCTG GCTGATGATC ATTCCGATGC TGCTGAAGAT CGACTTCGCC
GCGCTGGGCA CCGTCGGGCG GCACTGGAAG GGCATGGGCG TGACCCTGTT CATCAACTGG
GGGGTCAAGC CGTTTTCCAT GGCCCTGCTC GGCTGGTTGT TCATCAGCAC GCTGTTTCGG
CCCTGGCTGC CCGCCGATCA GATCGACAGC TATATCGCCG GGTTGATCCT GCTGGCCGCC
GCGCCCTGCA CGGCGATGGT CTTCGTCTGG TCCAACCTGA CCGGCGGCGA ACCCAATTTC
ACCCTGTCGC AGGTGGCGCT GAACGACCTG ATCATGGTCT TCGCCTTCGC CCCCATCGTC
GGGTTGCTGC TTGGCCTGTC GTCGATCACC ATTCCCTGGG ACACCCTGCT GATCTCGGTC
GCGCTGTATA TCGTGGTGCC GGTGATCATC GCCCAGATCT GGCGGCGCGC CCTGGTTGCG
CGCGGTCCCC AGGCCGTGGA GCGCGTGCTC AAAACCCTGC ATCCGCTGTC TTTGGGCGCT
CTTCTCGCCA CGCTGGTGCT GCTGTTTGGC TTCCAGGGCG AACAGATCCT CGCCCAACCA
CTGATCATCG CGCTGCTGGC GGTGCCGATC ACCATCCAGG TCTATTTCAA CAGCGGCCTT
GCCTATCTGC TGTCGAAGAA GCTGGGCGTC GCCCATTGCG TGGCCGGACC GGCGGCGCTG
ATCGGCGCCA GCAACTTCTT CGAACTGGCC GTCGCCGCCG CGATCAGCCT GTTCGGCTTC
CAGTCCGGGG CGGCGCTGGC CACCGTGGTC GGCGTGCTGA TCGAAGTGCC GGTGATGCTG
TCGGTGGTGA AAATCGTCAA TGCCTCCAAG GGCTGGTACG AAGGAAAGCC CGCATGA
 
Protein sequence
MMSVHSDPPA AKVSGKPMGF FERTLTLWVG LCIVVGVTLG HVAPGPFQAI AGLEIAQVNL 
PVALLIWLMI IPMLLKIDFA ALGTVGRHWK GMGVTLFINW GVKPFSMALL GWLFISTLFR
PWLPADQIDS YIAGLILLAA APCTAMVFVW SNLTGGEPNF TLSQVALNDL IMVFAFAPIV
GLLLGLSSIT IPWDTLLISV ALYIVVPVII AQIWRRALVA RGPQAVERVL KTLHPLSLGA
LLATLVLLFG FQGEQILAQP LIIALLAVPI TIQVYFNSGL AYLLSKKLGV AHCVAGPAAL
IGASNFFELA VAAAISLFGF QSGAALATVV GVLIEVPVML SVVKIVNASK GWYEGKPA