Gene TM1040_0523 details


Gene Information

Locus tag: TM1040_0523
Symbol:
ID: 4077229
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 549000
End bp: 550034
Gene Length: 1035 bp
Protein Length: 344 aa
Translation table: 11
GC content: 61%
IMG OID: 638005819
Product: bile acid:sodium symporter
Protein accession: YP_612518
Protein GI: 99080364
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0798] Arsenite efflux pump ACR3 and related permeases
TIGRFAM ID: [TIGR00832] arsenical-resistance protein
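
The coordinate and length fields above can be cross-checked with a little arithmetic. The Python sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon is excluded."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(549000, 550034)
print(length)                     # 1035
print(protein_length_aa(length))  # 344
```

Both results agree with the Gene Length and Protein Length fields listed in the record.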


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.27289
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.171222
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCATCT TCGAGAAATA CCTGAGCCTC TGGGTGGCAT TGGCGATGAC GGCCGGCATT 
GCGCTTGGCA GTCTGGCCCC CGGCGTGATG GAGGCCATCG CCGCGCTGGA GGTGGCCCGT
GTCAATTTAG TGGTCGCGGC ACTGATCTGG GCCATGGTTT ACCCGATGAT GATTGGCGTG
AACCCACGAA GCCTACGAGA TGTGGCGCGC CAGCCCAAGG GCCTTGCGAT CACGCTGGTG
GTCAACTGGC TGATCAAACC CTTTACCATG GCCGCACTTG GCGTGCTGTT CTTCGAGGTG
GTCTTTGCGC CCTTTCTGGA GCCACAAGAC GCACAGCAGT ATATCGCGGG GCTGATCCTT
TTGGGGGCCG CGCCCTGCAC CGCGATGGTT TTTGTGTGGT CGCAACTCAC CCGGGGCGAC
GAAAGCTACA CCCTGCTGCA AGTCTCGGTG AACGATCTCA TCATGGTGGT GGCCTTTGCC
CCTATCGTGG CCTTTCTCTT GGGTGTCACG GACATTGAGG TGCCATGGAG CACGCTGATC
CTGTCGGCGG TGCTGTTTGT TGCTCTGCCG CTGATGGCCG GTCTCTGGAC CCGCAACCGT
TTGGCGGAAG AGGCGCGTAT CACGGCCTTT CTCGCACGGA TCAAACCGCT CTCGATGCTG
GGGCTGATCA CAACGGTGGT GATCCTGTTT GGCCTGCAAG GTCAGGTCAT TCTGGACCGC
CCGAGCGTGA TTGCGATGAT CGCCGTGCCC ATCCTGATCC AGAGCTACGG GATCTTCTTT
CTCGCCTATG GCGCCGCCTA TGCGCTGCGG GTGCCACATC GGATCGCAGC ACCCTGCGCG
CTGATCGGGA CGTCGAATTT CTTTGAACTG GCGGTGGCTG TCGCGATCAG CCTCTTTGGG
CTTCACTCCG GGGCGGCGCT CGCAACCGTG GTTGGCGTAC TGGTCGAGGT TCCAGTGATG
CTGACACTGG TGGCCTTTGC CAATCGCACC CGTGCCAGGT TTGCCTTGAC CGGAGCAGAC
CACCAGGCAA GCTGA
 
Protein sequence
MSIFEKYLSL WVALAMTAGI ALGSLAPGVM EAIAALEVAR VNLVVAALIW AMVYPMMIGV 
NPRSLRDVAR QPKGLAITLV VNWLIKPFTM AALGVLFFEV VFAPFLEPQD AQQYIAGLIL
LGAAPCTAMV FVWSQLTRGD ESYTLLQVSV NDLIMVVAFA PIVAFLLGVT DIEVPWSTLI
LSAVLFVALP LMAGLWTRNR LAEEARITAF LARIKPLSML GLITTVVILF GLQGQVILDR
PSVIAMIAVP ILIQSYGIFF LAYGAAYALR VPHRIAAPCA LIGTSNFFEL AVAVAISLFG
LHSGAALATV VGVLVEVPVM LTLVAFANRT RARFALTGAD HQAS
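
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal Python sketch, not the tool that produced this record: it assumes the gene sequence is shown in coding orientation, and it builds the amino acid assignments of NCBI translation table 11 (which match those of the standard code) from the usual TCAG-ordered layout. Table 11 also allows alternative start codons, which this sketch does not handle; the gene here begins with ATG, so that simplification should not matter. The names `CODON_TABLE`, `clean`, `translate`, and `gc_fraction` are hypothetical helpers.

```python
from itertools import product

# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and last 15 bases of the gene sequence above.
print(translate("ATGAGCATCT TCGAGAAATA CCTGAGCCTC"))  # MSIFEKYLSL
print(translate("CACCAGGCAA GCTGA"))                  # HQAS
```

The two fragments translate to the first ten and last four residues of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_fraction` would allow the whole protein and the GC content field to be checked the same way.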