Gene Information |
Locus tag | TM1040_0523 |
Symbol | |
ID | 4077229 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 549000 |
End bp | 550034 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638005819 |
Product | bile acid:sodium symporter |
Protein accession | YP_612518 |
Protein GI | 99080364 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0798] Arsenite efflux pump ACR3 and related permeases |
TIGRFAM ID | [TIGR00832] arsenical-resistance protein |
|
|
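The length fields above are consistent with the coordinates if Start bp and End bp are read as inclusive, 1-based positions. That reading is an assumption, not something the record states. A minimal Python sketch of the check follows; the function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 549000, End bp 550034.
length = gene_length(549000, 550034)
print(length, protein_length(length))  # 1035 344
```

Both results match the Gene Length (1035 bp) and Protein Length (344 aa) fields.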
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.27289 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.171222 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCATCT TCGAGAAATA CCTGAGCCTC TGGGTGGCAT TGGCGATGAC GGCCGGCATT GCGCTTGGCA GTCTGGCCCC CGGCGTGATG GAGGCCATCG CCGCGCTGGA GGTGGCCCGT GTCAATTTAG TGGTCGCGGC ACTGATCTGG GCCATGGTTT ACCCGATGAT GATTGGCGTG AACCCACGAA GCCTACGAGA TGTGGCGCGC CAGCCCAAGG GCCTTGCGAT CACGCTGGTG GTCAACTGGC TGATCAAACC CTTTACCATG GCCGCACTTG GCGTGCTGTT CTTCGAGGTG GTCTTTGCGC CCTTTCTGGA GCCACAAGAC GCACAGCAGT ATATCGCGGG GCTGATCCTT TTGGGGGCCG CGCCCTGCAC CGCGATGGTT TTTGTGTGGT CGCAACTCAC CCGGGGCGAC GAAAGCTACA CCCTGCTGCA AGTCTCGGTG AACGATCTCA TCATGGTGGT GGCCTTTGCC CCTATCGTGG CCTTTCTCTT GGGTGTCACG GACATTGAGG TGCCATGGAG CACGCTGATC CTGTCGGCGG TGCTGTTTGT TGCTCTGCCG CTGATGGCCG GTCTCTGGAC CCGCAACCGT TTGGCGGAAG AGGCGCGTAT CACGGCCTTT CTCGCACGGA TCAAACCGCT CTCGATGCTG GGGCTGATCA CAACGGTGGT GATCCTGTTT GGCCTGCAAG GTCAGGTCAT TCTGGACCGC CCGAGCGTGA TTGCGATGAT CGCCGTGCCC ATCCTGATCC AGAGCTACGG GATCTTCTTT CTCGCCTATG GCGCCGCCTA TGCGCTGCGG GTGCCACATC GGATCGCAGC ACCCTGCGCG CTGATCGGGA CGTCGAATTT CTTTGAACTG GCGGTGGCTG TCGCGATCAG CCTCTTTGGG CTTCACTCCG GGGCGGCGCT CGCAACCGTG GTTGGCGTAC TGGTCGAGGT TCCAGTGATG CTGACACTGG TGGCCTTTGC CAATCGCACC CGTGCCAGGT TTGCCTTGAC CGGAGCAGAC CACCAGGCAA GCTGA
|
Protein sequence | MSIFEKYLSL WVALAMTAGI ALGSLAPGVM EAIAALEVAR VNLVVAALIW AMVYPMMIGV NPRSLRDVAR QPKGLAITLV VNWLIKPFTM AALGVLFFEV VFAPFLEPQD AQQYIAGLIL LGAAPCTAMV FVWSQLTRGD ESYTLLQVSV NDLIMVVAFA PIVAFLLGVT DIEVPWSTLI LSAVLFVALP LMAGLWTRNR LAEEARITAF LARIKPLSML GLITTVVILF GLQGQVILDR PSVIAMIAVP ILIQSYGIFF LAYGAAYALR VPHRIAAPCA LIGTSNFFEL AVAVAISLFG LHSGAALATV VGVLVEVPVM LTLVAFANRT RARFALTGAD HQAS
|
| |
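The protein sequence can be reproduced from the gene sequence by reading it in codons. The sketch below is illustrative only, with hypothetical helper names, and is not the pipeline that produced this record. It assumes the gene sequence is already given in coding orientation (it begins with ATG even though the gene lies on the minus strand) and uses the standard codon assignments, which translation table 11 shares for internal codons. Alternative start codons that table 11 permits are not handled.

```python
# Standard codon table built in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO
    )
}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence from the record.
print(translate("ATGAGCATCT TCGAGAAATA CCTGAGCCTC"))  # MSIFEKYLSL
```

Passing the full Gene sequence field to `translate` and `gc_fraction` allows the Protein sequence and GC content fields to be cross-checked in the same way.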