Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3666 |
Symbol | |
ID | 4075635 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | + |
Start bp | 717675 |
End bp | 719459 |
Gene Length | 1785 bp |
Protein Length | 594 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 638005186 |
Product | Na+/solute symporter |
Protein accession | YP_611895 |
Protein GI | 99078637 |
COG category | [R] General function prediction only |
COG ID | [COG4147] Predicted symporter |
TIGRFAM ID | [TIGR03648] probable sodium:solute symporter, VC_2705 subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATCAGT TTACACTCAA CCTTCTCTTT GTGGGCGCGT CCTTTGCGCT CTACATCGGG ATCGCGATCT GGGCACGGGC CGGATCAACC TCTGAATTCT ATGCCGCCGG GCGCGGCGTG CATCCTGTCA CCAACGGGAT GGCCACCGCG GCAGACTGGA TGTCGGCGGC TTCCTTTATC TCCATGGCAG GTCTCATCGC CTTTACCGGC TATGACAACT CCTCCTTCCT GATGGGCTGG ACCGGGGGCT ACGTGCTGCT CGCACTGCTG CTGGCACCAT ATCTGCGCAA GTTCGGCAAG TTCACCGTCT CTGAATTCAT CGGCGACCGC TTCTATAGCC CGACTGCACG TCTGGTGGCG GTGATATGTC TGCTGGTGGC CTCGATCACC TATGTGATCG GGCAAATGCA GGGTGTGGGT ATCGCCTTTG GGCGTTTTCT TGAAATCGAC GCCTTTTGGG GTCTGCTGAT CGGTGCCTGT GTTGTGTTTG CCTATGCGGT GTTTGGCGGC ATGAAGGGCG TGACCTACAC GCAGGTTGCA CAATACTGCG TGCTGATTAC CGCCTACACG ATCCCGGCGG TGTTTATTTC GCTGCAACTC ACTGGCAATC CGATCCCAGC CTTGGGGCTC TTTGGCTCCA CCGAGAGCGG CGAGCCGCTG CTGGCCAAGC TCAACCAGAT CGTCACCGAC CTTGGCTTTG CGGAATACAC CGCAGCACAT GGCTCCACCA TCAACATGGT GCTCTTCACC CTGTCGCTGA TGATCGGCAC CGCAGGTCTG CCCCACGTCA TCATGCGCTT CTTTACGGTG CCGCGCGTGT CCGATGCGCG CTGGTCGGCG GGCTGGACCC TTGTGTTCAT CGCGCTTCTC TATCTGACGG CGCCGGCCGT GGGCGCAATG GCGCGCCTCA ACATCTCTGA GCTGATGTGG CCTAACGGGA CCGAAGCACA GGCTGTGAGT GTCGAGCAGA TCGAAACCGA TCCTGAGTAC GCATGGATGG CGACGTGGCA GAAAACCGGC CTTCTCGGTT GGGAAGACAA GAACGGCGAC GGGCGCATTC AGTACTACAA TGACGCCAAT GCGGACCTGC AAGCCAAAGC CGAAGCAAAC GGTTGGAAAG GCAATGAGCT CACCAACTTC AACCGCGACA TCCTTGTGCT TGCAAACCCT GAGATTGCAT CGCTCCCCGG TTGGGTGATC GGTCTGGTGG CCGCAGGTGG TCTCGCGGCG GCGCTTTCGA CCGCAGCCGG TCTCTTGCTG GCGATCTCCT CGGCGGTGAG CCACGACCTT CTCAAGGGTC AGCTGACTCC CAACATGTCG GAGAAATCCG AACTGTTGGC GGCGCGGGTG TCGATGGCAG CTGCAATCGT GGTGGCGGTT CTTCTGGGCC TCAACCCTCC GGGGTTTGCG GCGCAGACGG TGGCGTTGGC CTTTGGTCTT GCGGCAGCCT CGATTTTCCC GGCGCTGATG ATGGGGATCT TCTCGACTCG CATCAACAAC AGCGGTGCGG TTGCAGGCAT GCTGGCCGGT CTCGTGGTGA CCTTGCTCTA TATCTTCCTG CACAAGGGCT GGTTCTTCAT CCCGGACACC AATTCGTTCA CCGATGCCGA CCCGCTCCTT GGGCCGATCA AATCCACCTC CTTTGGTGCA ATCGGAGCTC TGGTCAACTT TGCGGTGGCT TATGTCGTCA CCAACATGAC CAAGGAAACT CCGCAGCACA TCAAGGATCT CGTCGAGAGC GTCCGTGTGC CGCGCGGCGC AGGTCAAGCG GTCGACGGTC ACTAA
|
Protein sequence | MDQFTLNLLF VGASFALYIG IAIWARAGST SEFYAAGRGV HPVTNGMATA ADWMSAASFI SMAGLIAFTG YDNSSFLMGW TGGYVLLALL LAPYLRKFGK FTVSEFIGDR FYSPTARLVA VICLLVASIT YVIGQMQGVG IAFGRFLEID AFWGLLIGAC VVFAYAVFGG MKGVTYTQVA QYCVLITAYT IPAVFISLQL TGNPIPALGL FGSTESGEPL LAKLNQIVTD LGFAEYTAAH GSTINMVLFT LSLMIGTAGL PHVIMRFFTV PRVSDARWSA GWTLVFIALL YLTAPAVGAM ARLNISELMW PNGTEAQAVS VEQIETDPEY AWMATWQKTG LLGWEDKNGD GRIQYYNDAN ADLQAKAEAN GWKGNELTNF NRDILVLANP EIASLPGWVI GLVAAGGLAA ALSTAAGLLL AISSAVSHDL LKGQLTPNMS EKSELLAARV SMAAAIVVAV LLGLNPPGFA AQTVALAFGL AAASIFPALM MGIFSTRINN SGAVAGMLAG LVVTLLYIFL HKGWFFIPDT NSFTDADPLL GPIKSTSFGA IGALVNFAVA YVVTNMTKET PQHIKDLVES VRVPRGAGQA VDGH
|
| |