Gene Information |
Locus tag | TM1040_0410 |
Symbol | |
ID | 4078804 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 420423 |
End bp | 421640 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 638005705 |
Product | sodium:galactoside symporter family protein |
Protein accession | YP_612405 |
Protein GI | 99080251 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2211] Na+/melibiose symporter and related transporters |
TIGRFAM ID | |
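
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the annotated span includes the stop codon. Both assumptions are consistent with the values in this record but are not stated by it. The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a span that ends with a stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive coordinates
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the final codon is the stop codon
    return gene_len, protein_len


# Values taken from the record: Start bp 420423, End bp 421640
gene_len, protein_len = cds_lengths(420423, 421640)
print(gene_len, protein_len)  # 1218 405
```

The result matches the Gene Length (1218 bp) and Protein Length (405 aa) fields. The strand does not affect the calculation, since it only changes the direction in which the span is read.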
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0469682 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGAACTGGC GGGTCAGCAC ATATGCACTG ATGCTGGCTG CGGCTGGCTT GCCGCTCTAT ATCCACCTAC CGCGATACGC GACGGGGGAG CTTGGCATGA GTCTTGCCAC CCTCGGCGTG ATCCTGGCCG GCATCCGGGT GATGGATTTT GCACAGGACC CCGCGATTGG CTGGCTGGTG GACCGCTACC CGCGTCAAAA GCCTGCCTTT GCCACCCTTG CCGCTCTGGG CATGGCACTG GGGTTTGTGA TGCTCTACAC GCTGCGGCCC GAGACGGGCA GCACGGTGTG GTTTACCGCA GCCCTTGCGG TGGTATTCAC CGCCTACAGC CTTGGGACGA TCTTGTTTTA CGGGCAAAGC GCCGCACTGG CGGCGCAGGG AAACGGGCTG ATTTCACTGG CCGGCTACCG TGAGGCAGGC ACGCTGGCGG GCATTATCAT TGCCGCGAGT GCCCCGGCGG CACTGGTGGC ACTTGGGGCA TCGGGCAGTG GATACGGCGC ATTTGGTATC CTGTTGGCCG CTATCTGCCT CGTCGCGCTC TGGTCAAGCC GCCCGCTCTG GCGCGTCCCA AGTGCGTCTG ACGCCCCTTT GACCCTGTCC GACTTGCGCA GCTCGGGGGC CCTCGGCCTT CTGGCGCTGG CGTTTGTGAA TGCGCTGCCG GTGGCAATCA CCTCGACGCT GTTCTTGTTC TTTGTCGAAG ACAGGCTTCT GCTGCCGGAG TTTGCCGGCC CCTTCCTGAT CCTGTTCTTT CTCGCAGCCG GGCTCTCGGT GCCGGTCTGG ACCCGCACCG CAGCGCGATA TGGGGCGACT CGCAGCCTGA TCTTTGCGAT GTGCCTCGCC ATTCTGGCCT TTGTCGGCGC CGCTCTCCTG CCTGCGGGTG CAGCCTTTGG ATTTGCGCTG ATCTGCATTG GGTCCGGCGC TGCGCTTGGC GCAGATATGG TCATCCTGCC CGCCCTTTTT GCAGGCGCGC TCGATCGTGC AGGGCTGCAA GCTGGTCGCG CCTTCGGGCT TTGGTCCTTT GCCGCAAAGC TTGCACTCGC AAGCGCGGCG GCACTGCTGC TGCCGCTGCT CGAAGTGAGC GGTTATCGGC CCGGCGAAAC AAATTCCGCA GCGGCGCTGA CCGCGCTGAC CCTCGCCTAC GCGGTTCTGC CCTGTGTCAT CAAATGCGCC GCAATCGTGC TGGCGCTGCA ACTTCCCCGT GAAGAGGTCC ATGCATGA
Protein sequence | MNWRVSTYAL MLAAAGLPLY IHLPRYATGE LGMSLATLGV ILAGIRVMDF AQDPAIGWLV DRYPRQKPAF ATLAALGMAL GFVMLYTLRP ETGSTVWFTA ALAVVFTAYS LGTILFYGQS AALAAQGNGL ISLAGYREAG TLAGIIIAAS APAALVALGA SGSGYGAFGI LLAAICLVAL WSSRPLWRVP SASDAPLTLS DLRSSGALGL LALAFVNALP VAITSTLFLF FVEDRLLLPE FAGPFLILFF LAAGLSVPVW TRTAARYGAT RSLIFAMCLA ILAFVGAALL PAGAAFGFAL ICIGSGAALG ADMVILPALF AGALDRAGLQ AGRAFGLWSF AAKLALASAA ALLLPLLEVS GYRPGETNSA AALTALTLAY AVLPCVIKCA AIVLALQLPR EEVHA
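
The sequences in this record are printed in space-separated blocks of 10 characters. A minimal sketch of how such a string could be normalized and its GC fraction computed is shown below. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first three blocks of the gene sequence, so its GC fraction is not expected to equal the 64% reported for the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three 10-base blocks of the gene sequence in this record
raw = "GTGAACTGGC GGGTCAGCAC ATATGCACTG"
seq = clean_sequence(raw)
print(len(seq), f"{gc_fraction(seq):.1%}")
```

Applying the same two helpers to the full Gene sequence field would allow the reported GC content and gene length to be checked against the sequence itself.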