Gene TM1040_0410 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0410 
Symbol 
ID4078804 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp420423 
End bp421640 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content64% 
IMG OID638005705 
Productsodium:galactoside symporter family protein 
Protein accessionYP_612405 
Protein GI99080251 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2211] Na+/melibiose symporter and related transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0469682 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACTGGC GGGTCAGCAC ATATGCACTG ATGCTGGCTG CGGCTGGCTT GCCGCTCTAT 
ATCCACCTAC CGCGATACGC GACGGGGGAG CTTGGCATGA GTCTTGCCAC CCTCGGCGTG
ATCCTGGCCG GCATCCGGGT GATGGATTTT GCACAGGACC CCGCGATTGG CTGGCTGGTG
GACCGCTACC CGCGTCAAAA GCCTGCCTTT GCCACCCTTG CCGCTCTGGG CATGGCACTG
GGGTTTGTGA TGCTCTACAC GCTGCGGCCC GAGACGGGCA GCACGGTGTG GTTTACCGCA
GCCCTTGCGG TGGTATTCAC CGCCTACAGC CTTGGGACGA TCTTGTTTTA CGGGCAAAGC
GCCGCACTGG CGGCGCAGGG AAACGGGCTG ATTTCACTGG CCGGCTACCG TGAGGCAGGC
ACGCTGGCGG GCATTATCAT TGCCGCGAGT GCCCCGGCGG CACTGGTGGC ACTTGGGGCA
TCGGGCAGTG GATACGGCGC ATTTGGTATC CTGTTGGCCG CTATCTGCCT CGTCGCGCTC
TGGTCAAGCC GCCCGCTCTG GCGCGTCCCA AGTGCGTCTG ACGCCCCTTT GACCCTGTCC
GACTTGCGCA GCTCGGGGGC CCTCGGCCTT CTGGCGCTGG CGTTTGTGAA TGCGCTGCCG
GTGGCAATCA CCTCGACGCT GTTCTTGTTC TTTGTCGAAG ACAGGCTTCT GCTGCCGGAG
TTTGCCGGCC CCTTCCTGAT CCTGTTCTTT CTCGCAGCCG GGCTCTCGGT GCCGGTCTGG
ACCCGCACCG CAGCGCGATA TGGGGCGACT CGCAGCCTGA TCTTTGCGAT GTGCCTCGCC
ATTCTGGCCT TTGTCGGCGC CGCTCTCCTG CCTGCGGGTG CAGCCTTTGG ATTTGCGCTG
ATCTGCATTG GGTCCGGCGC TGCGCTTGGC GCAGATATGG TCATCCTGCC CGCCCTTTTT
GCAGGCGCGC TCGATCGTGC AGGGCTGCAA GCTGGTCGCG CCTTCGGGCT TTGGTCCTTT
GCCGCAAAGC TTGCACTCGC AAGCGCGGCG GCACTGCTGC TGCCGCTGCT CGAAGTGAGC
GGTTATCGGC CCGGCGAAAC AAATTCCGCA GCGGCGCTGA CCGCGCTGAC CCTCGCCTAC
GCGGTTCTGC CCTGTGTCAT CAAATGCGCC GCAATCGTGC TGGCGCTGCA ACTTCCCCGT
GAAGAGGTCC ATGCATGA
 
Protein sequence
MNWRVSTYAL MLAAAGLPLY IHLPRYATGE LGMSLATLGV ILAGIRVMDF AQDPAIGWLV 
DRYPRQKPAF ATLAALGMAL GFVMLYTLRP ETGSTVWFTA ALAVVFTAYS LGTILFYGQS
AALAAQGNGL ISLAGYREAG TLAGIIIAAS APAALVALGA SGSGYGAFGI LLAAICLVAL
WSSRPLWRVP SASDAPLTLS DLRSSGALGL LALAFVNALP VAITSTLFLF FVEDRLLLPE
FAGPFLILFF LAAGLSVPVW TRTAARYGAT RSLIFAMCLA ILAFVGAALL PAGAAFGFAL
ICIGSGAALG ADMVILPALF AGALDRAGLQ AGRAFGLWSF AAKLALASAA ALLLPLLEVS
GYRPGETNSA AALTALTLAY AVLPCVIKCA AIVLALQLPR EEVHA