Gene TM1040_3666 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3666 
Symbol 
ID4075635 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp717675 
End bp719459 
Gene Length1785 bp 
Protein Length594 aa 
Translation table11 
GC content60% 
IMG OID638005186 
ProductNa+/solute symporter 
Protein accessionYP_611895 
Protein GI99078637 
COG category[R] General function prediction only 
COG ID[COG4147] Predicted symporter 
TIGRFAM ID[TIGR03648] probable sodium:solute symporter, VC_2705 subfamily 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCAGT TTACACTCAA CCTTCTCTTT GTGGGCGCGT CCTTTGCGCT CTACATCGGG 
ATCGCGATCT GGGCACGGGC CGGATCAACC TCTGAATTCT ATGCCGCCGG GCGCGGCGTG
CATCCTGTCA CCAACGGGAT GGCCACCGCG GCAGACTGGA TGTCGGCGGC TTCCTTTATC
TCCATGGCAG GTCTCATCGC CTTTACCGGC TATGACAACT CCTCCTTCCT GATGGGCTGG
ACCGGGGGCT ACGTGCTGCT CGCACTGCTG CTGGCACCAT ATCTGCGCAA GTTCGGCAAG
TTCACCGTCT CTGAATTCAT CGGCGACCGC TTCTATAGCC CGACTGCACG TCTGGTGGCG
GTGATATGTC TGCTGGTGGC CTCGATCACC TATGTGATCG GGCAAATGCA GGGTGTGGGT
ATCGCCTTTG GGCGTTTTCT TGAAATCGAC GCCTTTTGGG GTCTGCTGAT CGGTGCCTGT
GTTGTGTTTG CCTATGCGGT GTTTGGCGGC ATGAAGGGCG TGACCTACAC GCAGGTTGCA
CAATACTGCG TGCTGATTAC CGCCTACACG ATCCCGGCGG TGTTTATTTC GCTGCAACTC
ACTGGCAATC CGATCCCAGC CTTGGGGCTC TTTGGCTCCA CCGAGAGCGG CGAGCCGCTG
CTGGCCAAGC TCAACCAGAT CGTCACCGAC CTTGGCTTTG CGGAATACAC CGCAGCACAT
GGCTCCACCA TCAACATGGT GCTCTTCACC CTGTCGCTGA TGATCGGCAC CGCAGGTCTG
CCCCACGTCA TCATGCGCTT CTTTACGGTG CCGCGCGTGT CCGATGCGCG CTGGTCGGCG
GGCTGGACCC TTGTGTTCAT CGCGCTTCTC TATCTGACGG CGCCGGCCGT GGGCGCAATG
GCGCGCCTCA ACATCTCTGA GCTGATGTGG CCTAACGGGA CCGAAGCACA GGCTGTGAGT
GTCGAGCAGA TCGAAACCGA TCCTGAGTAC GCATGGATGG CGACGTGGCA GAAAACCGGC
CTTCTCGGTT GGGAAGACAA GAACGGCGAC GGGCGCATTC AGTACTACAA TGACGCCAAT
GCGGACCTGC AAGCCAAAGC CGAAGCAAAC GGTTGGAAAG GCAATGAGCT CACCAACTTC
AACCGCGACA TCCTTGTGCT TGCAAACCCT GAGATTGCAT CGCTCCCCGG TTGGGTGATC
GGTCTGGTGG CCGCAGGTGG TCTCGCGGCG GCGCTTTCGA CCGCAGCCGG TCTCTTGCTG
GCGATCTCCT CGGCGGTGAG CCACGACCTT CTCAAGGGTC AGCTGACTCC CAACATGTCG
GAGAAATCCG AACTGTTGGC GGCGCGGGTG TCGATGGCAG CTGCAATCGT GGTGGCGGTT
CTTCTGGGCC TCAACCCTCC GGGGTTTGCG GCGCAGACGG TGGCGTTGGC CTTTGGTCTT
GCGGCAGCCT CGATTTTCCC GGCGCTGATG ATGGGGATCT TCTCGACTCG CATCAACAAC
AGCGGTGCGG TTGCAGGCAT GCTGGCCGGT CTCGTGGTGA CCTTGCTCTA TATCTTCCTG
CACAAGGGCT GGTTCTTCAT CCCGGACACC AATTCGTTCA CCGATGCCGA CCCGCTCCTT
GGGCCGATCA AATCCACCTC CTTTGGTGCA ATCGGAGCTC TGGTCAACTT TGCGGTGGCT
TATGTCGTCA CCAACATGAC CAAGGAAACT CCGCAGCACA TCAAGGATCT CGTCGAGAGC
GTCCGTGTGC CGCGCGGCGC AGGTCAAGCG GTCGACGGTC ACTAA
 
Protein sequence
MDQFTLNLLF VGASFALYIG IAIWARAGST SEFYAAGRGV HPVTNGMATA ADWMSAASFI 
SMAGLIAFTG YDNSSFLMGW TGGYVLLALL LAPYLRKFGK FTVSEFIGDR FYSPTARLVA
VICLLVASIT YVIGQMQGVG IAFGRFLEID AFWGLLIGAC VVFAYAVFGG MKGVTYTQVA
QYCVLITAYT IPAVFISLQL TGNPIPALGL FGSTESGEPL LAKLNQIVTD LGFAEYTAAH
GSTINMVLFT LSLMIGTAGL PHVIMRFFTV PRVSDARWSA GWTLVFIALL YLTAPAVGAM
ARLNISELMW PNGTEAQAVS VEQIETDPEY AWMATWQKTG LLGWEDKNGD GRIQYYNDAN
ADLQAKAEAN GWKGNELTNF NRDILVLANP EIASLPGWVI GLVAAGGLAA ALSTAAGLLL
AISSAVSHDL LKGQLTPNMS EKSELLAARV SMAAAIVVAV LLGLNPPGFA AQTVALAFGL
AAASIFPALM MGIFSTRINN SGAVAGMLAG LVVTLLYIFL HKGWFFIPDT NSFTDADPLL
GPIKSTSFGA IGALVNFAVA YVVTNMTKET PQHIKDLVES VRVPRGAGQA VDGH