Gene TM1040_3648 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3648 
Symbol 
ID4075076 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp702946 
End bp704712 
Gene Length1767 bp 
Protein Length588 aa 
Translation table11 
GC content59% 
IMG OID638005168 
Productsulfate permease 
Protein accessionYP_611877 
Protein GI99078619 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0659] Sulfate permease and related transporters (MFS superfamily) 
TIGRFAM ID[TIGR00815] high affinity sulphate transporter 1 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGAGCT TTCGCCAATA TTTCCCCATC CTCGTATGGG GCCGGGACTA CGACAAATCC 
GCCCTGTCGA ATGATCTCAT AGCGGCGGTG ATCGTAACGA TCATGTTGAT CCCGCAGTCG
TTGGCCTATG CCCTTTTGGC GGGATTGCCG CCCGAAGCGG GGATCTATGC CTCCATCGCG
CCTATCCTGC TCTATGCGGT TTTTGGCACC AGCCGGGCGC TTGCGGTTGG TCCGGTTGCG
GTGGTGTCGC TGTTGACAGC ATCTGCTGTG GGGCAAGTGG CCGAACAGGG CACGGCGGGG
TATGTCGTCG CGACCCTCAC GCTGGCCTTC TTGTCGGGGA GCTTTCTGGT TCTGATGGGG
GTGCTCAAAC TTGGCTTCAT TGCCAATTTT CTGAGCCACC CGGTCATAGC GGGTTTTATC
ACCGCATCGG GTATTCTAAT CGCGACAAGC CAGATCAAAC ATATCCTCGG CATTCGTGCC
GAAGGTCATA CCTTGCCGGA GATGCTCTAT TCGATCGCGC TGCGTCTGGG CGAGGTAAAC
TGGATCACCT TGTTGATCGG AGCCAGCGCG ACAGGCTTTC TATTCTGGGC TCGCAAACAC
CTGAAGCAGA CACTGCATGG CATGGGGACG CCGCCGCTCT TGGCGGATAT TCTGAACAAG
GCGGGGCCAG TGGCGGCCGT CGTCACCACG ACTGTGGTTG TCTGGGGATT TGACCTTGCT
GAGAAGGGCG TCAAAATCGT GGGTGAAGTA CCCCAAGGCT TACCACCGCT CACGATGCCG
GGCTTTGCTC CCGATCTGAT CGGAGCGCTT CTGGTGCCCG CGATCCTGAT TTCCATCATC
GGTTTTGTTG AGTCTGTTTC CGTGGCGCAA ACTCTTGCCG CCAAGCGACG CCAGCGCATT
GACCCGGATC AGGAGTTGAT CGGCCTCGGC GCGGCCAATT TAGGGGCCGC CTTTACCGGT
GGCTACCCGG TGACAGGCGG CTTTGCACGG TCGGTTGTGA ACTTTGACGC TGGCGCCGAG
ACGCCGGCTG CTGGGGCCTT CACGGCCATC GGGTTGGCCC TTGCCGCCGT GGCCCTCACC
CCGTTGGTTT ATTACCTGCC GATCGCGACA CTAGCGGCGA CCATCATCGT GGCTGTGCTG
AGCCTCGTCG ACCTGTCGAT CCTCAAAAAA ACCTGGACCT ATTCGCATGC CGACTTCATC
GCTGTTGCGG CCACCATTCT TTTGACCCTG GGACTCGGTG TCGAAATCGG TGTCGCTTCC
GGCGTCATCC TCTCTGTGGT CTTGCACCTC TACAAGACCT CTCGCCCCCA TGTGGCGGAG
GTTGGGCTGG TGCCCGGCAC CCAGCATTTT CGCAACATCG ATCGTCACAA CGTCCAAACG
GACCCCCGTT TGGTGTCGCT GCGCGTCGAT GAAAGTCTCT ACTTCGTCAA CGCCCGATTT
CTCGAGGACC TGATCCAGAA ACGCGTCACC GAAGGCTGCG CGATCAAACA TGTGGTGCTT
ATGTTTTCGG CGGTGAACAT GGTGGACTAT TCTGCGCTCG AGAGCCTCGA AGCCATCAAT
CACCGCCTAA AGGACATGGG CGTTGGTCTC CACCTTTCCG AGGTCAAAGG GCCCGTGATG
GATCGCCTTC AAAGATCTGA TTTCATTGAC GAAATGAACG GAAAGATCTT CCTTTCTCAA
TATGAGGCCT GGGCCAATCT GACCGCGGGG GCGCAACAGG GCGCTGCGGA CACAGGGCAG
GGCGACCAGC TGCGCTGCGG CGCATGA
 
Protein sequence
MPSFRQYFPI LVWGRDYDKS ALSNDLIAAV IVTIMLIPQS LAYALLAGLP PEAGIYASIA 
PILLYAVFGT SRALAVGPVA VVSLLTASAV GQVAEQGTAG YVVATLTLAF LSGSFLVLMG
VLKLGFIANF LSHPVIAGFI TASGILIATS QIKHILGIRA EGHTLPEMLY SIALRLGEVN
WITLLIGASA TGFLFWARKH LKQTLHGMGT PPLLADILNK AGPVAAVVTT TVVVWGFDLA
EKGVKIVGEV PQGLPPLTMP GFAPDLIGAL LVPAILISII GFVESVSVAQ TLAAKRRQRI
DPDQELIGLG AANLGAAFTG GYPVTGGFAR SVVNFDAGAE TPAAGAFTAI GLALAAVALT
PLVYYLPIAT LAATIIVAVL SLVDLSILKK TWTYSHADFI AVAATILLTL GLGVEIGVAS
GVILSVVLHL YKTSRPHVAE VGLVPGTQHF RNIDRHNVQT DPRLVSLRVD ESLYFVNARF
LEDLIQKRVT EGCAIKHVVL MFSAVNMVDY SALESLEAIN HRLKDMGVGL HLSEVKGPVM
DRLQRSDFID EMNGKIFLSQ YEAWANLTAG AQQGAADTGQ GDQLRCGA