Gene Emin_0405 details

Gene Information

Locus tag: Emin_0405
Symbol:
ID: 6262479
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 432369
End bp: 434051
Gene Length: 1683 bp
Protein Length: 560 aa
Translation table: 11
GC content: 42%
IMG OID: 642610872
Product: sulphate transporter
Protein accession: YP_001875299
Protein GI: 187250817
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0659] Sulfate permease and related transporters (MFS superfamily)
TIGRFAM ID: [TIGR00815] high affinity sulphate transporter 1
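
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only and is not part of the database record; it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the record above.
start_bp = 432369
end_bp = 434051

# Assumption: coordinates are 1-based and inclusive, hence the +1.
gene_length_bp = end_bp - start_bp + 1

# A CDS of N bp holds N / 3 codons. Assumption: the final codon is the
# stop codon, which does not contribute a residue to the protein.
codon_count, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1683
print(remainder)          # 0
print(protein_length_aa)  # 560
```

Both results agree with the Gene Length and Protein Length fields of the record.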


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCAGAC CCAAACTTTT CACACTAATC AAAAATAGAC CGGAAGAATT TACACCTCGG 
CGCGTATTAA AAGATATCGG CGCGGGCGCG GTTGTGGCTT TCATAGCGAT GCCTCTTTCA
ATAGCTTTAG CGATAGCAAG CGGCGTCACT CCTGAAATAG GCTTAGCCAC CGCGGTGATA
GCGGGCTTTC TTATTTCATT TTTTGGAGGC AGCAGAGTAC AGATAGGCGG GCCTACGGCC
GCTTTCGTAA TAATAATTTT AGGTATAACA GCCGAGTACG GGCACGACGG GCTTATAGCC
GCAACGGCAA TGGCGGGCAT AATACTTATT ATAATGGGGT TTTTAAAACT GGGAAGCGTA
ATCCAATATA TTCCATATCC CGTTACCACA GGTTTTACAA GCGGCATAGC CGTAGTTTTA
TTTTCCACGC AGGTTAATGA CTTTTTAGGC CTTAATTTAA CAAACATGCC TTCGGAATTT
TTTGATAAAT GGCTTGTTTA TTTCCAAAAT TTAAATCATA TTGATTTACC TACGGTATTT
ATAGGCATGC TCGCTTTAGC GATAATAATG TTTTGGCCTA AAAAACTAAA GGCAATTCCC
GGCACTTTAG CAGCAATTAT TATAACTACT TTAGTTGTAA AATTTTTCCA TTTAGATATT
GAAACAATTT ATTCACGCTT TGGCGAAATA GGACACTCTT TCCCAAAACC GCATTTGCCC
AATCTTACCT GGGATATGAT TCAAAAACTT TTAAGGCCCG CTTTTGTAAT AGCCGTTTTG
GCCGGTATTG AAAGTTTACT TTCAGCCGTA GTTGCGGACG GTATGATAGG CAAACGCCAC
CGATCCAACA CCGAGCTTAT AGCCCAAGGT ATAGCGAACA CGGTCTCCGC CGCGTTCGGC
GGGTTGCCAG CCACGGGCGC CATAGCCAGA ACAACGGCAA ATATTGAAAA CGGCGGCAGA
ACGCCCATAG CGGGCATAAT GCACGCTGTC TGCATTCTTA TAATGATGCT TCTTTTTATG
CCTTATATAA GCCTTGTGCC TATGGCTACG TTAGCGGCTA TACTTTTTAC TGTGGCTTAC
CGAATGAGCG AATGGCGCAG CTTTGTATTT TTATTTAAAG CGCCTTTAAG CGATATTTTG
GTGTTACTCA CAACATTCCT GCTTACAGTT ATGAAAGACC TTGTTATCGC CATTGAAGTA
GGTATGATTC TTGCCGCGAT ACTCTTTATA AAACGTATGG TTAACGTATA TAACATCGCG
CGCTTAACCG ATGACGACCT TGTTAACGAG TTTGAGGAAG ACGACGATTT GGACAAAAAA
ACAATTGCCC AGCACGTGCG TGTTTATGAA ATTAACGGGC CTTTCTTTTT CGGTGCGGCA
AATATGTTTT TAGAAACGCT TGAAAACATC GCGGACTGCA AAGTTTTAAT TTTGCGTATG
CGAAGCGTGC CCGCTATGGA TGCAACGGCT TTCCACGCTT TAAATAAAAT ATATTTAAGA
TGTAAAAAAG ATAATATAAC GCTTATTCTT TCCGAAGTGC CTAACCAGCC TTACAAAACG
CTCAGAAAAT ACAACTTTGT GTTTGAGATA GGCAAGGAAA ATGTTTTGCG ATCTTTTAAC
GCGGCTCTTA AAAAAGCGGC AAAAACAGCT AAGGAAAAAC AAGCGTCTGA AGAAACTAAG
TAA
 
Protein sequence
MFRPKLFTLI KNRPEEFTPR RVLKDIGAGA VVAFIAMPLS IALAIASGVT PEIGLATAVI 
AGFLISFFGG SRVQIGGPTA AFVIIILGIT AEYGHDGLIA ATAMAGIILI IMGFLKLGSV
IQYIPYPVTT GFTSGIAVVL FSTQVNDFLG LNLTNMPSEF FDKWLVYFQN LNHIDLPTVF
IGMLALAIIM FWPKKLKAIP GTLAAIIITT LVVKFFHLDI ETIYSRFGEI GHSFPKPHLP
NLTWDMIQKL LRPAFVIAVL AGIESLLSAV VADGMIGKRH RSNTELIAQG IANTVSAAFG
GLPATGAIAR TTANIENGGR TPIAGIMHAV CILIMMLLFM PYISLVPMAT LAAILFTVAY
RMSEWRSFVF LFKAPLSDIL VLLTTFLLTV MKDLVIAIEV GMILAAILFI KRMVNVYNIA
RLTDDDLVNE FEEDDDLDKK TIAQHVRVYE INGPFFFGAA NMFLETLENI ADCKVLILRM
RSVPAMDATA FHALNKIYLR CKKDNITLIL SEVPNQPYKT LRKYNFVFEI GKENVLRSFN
AALKKAAKTA KEKQASEETK
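
The protein sequence above is the translation of the gene sequence under translation table 11. A minimal sketch of that relationship follows; it is not the database's own tooling. It assumes the sequences are pasted in as plain strings with the space-separated blocks of ten shown above, and it uses the fact that table 11 assigns the same amino acids to codons as the standard code (the two differ only in which codons may act as start codons, which does not matter here because the gene begins with ATG). The helper names `clean`, `translate` and `gc_fraction` are hypothetical.

```python
# Codon table in the usual T, C, A, G ordering. For table 11 the
# codon-to-amino-acid assignments match the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display; upper-case."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C (multiply by 100 for percent)."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence from the record above.
first_gene_line = (
    "ATGTTCAGAC CCAAACTTTT CACACTAATC AAAAATAGAC CGGAAGAATT TACACCTCGG"
)
print(translate(first_gene_line))  # MFRPKLFTLIKNRPEEFTPR
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.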