Gene Emin_0476 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0476 
Symbol 
ID6262660 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp507886 
End bp509646 
Gene Length1761 bp 
Protein Length586 aa 
Translation table11 
GC content40% 
IMG OID642610947 
ProductNa/Pi-cotransporter II-related protein 
Protein accessionYP_001875370 
Protein GI187250888 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1283] Na+/phosphate symporter 
TIGRFAM ID[TIGR00704] Na/Pi-cotransporter 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACCT TTTTTATTAT TTTAAACATA TTAGGCGGGT TAGCCGTATT TTTATACGGT 
ATGAAATTAA TGAGCGACGG CATGCAAAAA ATAGCCGGCA ACTCAATGCG AAAACTACTC
GCTACAGCCC CCAAAAACAG AGTTACAGCA ACTTTAACCG GTATAATAGT AACTTCAATA
ATACAGTCTT CAAGCGCAAC TACTGTTATG GTAGTCGGCT TTAGCAGCGC CGGGATTTTA
AGTCTTACCC AGGCGATGGG CGTAATTTTC GGCGCGAATA TAGGCACCAC TGTTACGGTA
TGGATTATAA GCTTTTTCGG TTTTAAATTA CAACTTTCAT TATTTTTACT GCCTGTCATA
GCCGCAGGGT TTTTTATACT TTATGCGGTT AAGTGGAAAA CTCTTCATCG TCTTGGTGAA
GTTATGGTAG GCTTCGGTTT CATGTTTTTG GGCCTTTATA TAATACAGAT AACCGTACCT
GATTTCACAC AGGCTCCGCA GGTAGCGCAG TGGTTCGCCC AGCTCAGGCC CGATACGCTA
GGCTCCCTTT TACTGTTGAT ATCCATAGGC ACCGCTCTTA CCGCCGTTTT GGAATCTTCT
ACGGCTGTAA TAGCGCTTAC CATTACCCTT GCAGCCAAAG GCTTTTTTGA TTTTCCTACA
GCGGCCGCTC TTGTTTTGGG TGAAAATATC GGCACAACAA TTACGGCAAA TCTTTCGGCC
ATAGGCTCTT CCCGCACAGC GAAAAGAGCC GCGCTTGCGC ATTTTCTTTT TAACTTCTTA
GGAGTAGTAT GGGTTATATG TATATTTAAA TACTTTGTGG ATTTTGTTAA CTGGGTTATG
CCTGGCTCCC CGTACGGAAC GGACAACAAA ACTTTAATGC TGTATATTCC TTTTCACATA
AGCGCGTTTC ATACCATTTT TAACGTTTTT AACACGGTTA TAATGCTGTT TTTTATTAAA
CCGCTGGGCA GGCTTACAAC AATAATTATA CCCCACTCAA AACGTGAGGA AAAGCCCGCC
GGCCTTGTTT TTATAGACCC GCGTTTTACC GTAGCGCCGG AACTCGCCGT TGAAGCTGCC
CGCAAAGAAA TTGAAAGAAT GGCAGCCTCG GTAAGCAAAA TAATAAGCAA ACTTATACAC
GCTTTAAAAG TTGATGACGA CGCCATGTTT GAGCAGCTTA TTAAAGACGT TTATACTTTA
GAAGAAAGCA CCGACATCCT TGAATACAAA ATCAACACAT ATTTAGTAAA ACTTTTGCAT
GAAAGGATTT CAACTGAAGT TTTAGATGAA ACTATGGCTT TAATAGACAT TATTAATACT
ATTGAACGTA TGGGTGACGG CGGGCAAAAA ATAGCCAAAA TTATTGAAAT TACTAGAAAA
GATAAAGAAT TTTCCCAAAC TGATAAAGAT AATATTGAGG AAATAGCCCT TAAAGTTAAA
CAGGCTGTTA AAGACGCGAG AGAAGGCTTG TTAAGAACCG AATCAGTAAA GAGCAGCGTA
TCAAAAGAAG CCTTAAGAAA AGCTTTTGAA CGGGAAATGG AAATTAACGA TTTAAGGCAC
AGATTAAGAG ACGAAAGAAA TTTAAGAATG AAACAGGACT CTTCCATAAC GCCTATTTCT
TCAACGCATT ATTCCGATAT TCTAAGCCGT TTTGAAAGAA TGGCTGACCA TGCTTTAAGA
ATTATAGAAG CTTCAGCCAG CGGTAAAACG CCGGAGCAGG AAAGAGCCGA TATGGGCGGA
GAATTTAAAA AATATAAATA A
 
Protein sequence
MKTFFIILNI LGGLAVFLYG MKLMSDGMQK IAGNSMRKLL ATAPKNRVTA TLTGIIVTSI 
IQSSSATTVM VVGFSSAGIL SLTQAMGVIF GANIGTTVTV WIISFFGFKL QLSLFLLPVI
AAGFFILYAV KWKTLHRLGE VMVGFGFMFL GLYIIQITVP DFTQAPQVAQ WFAQLRPDTL
GSLLLLISIG TALTAVLESS TAVIALTITL AAKGFFDFPT AAALVLGENI GTTITANLSA
IGSSRTAKRA ALAHFLFNFL GVVWVICIFK YFVDFVNWVM PGSPYGTDNK TLMLYIPFHI
SAFHTIFNVF NTVIMLFFIK PLGRLTTIII PHSKREEKPA GLVFIDPRFT VAPELAVEAA
RKEIERMAAS VSKIISKLIH ALKVDDDAMF EQLIKDVYTL EESTDILEYK INTYLVKLLH
ERISTEVLDE TMALIDIINT IERMGDGGQK IAKIIEITRK DKEFSQTDKD NIEEIALKVK
QAVKDAREGL LRTESVKSSV SKEALRKAFE REMEINDLRH RLRDERNLRM KQDSSITPIS
STHYSDILSR FERMADHALR IIEASASGKT PEQERADMGG EFKKYK