Gene Elen_1474 details

Gene Information

Locus tag: Elen_1474
Symbol:
ID: 8415772
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 1763530
End bp: 1765212
Gene Length: 1683 bp
Protein Length: 560 aa
Translation table: 11
GC content: 62%
IMG OID: 645024443
Product: sulphate transporter
Protein accession: YP_003181832
Protein GI: 257791226
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0659] Sulfate permease and related transporters (MFS superfamily)
TIGRFAM ID: [TIGR00815] high affinity sulphate transporter 1
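
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1763530
end_bp = 1765212
reported_gene_length = 1683      # bp
reported_protein_length = 560    # aa


def gene_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp):
    """Residue count, assuming the final codon is an uncounted stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length == reported_gene_length)        # True
print(computed_protein_length == reported_protein_length)  # True
```

Under these assumptions, 1765212 - 1763530 + 1 gives 1683 bp, and 1683 / 3 - 1 gives 560 residues, which agrees with the reported values.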


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.525598
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCAACGCG ACAAGATCAA ACCGATACTT TTCAGCATCA TCAAGCACAC CGACAGCGAA 
GAGCTCAAAC GACAGCTTCC CAAAGACGTT GTTTCGGGCG TCATGGTGGC CGTGGTGGCG
CTGCCTCTTT CCATCGCGCT GGCCATCGCT TCGGGCGTGA GCCCTGAGCA AGGCCTTTAC
ACGGCCATCG TCGCCGGGTT CCTCATCGCG TTCCTCGGCG GCAGCCGCGT GCAAATCTCG
GGCCCCACGG CGGCGTTCGC CACCATCGTT GCAGGCATCG TGGCAACCGA CGGCATGGAC
GGCCTCGTCG CAGCCACCGT CATCGCCGGC GTGCTGCTCA TGCTGATGGG CTTCTTCAAG
CTGGGATCGC TCATCCGGTT CGTTCCCTAC ACCATCACCA CGGGCTTTAC GGCAGGCATC
GCCGTGACCC TCGTCATCGG CCAAGTCAAA GACTTCCTCG GCCTCGCCTT CCCCGCCGGC
GCGCCCACGG TCGAGACTAT GGACAAGCTG CAGGCCGTCG CCCAAAGCAT CGGAACGGCC
AACTGGCAAG CGTTCGTCGT GGGAGCGGTC TGCCTCGCCA TCCTGTTCGC CTGGCCTAAG
GTCAGCGAAC GCATCCCCGG ATCCCTTGTC GCGCTCATCG TGGGAATCGC CCTGGTCAGC
GGCTTCGGCA TGCAGGTGAG CACCATCGGT GACCTGTACG CCATCAGCAG CGACCTGCCC
GAATTCCGCA TTCCCCAACT CAACGTCGAC CTGCTGGCCG ACCAGCTTCC CAACGGCATC
ACCATCGCCA TCCTGGCCGC GATCGAATCG CTCCTGTCCT GCGTCGTCGC CGACAGCATG
ATCAGCTCGC ACCACCGCAG CAACATGGAG CTGGTTGCGC AGGGCGTGGG CAACATCGGC
TCGGTACTAT TCGGCGGAAT CCCCGCAACC GGCGCCATCG CGCGCACCGC CGCGAACGTG
AAGAACGGAG GCAGAACGCC CGTCGCCGGC ATGACCCATG CGCTCGTGCT GCTGATCGTC
CTCGTGTTCT TCATGCCCTA CGCGGCTCTC ATCCCCATGC CGACCATCGC GGCCATCTTG
CTGCACGTCG CCTACAACAT GTCGGGATGG CGCAATTTCG CACACTTGTG CAAGACGGCC
TCCCGAGGAG CGGTGGCCAC GCTGCTGCTC ACCTTCGCGC TGACCGTCGT GTTCGACCTG
GTGGTGGCGA TTGCCGTGGG CATGTTGATC ACGGTCGTCC TGTTCATGAA GATGGTGAGC
GAGGAGACCG AGGTTCGCGG CTGGAAATAC TACTGCGACG AGGATTCCGA GGTCACGCAC
CTGCGCGAAC TCCCTGAAAG CGTGCGCGTG TACGAGATCA ACGGACCCAT GTTCTTCGGC
ATGACCGACC GCATATCCGA CATATCGGTG AAATCCTTCA CGAAGTACCT GATCATCCGC
ATGCGAGGCG TGCCATCGCT CGATTCGACG GGCATGAACG CGCTGGAGAA CCTCTACGGG
TACTGCCGCG AGAACGGCGT CAGCCTCATC TTCTCGCACG CCAACGAGCA GCCGATGAAA
ACCATGCGCC GCGCTGGTTT CGTGGACATG GTGGGAGAAG ACCATTTCCG CAGCAATATC
GACGATGCAA TCGCTTACGC GCGCAAGCTG CTGGACGAAG AGGGGGAAAC GGCATCGGCC
TAG
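
The GC content field can be recomputed from the gene sequence above. This is a minimal sketch, assuming GC content is defined as the percentage of G and C bases among all A, C, G and T bases. The record does not state how its 62% figure was derived or rounded, so small differences are possible. The function name `gc_percent` is hypothetical.

```python
def gc_percent(seq):
    """Percentage of G/C among the A, C, G, T characters in seq.

    Whitespace and line breaks (as used in the 10-base blocks of the
    record) are ignored.
    """
    bases = [c for c in seq.upper() if c in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc_count = sum(1 for c in bases if c in "GC")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence, pasted exactly as shown in the record.
first_row = "TTGCAACGCG ACAAGATCAA ACCGATACTT TTCAGCATCA TCAAGCACAC CGACAGCGAA"
print(round(gc_percent(first_row), 1))
```

Passing the full gene sequence, rather than only its first row, gives the value to compare with the GC content field.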
 
Protein sequence
MQRDKIKPIL FSIIKHTDSE ELKRQLPKDV VSGVMVAVVA LPLSIALAIA SGVSPEQGLY 
TAIVAGFLIA FLGGSRVQIS GPTAAFATIV AGIVATDGMD GLVAATVIAG VLLMLMGFFK
LGSLIRFVPY TITTGFTAGI AVTLVIGQVK DFLGLAFPAG APTVETMDKL QAVAQSIGTA
NWQAFVVGAV CLAILFAWPK VSERIPGSLV ALIVGIALVS GFGMQVSTIG DLYAISSDLP
EFRIPQLNVD LLADQLPNGI TIAILAAIES LLSCVVADSM ISSHHRSNME LVAQGVGNIG
SVLFGGIPAT GAIARTAANV KNGGRTPVAG MTHALVLLIV LVFFMPYAAL IPMPTIAAIL
LHVAYNMSGW RNFAHLCKTA SRGAVATLLL TFALTVVFDL VVAIAVGMLI TVVLFMKMVS
EETEVRGWKY YCDEDSEVTH LRELPESVRV YEINGPMFFG MTDRISDISV KSFTKYLIIR
MRGVPSLDST GMNALENLYG YCRENGVSLI FSHANEQPMK TMRRAGFVDM VGEDHFRSNI
DDAIAYARKL LDEEGETASA
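
The protein sequence can be reproduced from the gene sequence using translation table 11, as listed in the record. The sketch below assumes the NCBI definition of table 11: the standard codon assignments, with TTG, CTG, GTG, ATT, ATC, ATA and ATG accepted as start codons that are read as methionine in the first position. The function name `translate_cds` is hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G);
# table 11 uses these same assignments for internal codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

# Start codons of table 11 (assumption: NCBI definition).
START_CODONS = {"TTG", "CTG", "GTG", "ATT", "ATC", "ATA", "ATG"}


def translate_cds(dna):
    """Translate a CDS, reading a valid start codon as M and
    stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("TTGCAACGCG ACAAGATCAA ACCGATACTT"))  # MQRDKIKPIL
```

The gene begins with TTG, which would normally encode leucine but is read as methionine when used as a start codon. This is why the protein sequence begins with M.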