Gene Msed_2086 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_2086 
Symbol 
ID5105066 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp2004741 
End bp2005958 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content45% 
IMG OID640507976 
Productcitrate transporter 
Protein accessionYP_001192150 
Protein GI146304834 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1055] Na+/H+ antiporter NhaD and related arsenite permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.771625 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCTGT TAGCGTTTGC TGTTGTTATA GTCACCTACG CACTAATAGC AACTAGGGGA 
GTTTCTGGAA TACCCCCTTG GGCCTCCATG TTTTTCGGGG GTGTAATGAT GATAGTGACG
GGTGTTATTA CTCCTCAGGA GGGATTTGCG TCTATCAACC TTGACGTCGT CCTTTTCCTC
ATTACCCTCT TCACCTTCGC CTCTGCCCTG GAGGTTTCAG ACTTTCTAAA ATACTTAGGA
TTTTATATTG TAAATAAGTT CAAGACGCCC TCTAGAACTC TTTTTGGGGT GTTGCTCTTC
TCCGGTTTGC TGTCAAATCT GGTGACCAAT GACGGTGTCT CGGCTAGCTG GACGCCAGTA
ATACTTGAAA GCAGTAAGAA ACTTGGAGTG GATGAGAAAC CATTCCTGTA CGCGCTAGCA
TTTGGAGTTA CAATTGGAAG CGTAATGCTC CCCACCGGAA ACCCACAAAA TCTTCTAATA
GCCCTAGATG CCGGTCTGAA GAATCCCTTC ATCGAGTTTG CGATGATCTT GGTACTGCCC
ACGCTGATCA ACCTTGTCCT ATCGTACCCC ATTTTACTCC TACTATTTAG GAAGGAACTT
AAGAGTGACG GAGAGATTGG CCATTTTCAG GAAAAAATAG AGGACCCTGT CACTGCCTAC
ACATCTCTGG CTCTACTGGG TATTACTGTA GTTTTATTTT TCTCACTTAG TTTTCTCGGA
ATAGACATAG TCCTAGGTTC CCTTACCACT TCGTCTATCC TTATCCTAGT CTCCAAAAGG
AGGAGAGAAA TCATTAGGAG GATGGATTGG TCTACCATTT TGTTCTTCAT AGGATTGTTC
ATGTTTACAG AGGGCATGAT AAAGGGTGGG GTCCTGAGTG CCATAGTCCG CTATCTTCCA
TCCCCGTCGT CCGTCTTTAC CATTATGCTG GTAAGCGTTC TGGTTAGCCA ATTGCTTAGC
AACGTACCCC TGGTCGCAAT ATACATCCCT GTGATGATTT CCTCTGGTGC CACGTCTCCG
CTGGATTGGC TCGCCCTAGC CGCAGGTAGC ACAATAGCTG GCAACTTCAC GTTAATAGGA
GCAGCTAGTA ATGTGATAAT CTCAGAGTCT TCAGAAAGTA GGGGAGGAAA AGGATTTGGA
TTTATAGAGT TTATTAAAAA CTCTGTTCCT CTGTTGATAG TAAATTTTCT AGTTCTCTAC
CTATTTCTTA GGCTGTAG
 
Protein sequence
MNLLAFAVVI VTYALIATRG VSGIPPWASM FFGGVMMIVT GVITPQEGFA SINLDVVLFL 
ITLFTFASAL EVSDFLKYLG FYIVNKFKTP SRTLFGVLLF SGLLSNLVTN DGVSASWTPV
ILESSKKLGV DEKPFLYALA FGVTIGSVML PTGNPQNLLI ALDAGLKNPF IEFAMILVLP
TLINLVLSYP ILLLLFRKEL KSDGEIGHFQ EKIEDPVTAY TSLALLGITV VLFFSLSFLG
IDIVLGSLTT SSILILVSKR RREIIRRMDW STILFFIGLF MFTEGMIKGG VLSAIVRYLP
SPSSVFTIML VSVLVSQLLS NVPLVAIYIP VMISSGATSP LDWLALAAGS TIAGNFTLIG
AASNVIISES SESRGGKGFG FIEFIKNSVP LLIVNFLVLY LFLRL