Gene Msed_2082 details


Gene Information

Locus tag: Msed_2082
Symbol:
ID: 5105062
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 2000465
End bp: 2002267
Gene Length: 1803 bp
Protein Length: 600 aa
Translation table: 11
GC content: 46%
IMG OID: 640507972
Product: AAA ATPase
Protein accession: YP_001192146
Protein GI: 146304830
COG category: [R] General function prediction only
COG ID: [COG0433] Predicted ATPase
TIGRFAM ID:
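
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the helper names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(2000465, 2002267)
print(length)                  # 1803
print(protein_length(length))  # 600
```

Both results agree with the Gene Length and Protein Length fields of this record.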


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.901274
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGATTGATG AACTCTACAA GGGCATGAAG GAGAGAACCA GTAGGGCCAG AGCCTTGGCA 
GTAAGCCTAG GGAAGATGAT AGGAAAGATT TCCAAGTATA TGCCAAACAG GATAGACCAG
GAGAGCCTTT ACGTAAACGC AGTAGTTGAT GCTGAAACGT ATTACTCTAA CGAATACCTT
GGTAGGATGG GAATCCTTCT GGGTGCAGTG GACATTAGAA CTCTAGAGTT CATACTGCTT
CAAGTTGTTG GGTATGAGAG AAGTGATATC ACGTCATCGC TGTTCAACTC AACTGACATC
ACTCCAAAAC TGGTTGAGGA CGACGATCCA GGAACCCTGA TAGGAAACGT TATACTTAAG
TGCGAGATGC TCACTAGACT TGACCCTCTC GGTAGAGGGG AACCTGTGCC CGCGGACATA
ATACCTGAAC CCCAGTCACC CGTGATAATC CCCTATCCTG AGATGGTCGA GAGGGCCATG
GGGATCAATC GCGGGAAATT GAAACTTGGC TTTCTCGACA TTTCCAACTC CGAGGCAAGG
GTCAGTATCC CTCCTGAGGA GTTGAACTTT CATAGTCTGG TCATAGGCAC CACAGGGGCC
GGGAAGACCT CATTCATTAA GGACGTGATA ACCTCATTGG TCACTACGGT TGAAGACGAA
CAGGTGATCA TCTTTGATGC CACAGGGGAT TATTACCACT CATTCCTCCC ACCTGATTTA
ACCTCAGAAC ATGTTAGGAG AGGGATAGAG GACTTCAATA AGCTGAACGG TCCAGTCAAC
GGGCTACACG AAGATATTCT CTTTCCGATT ACTAGTCGTT GGCTAAGGAA GTATACTCAG
GAGAGAACGG AAGAAGAAAT AACCAAGACC TACTATGATC TCTACATCAA ACCCCTAGTC
AATTACATAG AGAGGAAGGG GATGAAGATT GATGTTCAGA TTAGCGGAAG GAGGATTGAA
CTCGCCTCAG ATTACTGGAA ATCCAGCGCC GAGGTCCACC CCTTCTACCT GTCGTTCAGG
GAAAACAGAC GGATAGTTCA TAAGTTAAAC CCATACTTTA GTGAGCAGGC ATCCCATTTC
CTGAAGATAA TTACCTCTCA ACTGAAGGAC GTCGAGAGCC TGGACGAGTT CATTGAGAAC
ATGAACGAGG AGAACTTTGA GAAACTGCAG GTCCACAAGA GCACGCGTGA AAATATACTT
CGTGGGCTCT ACTTGCTCAG GGAAACTGGA CTTTTTGATC TGCGTTCTCC TAGGACGTCC
TTGGGAGACC TACTTTCCAC CTCTAAGATG TTGACAATTG ACCTCTACAA TCAGGAGCTC
GACGATTTCG CCCAGAAAAT ACTCACCTAC TATTTCCTGG ATAGAATCTT CCAGCTTAGG
GAGAGCAAGA TGAGAAAGGG CGAGATTAAC AGTAAACTGC TCATTATCAT AGATGAGGCC
CATAGGTTCT TTCCGTCAAA CAGAGGAGGG GAGGAGGACA GCAACTACGT GAGAAGGGTT
GCGGGAAAGA TATCTGTAAT GATGAGGTTA GGCCGTAGGA GAAGAATAGG GTTCATGTTC
TCCACTCACA ATCCCTCCGA TCTAAGTGAT ATTATTGTTC AGTTAGCTAA CACCAAGTTT
GTTTTCAGGA CATCTCTGGA CATTGCGGAG AGTTTAGGCG TTCCTAGATC GGAGGGGAAA
ATATTAAGCT GGGAGAGGAA TGGTGTAGCA TATATGATTT CGCCATGGCT GAAACAAGGA
AGATTAAAGG TCAGGGTTCC TGTTCCTCCT CCCATTGGCC ATTACGATCT CTCTAGGACT
TAG
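
The gene sequence above is printed in space-separated blocks of ten bases, so it needs cleaning before any computation such as GC content. The following is a minimal sketch, assuming the sequence has been copied into a Python string; `clean_sequence` and `gc_percent` are hypothetical helper names. Only the first printed line is used here to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First printed line of the gene sequence (six blocks of ten bases).
first_line = "TTGATTGATG AACTCTACAA GGGCATGAAG GAGAGAACCA GTAGGGCCAG AGCCTTGGCA"
seq = clean_sequence(first_line)
print(len(seq))
print(gc_percent(seq))
```

Applying the same two functions to the full 1803 bp sequence would give the value to compare against the GC content field of the record.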
 
Protein sequence
MIDELYKGMK ERTSRARALA VSLGKMIGKI SKYMPNRIDQ ESLYVNAVVD AETYYSNEYL 
GRMGILLGAV DIRTLEFILL QVVGYERSDI TSSLFNSTDI TPKLVEDDDP GTLIGNVILK
CEMLTRLDPL GRGEPVPADI IPEPQSPVII PYPEMVERAM GINRGKLKLG FLDISNSEAR
VSIPPEELNF HSLVIGTTGA GKTSFIKDVI TSLVTTVEDE QVIIFDATGD YYHSFLPPDL
TSEHVRRGIE DFNKLNGPVN GLHEDILFPI TSRWLRKYTQ ERTEEEITKT YYDLYIKPLV
NYIERKGMKI DVQISGRRIE LASDYWKSSA EVHPFYLSFR ENRRIVHKLN PYFSEQASHF
LKIITSQLKD VESLDEFIEN MNEENFEKLQ VHKSTRENIL RGLYLLRETG LFDLRSPRTS
LGDLLSTSKM LTIDLYNQEL DDFAQKILTY YFLDRIFQLR ESKMRKGEIN SKLLIIIDEA
HRFFPSNRGG EEDSNYVRRV AGKISVMMRL GRRRRIGFMF STHNPSDLSD IIVQLANTKF
VFRTSLDIAE SLGVPRSEGK ILSWERNGVA YMISPWLKQG RLKVRVPVPP PIGHYDLSRT
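
The gene sequence begins with TTG rather than ATG, while the protein sequence begins with M. This is expected for an alternative start codon under translation table 11. The sketch below illustrates the idea on the first 30 bases only. It uses the standard codon assignments, which table 11 shares for internal codons, and treats ATG, GTG, and TTG as start codons; that is an assumed subset of the start codons permitted by table 11, and `translate_cds` is a hypothetical helper.

```python
from itertools import product

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard codon assignments, built in TCAG order.
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: a subset of the start codons allowed by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(seq: str) -> str:
    """Translate a CDS; a recognized start codon is read as M, and a stop ends translation."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is always methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate_cds("TTGATTGATGAACTCTACAAGGGCATGAAG"))  # MIDELYKGMK
```

The output matches the first ten residues of the protein sequence shown in this record.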