Gene Msed_1016 details


Gene Information

Locus tag: Msed_1016
Symbol:
ID: 5104319
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 937070
End bp: 938323
Gene Length: 1254 bp
Protein Length: 417 aa
Translation table: 11
GC content: 46%
IMG OID: 640506915
Product: AAA family ATPase
Protein accession: YP_001191108
Protein GI: 146303792
COG category: [R] General function prediction only
COG ID: [COG1373] Predicted ATPase (AAA+ superfamily)
TIGRFAM ID:
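
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(937070, 938323)
print(length)                  # 1254
print(protein_length(length))  # 417
```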


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0133655
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGATATTG AGGATATTAA GCAGGTCATA GTGGATCAGT CAGACATACT TCAAAATAAA 
TTGAGAGGAA GGATAGTGAA GAGGGACGTC CCAGATCTAC TTAAGTACTT GAGGGCCCCA
AACGCCCTAG CCATCCTAGG AGTAAGGAGG GCTGGAAAAT CCACCCTGGC AACTCTTCTC
CTTCAGGGGA AAAAATTCGC GTATGTAAAC TTCGAAGATG AGAGATTGAA AGGTATCAGG
AAAGAGGAAC TGAATAAGGT TCTCCAGGGT ATTCATGAGC TGTACGGTGA CGTTGAATAC
ATCATCTTTG ACGAGATACA AGGAGTAGAG GGCTGGGAGC CCTTCGTTTC AAGGTTAAGG
GATGTGAAGA GGGTAATCGT CACTGGGACC AACTCCAAAC TTTTGTCCGG TGAGCTAGCT
ACATCCCTAA CGGGTAGACA CAGCGATTTC ATCCTCTTTC CCTTTTCCTT TCGCGAGTAC
CTGAGATATA AGGGCGAAGA GGTTGCAGAT AACGACTTTT ACTCCACGTT AAGGGTATCC
AGACTGAAGG TGGAACTTGA GAACTACATT GTGGAGGGAG GATTTCCGGA GTCCCTAATT
CTGAGCAGGG AACAGGTGAA CTTCATCTAT AATGACATAC TCTTCAAGGA CGTGATAGCG
AGATACAGGA TTAGGGAAAT AGGTAAGTTT AGGGAGTTTG CCAGGACACT AGTCTCCTAC
TACTCCAATG AGGTCTCGCT CTCCTCTTTA GCTAAAGTCC TGGGTCTGAA CAAGGTTACC
GTGGAGATGT GGGCTAATGG GTTGAGTGAG GCATACCTAA TCTTCTTTCT ACCAAGGTAC
GGCGAGAAGC TAAAGCAAAG GCTAACATAC AACAAGAAGG TTTACGTTGT AGATCCTGGA
ATAATTTCAA GCGTAGCCAT AAAAGGAAAG GACAAGGGTA GAATAATGGA GAACCTGGTG
GCTATCAAGC TCGTGAGAGA ACTTCAGGGT ACAGATCATT TATACTACGT TAGGAACGGT
TTCGAGGTCG ACTTTTACGA TGAGTTAAAC TCTCGACTGA TTCAGGTTAC CTATGCTAGC
GATGTAGTAG AGGAGAGGGA AATAAGGGGG TTAATCAGGG GTCACGAGCT AACAAGGGCC
AAGGAGCTAA TAGTTGTCAG CTGGGATTTA AGGGAGACAA TCAAGCATGA GGGCATGGAG
ATAAGGGTAA TCCCCTTGTA TCAGTTCTTG CTGAGAGACT ACAACACTCT GTGA
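
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be joined before any analysis. A minimal sketch of how the blocks could be cleaned up and the reported GC content recomputed follows; `clean_sequence` and `gc_percent` are hypothetical helper names chosen for illustration.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by removing all whitespace; uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already-cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence as printed in the record.
first_row = "GTGGATATTG AGGATATTAA GCAGGTCATA GTGGATCAGT CAGACATACT TCAAAATAAA"
print(len(clean_sequence(first_row)))  # 60 (six blocks of 10 bases)
```

Passing the full cleaned gene sequence to `gc_percent` gives a value that can be compared with the GC content field in the record.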
 
Protein sequence
MDIEDIKQVI VDQSDILQNK LRGRIVKRDV PDLLKYLRAP NALAILGVRR AGKSTLATLL 
LQGKKFAYVN FEDERLKGIR KEELNKVLQG IHELYGDVEY IIFDEIQGVE GWEPFVSRLR
DVKRVIVTGT NSKLLSGELA TSLTGRHSDF ILFPFSFREY LRYKGEEVAD NDFYSTLRVS
RLKVELENYI VEGGFPESLI LSREQVNFIY NDILFKDVIA RYRIREIGKF REFARTLVSY
YSNEVSLSSL AKVLGLNKVT VEMWANGLSE AYLIFFLPRY GEKLKQRLTY NKKVYVVDPG
IISSVAIKGK DKGRIMENLV AIKLVRELQG TDHLYYVRNG FEVDFYDELN SRLIQVTYAS
DVVEEREIRG LIRGHELTRA KELIVVSWDL RETIKHEGME IRVIPLYQFL LRDYNTL
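
The gene sequence starts with GTG, yet the protein starts with M. Under translation table 11, GTG is an accepted alternative start codon and is read as methionine at the initiation position. The sketch below illustrates this on the first and last codons of the record. It is a hand-rolled illustration, not a library implementation: the codon assignments follow the standard layout (table 11 uses the same amino-acid assignments), and `START_CODONS` lists only three common initiators as a simplifying assumption.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, in TCAG order at each codon position.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a subset of the start codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a CDS, reading an initial start codon as M and stopping at '*'."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


print(translate("GTGGATATTG AGGATATTAA GCAGGTCATA"))  # MDIEDIKQVI
print(translate("AACACTCTGTGA"))                      # NTL
```

The two outputs match the first ten and last three residues of the protein sequence listed in the record.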