Gene Msed_1138 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1138 
Symbol 
ID5103486 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1074669 
End bp1075916 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content51% 
IMG OID640507030 
ProductAAA family ATPase 
Protein accessionYP_001191223 
Protein GI146303907 
COG category[R] General function prediction only 
COG ID[COG1373] Predicted ATPase (AAA+ superfamily) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.631516 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAGTTG AGGACTTCAA GGCGGTGATT GCTGAATTCC TTAACTCGGA GATACCCAAG 
ACCACGGATA GGGAGACCAG GTTGCCCCTT GACACAAACT ACGTGATCAC GTTGACCGGT
GGAAGAAGGG TTGGGAAGAC CTACATCCTC TACAACACCA TGTCCAGGTT AGTTTCCGAG
GGCAAGGCCT CCAAGGACGA GATAGTTTAT GTGGATTTCG AACACCCCAG GCTGAGGAAC
TTAAGTGCCG TTGATCTTGA CGACATATTG ACTGCCTTCT ACGAGTTAAC AGGGAAGAAG
CCAAGGTATC TCTTCCTCGA TGAGATACAG ACCGTGAAGG ATTACGGGAG TTGGTTCAGG
AGGAGGCTTG ACGCGAGGGT TTTCCTGACG GGGTCCTCCT CCGCATTAAC TCCCTCAAGG
ATAGCCGAGG AGCTCAGGGG AAGGAGCCTG AACTTTGAGG TCTTCCCCCT CTCCTTCAGG
GAGTACCTGT CCTTTCTGGG AGTACGTGTG AACCCTGAGA TCACCCTATA CACAGAGGAA
AAGGGGAAGA TCCTATCCCT ATTGAGGGAG TATCTCAGGT ATGGCGGATA TCCAGCGGTG
GTCCTTGAGA GGGACCCAGG TCTCAAGAAG ATGCTGTTAC GATCCTACTT CGACTCCGTC
GTGGTGAGGG ACCTGAACGA GAGGTATGCC GAAACCTTTG CCTCCTACAT CGTGTCAAAT
TACTCGTCGC TGATCTCGTA CAATAGGGTT TACAACTACC TGAAAACCCT GGGTTTCAAG
GTAAGTAAGG AGAAGGTGAT CGAACTCTTT CGCAGGGGGA GGGAGGCGTA CTTCCTGTTC
GAGGTGGAGG TGTTTGAAAG GAGCGAGACT AAGAGGAAGG TGAATCCTAG AAAGGTCTAC
ATCGTGGACA TGGGTTATCC CTACGCCTTG GGGTATGACT CAGTGTCTAA GGCTATGGAA
AACGCGGTCT ACCTCCAGTT GAGGAGGGAG GGGAAGGAGG TGTATTACTG GAGATCTGAG
GACGCGGAGG TGGATTTTGT GGTGAGTGAG AAAATGGAAC CTAAGGAGCT CATACAGGTG
ACCTACGCCG AGGACAAGAT AGAGGACAGG GAGGTGAAGG GATTGAGAAA GGCTGAAAGG
GAGATCAACG CGGAAAGGTC CACGATCATA ACCTGGAGCT ACCAAGGGAG GGTCAACGGT
TATCAGGCAG TTCCTCTTTG GTATTGGTTA TTAAGGAGAG AGAGATAG
 
Protein sequence
MRVEDFKAVI AEFLNSEIPK TTDRETRLPL DTNYVITLTG GRRVGKTYIL YNTMSRLVSE 
GKASKDEIVY VDFEHPRLRN LSAVDLDDIL TAFYELTGKK PRYLFLDEIQ TVKDYGSWFR
RRLDARVFLT GSSSALTPSR IAEELRGRSL NFEVFPLSFR EYLSFLGVRV NPEITLYTEE
KGKILSLLRE YLRYGGYPAV VLERDPGLKK MLLRSYFDSV VVRDLNERYA ETFASYIVSN
YSSLISYNRV YNYLKTLGFK VSKEKVIELF RRGREAYFLF EVEVFERSET KRKVNPRKVY
IVDMGYPYAL GYDSVSKAME NAVYLQLRRE GKEVYYWRSE DAEVDFVVSE KMEPKELIQV
TYAEDKIEDR EVKGLRKAER EINAERSTII TWSYQGRVNG YQAVPLWYWL LRRER