Gene Msed_1444 details


Gene Information

Locus tag: Msed_1444
Symbol:
ID: 5104814
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 1411502
End bp: 1413190
Gene Length: 1689 bp
Protein Length: 562 aa
Translation table: 11
GC content: 51%
IMG OID: 640507332
Product: DNA polymerase B region
Protein accession: YP_001191525
Protein GI: 146304209
COG category: [L] Replication, recombination and repair
COG ID: [COG0417] DNA polymerase elongation subunit (family B)
TIGRFAM ID:
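
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends with a single stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
START_BP = 1411502
END_BP = 1413190


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints "1689 562"
```

Under these assumptions the computed values agree with the listed gene length (1689 bp) and protein length (562 aa).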


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAGGTT ACGTAATAGA TGCCGTACCA TACAGGGGTA GAATAAAGAT CGTGCTGGAC 
TCCTTCAGAG AAGCGTGGAT TAAGACAACC TATCCCATTT ACGTCATCAC CGATAGACCT
GACCTCGTGA TGCAACATCC CTCGGTGGTG TCCCACGAAG AGGAGGAGTG GAGAACCCTA
TCTGGCGAAA GGATTACCTT ACACAGGTAT GAGCTCCTTG ACCTTGAGGC GAGGTCCTAT
ATTTCTAGGA GAACGACCGT GGTGAACCAG CTTCCCACCA CACTTTCCCT TGTCCTGGAC
AGGTTAGATG CTAGACCCTT CAGAAGGGTC AGGATAGACG AAGGTAAAGT CCTCGAATTC
TGGGATCTGG GCCTATCCTT TCCGCCCGTG AAATATGCCA CGGTGATAAC ACATGACTGG
TATGGCCCTT CAGAGACGGG GAACATGTTT GAGGCTGACG TGAACGGAGA GAAGTGGAGG
GGGTTACTGA GAGACCTGGA CCTCGAGGTA GACGTCGCTG GCTGTTTCGG ACGTGCGTGC
GACCACGTGA AGGCGCCAGT TAAGATAAAA CTTGAACAAA AGAGGTCTCC GGTCTCCATT
AAGGGGTTAA TTGAGTGGTC CTACACCTGC AAGACACCCG TAAGGGAATT GGTGGACGCT
ACCATAGGGA AAGCGCTGAC CACAAACGAG GCCTGGGTCG CATTCGAGAA GAAAATAGTT
GTTCCCAACA AGGTCCCGAG GGTAGAGAAG CTCAGGGATA TGGATGAACT GCTCCTCAAC
GATAAGGGTG GACTGGTCCT GTTTCCTAGA ACCGGTTGCT TCGACGATGC CTGGCAAGTT
GACTTCTCCT CTATGTATCC TTCTCTCATA GTTAAGCATA ACATCTCCGG GGAAACCGTT
GACGCCTGCG ATGACGTAGT CACAGAGATA GGCCATACCA TCTGCAATAG CGAGAGGGGG
ATAGTACCGG AGGCGCTATC CTGGTTAATA AGAAGGAAAG AGGCCCTAAA GCCCGTGGAC
CCTGAAAGGG CAGAGGCGAT AAAATGGATA CTTGTTGCCT CCTTTGGGTA TCTAGGATAC
AGGAACTCTA AGTTTGGGAA GATTGAGGCA TATGAACTGG TGACCTACTA CGCGAGGAAG
ACATTGAGGA GAGCCTTGGA AATTGCGCAG GAAATTGGGG TTGAAGTGCT TCACGGGATA
GTGGACTCCC TCGTGATTAG GGGGGATGCT CCACAGCTAG TGAGGAGACT GGAGGAGGAG
ACTGGGTTAC ACCTTCGCTC CACCAGGCTA AAGTGGATAG TTCTTGGGGG GAGAAGGGAT
GGACTTCCTT ACCCTATGAG GTACTTTGGA ATGACGGAGG AAGGAATGAA ATACAAAGGA
ATCATCAGGA GAAACATGCC GAACCTAGTA AGGGATTTCC TCGAGAGCTC CATGAACATC
CTTTCCAGGG CGGAGACGTG TGAGGACCTG AAAAGGCTTA GGTCAAACCT ACTCGAGAAC
CTCAGGGAGT ATGAAGAGAG GGTAATTCAT GGGGAGCCCA GGGACTTCGT TATGTGGGTG
AAGGGAGAGC CATACGTGAG GGGAGTGAGG GGGTTCTACA ACGCAAGGAG AGGGTTCATG
GGGAGGGATA CGAAGTACTA TCTGGAATAT CTTAGGAGAA CGGCGAGGGA GGTCCTGGGA
ATTGGATGA
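
The gene sequence above is laid out in space-separated ten-base blocks, so it has to be joined before its length or base composition can be checked. The following is a minimal sketch of that cleanup; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, used only as sample input.
first_line = "ATGGAAGGTT ACGTAATAGA TGCCGTACCA TACAGGGGTA GAATAAAGAT CGTGCTGGAC"
seq = clean_sequence(first_line)
print(len(seq), seq[:3])  # prints "60 ATG"
```

Passing the full sequence text through `clean_sequence` should give a string whose length can be compared with the listed gene length, and `gc_fraction` of that string can be compared with the listed GC content.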
 
Protein sequence
MEGYVIDAVP YRGRIKIVLD SFREAWIKTT YPIYVITDRP DLVMQHPSVV SHEEEEWRTL 
SGERITLHRY ELLDLEARSY ISRRTTVVNQ LPTTLSLVLD RLDARPFRRV RIDEGKVLEF
WDLGLSFPPV KYATVITHDW YGPSETGNMF EADVNGEKWR GLLRDLDLEV DVAGCFGRAC
DHVKAPVKIK LEQKRSPVSI KGLIEWSYTC KTPVRELVDA TIGKALTTNE AWVAFEKKIV
VPNKVPRVEK LRDMDELLLN DKGGLVLFPR TGCFDDAWQV DFSSMYPSLI VKHNISGETV
DACDDVVTEI GHTICNSERG IVPEALSWLI RRKEALKPVD PERAEAIKWI LVASFGYLGY
RNSKFGKIEA YELVTYYARK TLRRALEIAQ EIGVEVLHGI VDSLVIRGDA PQLVRRLEEE
TGLHLRSTRL KWIVLGGRRD GLPYPMRYFG MTEEGMKYKG IIRRNMPNLV RDFLESSMNI
LSRAETCEDL KRLRSNLLEN LREYEERVIH GEPRDFVMWV KGEPYVRGVR GFYNARRGFM
GRDTKYYLEY LRRTAREVLG IG