Gene Msed_1771 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1771 
Symbol 
ID5104771 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1712553 
End bp1713797 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content54% 
IMG OID640507669 
Producthypothetical protein 
Protein accessionYP_001191850 
Protein GI146304534 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGGGCT CATCGCCCCT AATGGCGTCT CTCTTCCCGG GATGGCAGTT CAAGACTGCC 
CTGGATGCCG TGAAGTTGGG GTGGTACTTC GCCCTGCCCA AAGTGGGAGA ATTTGGGGCC
TCCAAGCTTG TTCTCGAGAA GGAGAAGCCC TTCATTGACG TGAATGAGCT TAAGGGGTTT
AGGTACGACG GGTTAGTTAT CAATCAGGTT GTTACCTCGG GCAAGAACTT GAGGACCGTG
GAGAGGACCG TGAAGTCCAT TCATCATTGG TATGGCGAGG TGGAGAGGAG GTACTCTGTG
AGGATACCCC ACGAGGTCTG GATTGTGGTT GACGAGGGAA GGGAGGCAGG TTTGAGGGGG
TTAGACGCTA GGGTGGAGGT AGTTCCAGCA GAGTATAGGA CCAGGAATGG GTCCATGTTC
AAGGCCAGGG CACTCCAGTA CGCCGTGGAA CAGAGGGGAG GCACTGGATC AGGCACGTGG
GTTTACTACC ACGATGAGGA GACCGTGTTT GGGGAGGACA GCGTCCTAGG AATTGCCGAA
TTTGTTCAAG GGGACAGGGA CGTTGGGGTT CATCCCATAG TTTACCCGGT TAACTGGAGA
GGCGACGTGT TATCCACGAT TGAGACGTTG AGGACGTCCA ATGACGTGGT GAGCCTTTCC
CTGTCCCCCA GGGGAATGTG GCACGGCTCT GGTTTCATGG TTAGGGGAGA GGTGGAGAGG
GAGATTGGAT GGGACTTTGG CCCAGTGAGG GCTGAGGACC TCCTCTTCCA CCTGAGGGCA
TCACGGAGGT TCAGGTACGG AGTCATGAAG GGTTTCGTGT ACGAGATACC TCCGCAGAAC
TTAATGGACT TCATGAGGCA GAGGAGGAGA TGGATACTGG GGATACTTGA CGGGTTCAAG
GACGGAAGGA TGGATGTAAG GAATAGGGTG AAGTACCTTC TGGGTTTAAC TAGCTGGTAC
TCCTCCGCGT TGGGTTTCCT GGTGCCCCTA TTCGTGTACA TGAGGGATGC AAGCGCACCT
CTTCCCATTG GACCATATCT AACCGGGCCC ATCTGGTTCA CCCTGCTCCT CATGTTAAAG
GACGGCTTTG TGCTCACTAG GAGGTATGCT GGCCTCAGGG GACGGGACCT TCCAAGTTTC
ATGGTGAAGG GATTAGTAGG GCTCATGCTT GAGGCCATAG CCCCTTGGTA TACCCTGTTT
ACAGGATGGA GGGATCACGG GTTCCACGTC ATAGATAAGG GATAG
 
Protein sequence
MLGSSPLMAS LFPGWQFKTA LDAVKLGWYF ALPKVGEFGA SKLVLEKEKP FIDVNELKGF 
RYDGLVINQV VTSGKNLRTV ERTVKSIHHW YGEVERRYSV RIPHEVWIVV DEGREAGLRG
LDARVEVVPA EYRTRNGSMF KARALQYAVE QRGGTGSGTW VYYHDEETVF GEDSVLGIAE
FVQGDRDVGV HPIVYPVNWR GDVLSTIETL RTSNDVVSLS LSPRGMWHGS GFMVRGEVER
EIGWDFGPVR AEDLLFHLRA SRRFRYGVMK GFVYEIPPQN LMDFMRQRRR WILGILDGFK
DGRMDVRNRV KYLLGLTSWY SSALGFLVPL FVYMRDASAP LPIGPYLTGP IWFTLLLMLK
DGFVLTRRYA GLRGRDLPSF MVKGLVGLML EAIAPWYTLF TGWRDHGFHV IDKG