Gene Msed_0439 details

Gene Information

Locus tag: Msed_0439
Symbol:
ID: 5105556
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 390681
End bp: 391988
Gene Length: 1308 bp
Protein Length: 435 aa
Translation table: 11
GC content: 49%
IMG OID: 640506345
Product: deoxyribodipyrimidine photo-lyase type I
Protein accession: YP_001190540
Protein GI: 146303224
COG category: [L] Replication, recombination and repair
COG ID: [COG0415] Deoxyribodipyrimidine photolyase
TIGRFAM ID:
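
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive, and that the last codon of the gene is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 390681
end_bp = 391988
reported_gene_length = 1308      # bp
reported_protein_length = 435    # aa

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is excluded from the protein.
protein_length = gene_length // 3 - 1

print(gene_length == reported_gene_length)        # True
print(protein_length == reported_protein_length)  # True
```

Under these assumptions the reported gene length and protein length agree with the coordinates.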


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCTGTG CCTTTGTGTT CAGGAGAGAC CTTAGGCTAG ATGACAACAC TTGCCTTCTG 
AGAGCGCTTC AGGAATGCGA TGAAGTGGTT CCAGTATTCG TGTTGGATCC CAGGCAACTG
GGCGATAATC CATACAAGTC CGCCTTCGCC TTGGGCTTCA TGGTTGATTC CCTCCTGGAT
CTTGACATGC AGTTGAAGCA TCGTAGTTCA AGGCTCCACA TTCTGCAGGG ATATCCTGAG
AAGGTGTTGC CAGAGCTCAA GGTTGAGGCA ATATACTTCA ACGAAGATTA CACTCCTTTC
AGCCTGAACA GGGATAACGC AATAAGGGAG ACAATGCGTG GAAGGGTTAA GTCATGCGAA
GACCTTCTCC TGACACCAAA GGACTTCTTC GTAAGAAAGG GAAAACCCTA CACAGTTTTC
ACGCACTTTT ATAACGATGC GAGGAAGCTC GAGGTGAGGA AACCCATGAA GAACGACATG
AGGAATTACC TCACTCTCGA TCTCCCTGGG ACGGAGGTCC TGAAGCTGGA GGTCGAGAGG
GGTATCCCAG GCGGGAGACA GGAGGGGCTC AAGAGGCTGG AAAGGGCCAG AAACCTGAAC
TACTCCATGC GTAACTTCCC AGGAGTTGAA GGTACTACGA AGCTCTCGCC ATATATCAAG
TTTGGGGTTG TCTCACCGAG GGAGGTGTAC TGGGCGGTCA ACGAGGAGAT AAGGAGGCAA
CTGTACTGGA GGGACTTCTA CACGCTTCTG GCCTACTATA ATCCCCACGT GTTCGGTCAT
TCGTACAAGA GGGAGTACGA CTGTATACCC TGGAAGTGGA ATGAGGCTCA TCTTGAGGCA
TGGAAGCAGG GTAAGACGGG TTATCCCATA GTTGACGCGG GGATGAGGGA ACTTAACGAG
ACTGGATTCA TGCATAACAG AACCAGGATG ATAACGGCCT CATTTCTCGT GAAGGTATTG
CATGTGGATT GGAGGATAGG GGAGAGATAC TTCGCTACAA AACTAGTTGA CTACGACCCA
TCAGTAAATA ACGGAAATTG GCAATGGGTG GCCTCAACTG GTGCGGATTA CATGTTTAGG
GTATTCAACC CTTGGTTGCA ACAGAGGAAG TTTGACCCAG ATGCGGTGTA CATAAAGACG
TGGGTACCAG AACTGAAGGA TCTTCCAGCC GAGAAGATTC ACGAGATTTA TAGGTTCAAG
GTTTCAGGCT ATCCCTCCCC CATAGTGGAT TATAGTGAGG AAGTCAAGAA AGCTAGGAAG
ATGTACGAAG ACTCGGTGGC GTTATGCAGT AAGAGGGGTC TCTTTTAG
 
Protein sequence
MPCAFVFRRD LRLDDNTCLL RALQECDEVV PVFVLDPRQL GDNPYKSAFA LGFMVDSLLD 
LDMQLKHRSS RLHILQGYPE KVLPELKVEA IYFNEDYTPF SLNRDNAIRE TMRGRVKSCE
DLLLTPKDFF VRKGKPYTVF THFYNDARKL EVRKPMKNDM RNYLTLDLPG TEVLKLEVER
GIPGGRQEGL KRLERARNLN YSMRNFPGVE GTTKLSPYIK FGVVSPREVY WAVNEEIRRQ
LYWRDFYTLL AYYNPHVFGH SYKREYDCIP WKWNEAHLEA WKQGKTGYPI VDAGMRELNE
TGFMHNRTRM ITASFLVKVL HVDWRIGERY FATKLVDYDP SVNNGNWQWV ASTGADYMFR
VFNPWLQQRK FDPDAVYIKT WVPELKDLPA EKIHEIYRFK VSGYPSPIVD YSEEVKKARK
MYEDSVALCS KRGLF
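
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The sketch below shows one possible way to strip the whitespace, compute the GC fraction, and translate the gene sequence. It is an illustration, not the database's own method: the helper names `clean_sequence`, `gc_content`, and `translate` are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares (table 11 differs only in its set of permitted start codons).

```python
# Standard codon table built in NCBI order (bases T, C, A, G).
# Translation table 11 uses these same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(text.split()).upper()


def gc_content(dna):
    """Return the fraction of G and C bases (0.0 for an empty sequence)."""
    dna = clean_sequence(dna)
    if not dna:
        return 0.0
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        # Codons containing unexpected characters are reported as "X".
        amino_acid = CODON_TABLE.get(dna[i:i + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
first_blocks = "ATGCCCTGTG CCTTTGTGTT CAGGAGAGAC"
print(translate(first_blocks))  # MPCAFVFRRD
```

The translated prefix matches the first block of the protein sequence listed above. Passing the full gene sequence to `gc_content` and `translate` allows the same kind of comparison against the reported GC content and protein sequence.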