Gene Msed_2059 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_2059 
Symbol 
ID5105039 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1978589 
End bp1979818 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content49% 
IMG OID640507949 
ProductFAD-dependent pyridine nucleotide-disulphide oxidoreductase 
Protein accessionYP_001192123 
Protein GI146304807 
COG category[C] Energy production and conversion 
COG ID[COG1252] NADH dehydrogenase, FAD-containing subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.761901 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAAGG TCTTAGTTTT AGGTGGAAGA TTCGGCGGAC TTACTGCCGC CTATAACGCA 
AAGAGACTGT TGGGGAGCAA GGCTGAGGTT AAGCTGATGA ACCAGGACAG GTTCACGTAC
TTTAGGCCTG CTCTCCCACA CGTGGCCATA GGCGTGAGAG ACGTAGAGGA GCTCAGGATA
GATTTGGCCA GCGCCATGCC AGAGAGGGGA ATATCCTTTG CCCAAGGGAA GGTAGAGAAG
ATAGATGCCG AGTCTAGGAT AGTTTACTAC AAGAAACCAG ATGGAGGAAT GGGAGAAGAG
GAATATGATT ACCTAATGGT GGGGATAGGC GCACACCTCG GGACTGAACT CATAAAGGGA
TGGGATCAGT TCGGTTACAG CGTTTGCGAA CCGGAGTTTG CGGTCAAACT TAGGGATAGA
CTGAAGGACT TCAAGGGCGG ACATATTACC ATCGGATCGG GTCCCTTCTA CCAAGGAAAG
AATCCTAAAC CCAAGGTTCC AGAGAACTTT GTACCTCAAG CAGACTCGGC CTGTGAAGGG
CCTGTCTTCG AGATGTCGCT GATGCTACAC GGGTACTTCA CAAGGAAGGG TATGTGGGAT
AAGGTGAAAA TAACGGTCTA CTCTCCCGGC GAATATCTGT CAGATCTTTC TCCCGCATCC
AGGAAGGCCG TTGCTGAGAT CTATAAAGGA TTGGGAATAG AGCTAGTACA CAACTTCAGA
CTAAAGGAAT TGAGAGAGAA GGAAATAGTG GATGAAAAAG GTAACAAGCT TGAATCGGAT
CTGAGCATAT TACTCCCGCC TTACACGGGT AACCCGGCAC TTAAGGCTTC CACAAAGGAC
CTAGTGGACG ATGGAGGATT CATCCCCACT GACCTGAACA TGCAATCCAT CAAGTATGAC
AACATATATG CAGTTGGCGA TTCTAACGCC CTAACTGTGC CTAAGCTGGG GTACTTGGCA
GTTCAGACTG GCAGGATCGC GGCTCAACAT CTGGCGAAGA GATTGGGAGT TAACACGAAG
GTGGAATCCT ACTATCCCAC CATCGTATGC GTAGCCGACA ATCCACTTGA GGGATATGCC
GTCTCAGTGA AGGACGATAC CTGGTATGGA GGTCAGGTCT CGGTAGCTCA ACCTGCTGCA
GTGAATCACT TAAAGAAGGA ACTATTCACC AAGTACTTCA TGTGGACCAA GGGTGATATG
GTCCTAGAGA AATTCTTGGG AAGCTGGTGA
 
Protein sequence
MTKVLVLGGR FGGLTAAYNA KRLLGSKAEV KLMNQDRFTY FRPALPHVAI GVRDVEELRI 
DLASAMPERG ISFAQGKVEK IDAESRIVYY KKPDGGMGEE EYDYLMVGIG AHLGTELIKG
WDQFGYSVCE PEFAVKLRDR LKDFKGGHIT IGSGPFYQGK NPKPKVPENF VPQADSACEG
PVFEMSLMLH GYFTRKGMWD KVKITVYSPG EYLSDLSPAS RKAVAEIYKG LGIELVHNFR
LKELREKEIV DEKGNKLESD LSILLPPYTG NPALKASTKD LVDDGGFIPT DLNMQSIKYD
NIYAVGDSNA LTVPKLGYLA VQTGRIAAQH LAKRLGVNTK VESYYPTIVC VADNPLEGYA
VSVKDDTWYG GQVSVAQPAA VNHLKKELFT KYFMWTKGDM VLEKFLGSW