Gene Msed_1039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1039 
Symbol 
ID5104426 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp963713 
End bp964888 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content52% 
IMG OID640506935 
ProductFAD-dependent pyridine nucleotide-disulphide oxidoreductase 
Protein accessionYP_001191128 
Protein GI146303812 
COG category[C] Energy production and conversion 
COG ID[COG1252] NADH dehydrogenase, FAD-containing subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.354978 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAAGA GAATTGTAAT AGTTGGTGGC GGAATAGGAG GAATGGGAGT AGCCACAACT 
CTCGCAGGTA AGTTAAATGC AGAGATAACC GTGATCAATA AGGATAACTT CTACGTGACA
GGGCCCTCGA GACCATTGCT CCTTACAGGC GAGCAAGAGT ACGGAAGAAT GCTGAGGGGA
TACGAAAAGG TGGGAGAGAA GGGTATTAAG GTGGTCGCGG GGAACGTGAT AAGGGTTGAT
CCCGACAATA GGAAGATAAC CCTGTCTGAA TCAGGGTTTG GACTGACAAG CAGGGAAATC
CAGTACGACT ATCTAGTGCT TGCCCCTGGC GTCGTATATG ACGGCTCCTC GATCACAGGG
CTTGATAGGA ACTGGTGGAG GAACACCACG GTCTACGACC CTGGAAGGGT AAACGTGTTG
AGGCAAAGGC TATGGAGCGA GAACGAGGGG ACAGTCCTGA TTTATGCCCC AAAGGCTCCC
TACAGATGTG CCCCTGCTCC GACGGAGACG GCCCTCCTGG CTCACACAGT GCTAAAGCAC
AGGGGAGTGA GGGAGAAGTT CAGGATAATA CATGTGGACG CAAACGATAA GACACAACCG
CCTTTCATCG CCGACGTTGT GAAGCAGGTC TACGAAAAGG CCGGGATAGA GCTTGTGACT
AACCAGGAGA TAGTTGAGGT GAATGAGAAA GAGGTGATCA CGAAGTCTGG CGAGAGATAT
GGATATACCA TACTTGCCCT CCTGGAGCCC AACAGGGCTC CCAGGTTCGT GGAGGAGGCT
GGACTAGGAA CGCCGTTCGT CGAGGTTAGG TCACCGCAGG ACTTGAGACA TCCGAAGTAT
GATGACGTCC TGGCAGTGGG AGATGCAGCG AAGTTACCCT TCCCTAAGAA CCAGGAGATC
GCCTTCGAGA GCGCCCTCTT CGCCTCCAAC AAGATTCTGG AGATGGAGGG TGTAACGGAG
AAAGTTCCCG TTCAGTATGC GTTTGTGGGC TGGGCCTATA TGGGTAATCT CGAGGGAAGA
CTTGAGACCC AGAGCCTCCA GTTCCAACTA GACTTAACAA CCCAACCGCC AAAGCCTGCG
AAGGATCCTC AGCTCAAGAG AGAATATACA CTACAGAAGG ACAGATGGGA GCAGGCATAC
CTTGAGAGGC TCTTCGGATA TTCCCCTAAA TCGTGA
 
Protein sequence
MAKRIVIVGG GIGGMGVATT LAGKLNAEIT VINKDNFYVT GPSRPLLLTG EQEYGRMLRG 
YEKVGEKGIK VVAGNVIRVD PDNRKITLSE SGFGLTSREI QYDYLVLAPG VVYDGSSITG
LDRNWWRNTT VYDPGRVNVL RQRLWSENEG TVLIYAPKAP YRCAPAPTET ALLAHTVLKH
RGVREKFRII HVDANDKTQP PFIADVVKQV YEKAGIELVT NQEIVEVNEK EVITKSGERY
GYTILALLEP NRAPRFVEEA GLGTPFVEVR SPQDLRHPKY DDVLAVGDAA KLPFPKNQEI
AFESALFASN KILEMEGVTE KVPVQYAFVG WAYMGNLEGR LETQSLQFQL DLTTQPPKPA
KDPQLKREYT LQKDRWEQAY LERLFGYSPK S