Gene Daud_1102 details

Gene Information

Locus tag: Daud_1102
Symbol:
ID: 6027553
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Desulforudis audaxviator MP104C
Kingdom: Bacteria
Replicon accession: NC_010424
Strand:
Start bp: 1159833
End bp: 1160909
Gene Length: 1077 bp
Protein Length: 358 aa
Translation table: 11
GC content: 62%
IMG OID: 641593916
Product: NADH-ubiquinone oxidoreductase, chain 49kDa
Protein accession: YP_001717245
Protein GI: 169831263
COG category: [C] Energy production and conversion
COG ID: [COG3261] Ni,Fe-hydrogenase III large subunit
TIGRFAM ID:
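
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the terminal stop codon


# Values taken from the record for Daud_1102.
length_bp = gene_length(1159833, 1160909)
print(length_bp)                  # 1077
print(protein_length(length_bp))  # 358
```

Under these assumptions the computed values agree with the Gene Length (1077 bp) and Protein Length (358 aa) fields.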


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCGAGGA TGACCTTTCC GTTCGGCCCG CAGCATCCTG TGCTCCCGGA AGCGATTCAG 
CTAAAGCTGA CCGTGGAGGA TGAAAGAGTC GTCGAGGTGC TGCCGGCGAT CGGGTACATG
CACCGGGGCA TCGAGAAGGC GGCCGAACGG AACCCGTACA TCAACAATGT GTTTCTGTGC
GAGCGGATCT GCGGGATCTG CAGTTTCATC CACGGGATGG CTTACTGCCA GACGATCGAG
GAGATCATGA AGGTGGAAGT GCCGCCCCGC GCCAAATACC TGCGGGTAAT GTGGAGCGAG
CTTTCGCGTC TGCACAGCCA CCTATTGTGG CTCGGGCTAC TGGCCGACTC CTTCGGCTTT
GAGAGCCTGT TCATGCAGTG CTGGCGTGCC CGGGAGATCG TGCTCGATAT GCTGGAGATG
ACCACCGGGC AGCGGGTGAT CCAGTCCACC TGCGTCATCG GCGGTGTGAG GCGGGACATC
GACGCCGACC AGGCTGCCCG CCTGCGGGAA ATGCTGAAAA CATTGAAGCC GCAGATCGAC
GCCGTGATCC CAGTGTTCAA GCATGACTAC ACCATCAAGT CCCGCACGGT AGGCAGGGGT
GTGCTGCCGA AGGATCAGGC CTGGACTCTG GGCGCGGTCG GGCCGACCTT GCGTGGCAGC
GGCGGCACCT GGGACGCCCG CTCAACCGGT TACGCGGCGT ACGGCGAGCT TGAGTTTGAG
CCGGTGGTCG AGACCGACGG CGACAGCTAC GCGCGGACCA TGGTGCGGGT CCGGGAAACG
TATCAGGCTT ACGAACTGGT GTTGAAGGCG CTGGACCGGC TGCCGGAAGG CGAGACCAGG
GTCAAGGTGA AAGGTTCCCC GAATGGTGAA GCCGTAATGC GGGTCGAGCA GCCGCGCGGG
GAGCTTTTCT ACTACGCTCT GGGCAACGGA ACCGTGCGCC TGGAGCGATT GAAGGTGCGC
ACGCCGACGT TCGCCAACAT TCCGGCGCTG CTGACCATGC TGCCCGGCTG TGAGATCGCC
GACGTTCCGG TCATCGTACT GTCGATCGAC CCGTGCATGT CGTGTACCGA AAGGTGA
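
The GC content field reports the share of G and C bases in the gene sequence. The sketch below shows one way such a percentage could be computed from the blocked sequence text above; it assumes only A, C, G and T are counted and that spaces and line breaks are layout, and the function name `gc_content` is hypothetical rather than taken from the database.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T characters in `sequence`.

    Whitespace (the 10-base blocks and line breaks) and any other
    characters are ignored.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# First 10-base block of the gene sequence: 6 of 10 bases are G or C.
print(gc_content("ATGCCGAGGA"))  # 60.0
```

Passing the full gene sequence as a single string would give the value to compare against the GC content field.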
 
Protein sequence
MPRMTFPFGP QHPVLPEAIQ LKLTVEDERV VEVLPAIGYM HRGIEKAAER NPYINNVFLC 
ERICGICSFI HGMAYCQTIE EIMKVEVPPR AKYLRVMWSE LSRLHSHLLW LGLLADSFGF
ESLFMQCWRA REIVLDMLEM TTGQRVIQST CVIGGVRRDI DADQAARLRE MLKTLKPQID
AVIPVFKHDY TIKSRTVGRG VLPKDQAWTL GAVGPTLRGS GGTWDARSTG YAAYGELEFE
PVVETDGDSY ARTMVRVRET YQAYELVLKA LDRLPEGETR VKVKGSPNGE AVMRVEQPRG
ELFYYALGNG TVRLERLKVR TPTFANIPAL LTMLPGCEIA DVPVIVLSID PCMSCTER
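
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. As a minimal sketch, the code below builds the codon table by hand; table 11 assigns the same amino acids to codons as the standard code (it differs in the start codons it permits), and this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative, and the function assumes an in-frame sequence containing only A, C, G and T.

```python
from itertools import product

# Codon order T, C, A, G in each position, paired with the
# amino acid string of the standard genetic code ('*' marks stop).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace is removed first, so the blocked layout used in the
    record can be pasted in directly. A trailing partial codon is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # KeyError on non-ACGT characters
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence (ten codons).
print(translate("ATGCCGAGGA TGACCTTTCC GTTCGGCCCG"))  # MPRMTFPFGP
```

The ten residues produced match the first block of the protein sequence above.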