Gene Daud_1847 details

Gene Information

Locus tag: Daud_1847
Symbol:
ID: 6027634
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Desulforudis audaxviator MP104C
Kingdom: Bacteria
Replicon accession: NC_010424
Strand:
Start bp: 1944598
End bp: 1945557
Gene Length: 960 bp
Protein Length: 319 aa
Translation table: 11
GC content: 69%
IMG OID: 641594662
Product: cobalamin biosynthesis protein CobD
Protein accession: YP_001717972
Protein GI: 169831990
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1270] Cobalamin biosynthesis protein CobD/CbiB
TIGRFAM ID: [TIGR00380] cobalamin biosynthesis protein CobD
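
The start, end, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the gene length includes a terminal stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information listing above.
start_bp = 1944598
end_bp = 1945557

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 960 319
```

Both results agree with the listed Gene Length (960 bp) and Protein Length (319 aa).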


Plasmid Coverage information

Num covering plasmid clones: 39
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGGAATTCT TGGCCTATAG GGCCGGAGTG ATCCTGGCCG CCGTGGTCCT GGACCGGCTG 
CTGGGCGATC CGCCCCAGCT CCCGCACCCC GTTCGGCTGA TCGGGTACTT GATCGCCTCC
GGCGAGTCCA TCCGGCGCCT ACCGCTGCCG GTGCGCTGGT CGGGGACCCT GCTGGCGATG
GGTGTGATCG CGGTAACCGG TGCGGCGGCA TGGCTGCTTG TTAAACTGGG GTACGCGGCG
GGCTTTTGGT GGGGCCTAGC CGTGGAAGCC TTGCTCGTCT ACCTGGCCGT CGCCCCTCAT
TCGCTAGCCG GTGAGGCGTT GGCGGTGGAC GGATATCTGC AGCGTGGCGA CCTGCCGGGC
GCGCGCCAAA GGCTGGCGAT GATCGTCGGG CGTGACACCG CCGGTCTCGA TGAACGGGAG
GTCCGCCGGG CGGCGGTGGA GACGGTGGGC GAGAACACGG TCGACGCGGT CCTCTGCCCG
GTTTTTTACG CGCTGCTCTT CGGAGCGGCG GGAGCCTGGG TTTACAAAGC GGTCAGCACA
CTGGATTCGA TGATCGGTTA CCGGCACGGG CCGTACCGCG ATTTCGGCCG GGCCGCCGCC
CGGCTGGACG ACTTCGCCGC CTTCGTCCCG GCGCGGCTGG CCCTCTTTTT CGTGCCCCCG
GCGGCCCTGC TGTCCGGCCT TTCCGCGCGG GACGCCTGGC GGGTGGGCTG GCGGGACCGC
CTGGCTCACC CCAGCCCCAA CGCCGGCCAC GGAGAAGCGC TGTTCGCCGG GGCGCTCAAC
GCCTGTCTCG GCGGGCCGTC GACTTACAAC GGGAGACCTT CGGAAAAACC GTACCTCGGC
GCGGAATACC CGCCGCCCGG ACCGCCGGCC ATACGCCAAG CGGTAACCCT GCTCTGGGCA
ACCGCCTGGC TGTTTGCGGC CGCGGGCGCG GCCGTTCTGG TTTTCCTCAG CCGGTTTTAA
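
A GC-content figure for a sequence like the one above can be computed as the share of G and C bases among all nucleotides. The following is a minimal sketch; the function name `gc_percent` is hypothetical, and whether the database rounds or truncates its reported percentage is not stated in the record.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Keep only nucleotide letters, so the spaces and line breaks used in
    # the 10-base block layout above are ignored.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc = sum(b in "GC" for b in bases)
    return 100.0 * gc / len(bases)

# First 10-base block of the gene sequence above: 3 of 10 bases are G or C.
print(gc_percent("ATGGAATTCT"))  # 30.0
```

Passing the full 960-base gene sequence (spaces and newlines included) to the same function gives the whole-gene value.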
 
Protein sequence
MEFLAYRAGV ILAAVVLDRL LGDPPQLPHP VRLIGYLIAS GESIRRLPLP VRWSGTLLAM 
GVIAVTGAAA WLLVKLGYAA GFWWGLAVEA LLVYLAVAPH SLAGEALAVD GYLQRGDLPG
ARQRLAMIVG RDTAGLDERE VRRAAVETVG ENTVDAVLCP VFYALLFGAA GAWVYKAVST
LDSMIGYRHG PYRDFGRAAA RLDDFAAFVP ARLALFFVPP AALLSGLSAR DAWRVGWRDR
LAHPSPNAGH GEALFAGALN ACLGGPSTYN GRPSEKPYLG AEYPPPGPPA IRQAVTLLWA
TAWLFAAAGA AVLVFLSRF
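
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an illustration under stated assumptions: it uses the standard codon assignments, which translation table 11 shares with table 1 for elongation, and it does not model table 11's alternative start codons (not needed here, since this gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G). Translation
# table 11 uses the same assignments for elongation; it differs from table 1
# only in the permitted start codons, which this sketch does not model.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(seq: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the gene sequence above.
print(translate("ATGGAATTCT TGGCCTATAG GGCCGGAGTG"))  # MEFLAYRAGV
```

The output matches the first ten residues of the protein sequence; the same function applied to the final line of the gene sequence yields the last 19 residues and stops at the terminal TAA codon.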