Gene Daud_1778 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaud_1778 
Symbol 
ID6027234 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Desulforudis audaxviator MP104C 
KingdomBacteria 
Replicon accessionNC_010424 
Strand
Start bp1869148 
End bp1870200 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content53% 
IMG OID641594595 
Productpolysaccharide biosynthesis protein CapD 
Protein accessionYP_001717906 
Protein GI169831924 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1086] Predicted nucleoside-diphosphate sugar epimerases 
TIGRFAM ID[TIGR03589] UDP-N-acetylglucosamine 4,6-dehydratase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCACTTA AGAATACGCA ACCAGGCTCA TTATCTCAAG ATGTGAAGGT CGGAGGCGGG 
GCAGGCGTGC TAAACGGCGG ATCTGTTCTG GTCACGGGGG GTACGGGCTC GTTCGGGCAG
AAATTTGTGG AGACGGTGCT CCAACGGTAC AAACCGCGCC GGCTGATTAT CTTAAGCCGG
GATGAACTGA AACAATACGA AATGCAGCAG GTTTTCGATC CAGCCAAATA CGATTGCCTA
CGTTACTTCC TTGGCGATGT GCGCGACAAG AACCGGTTGT ACCGGGCGTT CTACGGCGTT
GATGTTGTGG TGCACGCGGC GGCGCTCAAA CAGGTGCCGG CGGCCGAGTA TAACCCCTTT
GAAGTAGTCC AGACCAACAT AATCGGCACA CAGAATGTTA TTGACGCCGC CATCGACAAC
GGGGTCCAAA AGGTCATCGC CCTGAGCACT GATAAGGCGG TGAACCCGGT AAACCTTTAT
GGGGCGACCA AGTTGTGCTT GGAGAAACTG GTCGTGGCGG CCAATTCTTA CGCCGGTGGC
CGCACCAGAT TCAGCGTGGT CCGCTACGGC AACGTGGTCG GCAGCCGGGG CAGCGTGGTG
CCCGTGTTCC TCAAGCAGAA AAAGACGGGG ACTTTGACCG TCACTGACGA GCGGATGAGC
CGTTTTTGGA TCACCCTGGA ACAGGGGGTT TCTTTTGTTC TCAATTGCAT TGAGAACATG
CAGGGCGGCG AGGTTTTTGT ACCAAAGATT CCTAGTATGC GGATTATGGA TCTTGCCCAA
ACTGTTTGTC CCCACTGCGA GATTCGGTTT ATCGGCGTCC GCCCGGGAGA GAAGCTGCAC
GAATTGCTCA TCTCTAAGGA TGAGGCACGT AACGTTGTGG ATTGTGGAGA TTTTTTCGTT
GTAAAACCGT CTTTTCCGTT TTGGATGTCC AAGATTGAAC AGCAAGGGCA ACCGGTTCCG
GAGGAATGGG AATATGCAAG CAATACTAAC GAACAATGGC TTGAAAAAGG TCAGCTGCAG
AAGCTGATCA ATCAGTTCTC CCAGCATTCT TAA
 
Protein sequence
MSLKNTQPGS LSQDVKVGGG AGVLNGGSVL VTGGTGSFGQ KFVETVLQRY KPRRLIILSR 
DELKQYEMQQ VFDPAKYDCL RYFLGDVRDK NRLYRAFYGV DVVVHAAALK QVPAAEYNPF
EVVQTNIIGT QNVIDAAIDN GVQKVIALST DKAVNPVNLY GATKLCLEKL VVAANSYAGG
RTRFSVVRYG NVVGSRGSVV PVFLKQKKTG TLTVTDERMS RFWITLEQGV SFVLNCIENM
QGGEVFVPKI PSMRIMDLAQ TVCPHCEIRF IGVRPGEKLH ELLISKDEAR NVVDCGDFFV
VKPSFPFWMS KIEQQGQPVP EEWEYASNTN EQWLEKGQLQ KLINQFSQHS