Gene Daud_0445 details


Gene Information

Locus tag: Daud_0445
Symbol:
ID: 6026784
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Desulforudis audaxviator MP104C
Kingdom: Bacteria
Replicon accession: NC_010424
Strand:
Start bp: 483672
End bp: 484709
Gene Length: 1038 bp
Protein Length: 345 aa
Translation table: 11
GC content: 65%
IMG OID: 641593285
Product: N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase-like exopolysaccharide biosynthesis protein
Protein accession: YP_001716623
Protein GI: 169830641
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4632] Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase
TIGRFAM ID:
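
The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon; neither convention is stated in the record, although both agree with the listed values. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 483672
END_BP = 484709


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one stop codon is included."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1038 345
```

Both results match the Gene Length (1038 bp) and Protein Length (345 aa) fields.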


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.809516
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCCGCG TTAACCTGTT CTTCTTCTTT CTCGCAGCGC CCCTGATCGG GGTGTTGATT 
GCCGTTTCTT CCTTCAGCCG GGGGGCGCAG GGCCTGGAGC TGCCGGTTGC CGAGCTTCAG
CCGGCCGTGG GGCTATTGGC GGACGGGGTG TACCAGCTAC GCTACACGGT CGGCTCCGTT
CACGAATCCC TGGAGGAACA GCAGCAGCGC TACGAGGAGC AGCAGGAACT CCTCAGGACC
CTGGCCGCCA AAAGCGCCGA GCACAAGCAG CTTTCCGACG ACATCTATGA GCAGCACATC
CTGGACAAGC TGGGGCCGCC GGTCCGCGTA CACCGTTCGG CGCGGGTGGA GGTCAAGATT
TTTGAACTAA AGGGGATCGG GTACCGCGGC TACATCGCCA AGGTCAAGCC CTTCGACCCG
GGTGTGCTCC GGGTGACGTA CCGGGAGGGG CCGGGTGAAA CCACCAGTGA GGCCGTCCGG
CGCACCGGGG CGGTCTTGGG GGTGAACGGG GGCGGTTTCT ACCGGGCTCC GGTTGACGGG
CTGATGCACA CCCTGCCCAT TGGGAACACG ATGGTGGACG GAAAACTGGT CGGGGGCTTC
CAGCCGCCAC GCGAAGACCT GTTTTTCGCT GGCTTTGACG GCCGGGGGCG GCTCGTGGGC
GGAATCTTCA ACGACCGCAC GGCCTTGCTG GGTACAGGCG CCAGGCAGGG GGTCAGCTTC
GTGCCGATCC TGATCAAAGA CCGCCAGCCG GTGCCGATCC CGGAGAAGTG GCGGAACCAG
CGGCAGCCGC GCACTATCCT GGGCGAGTAC GCCAACGGCG ACCTGATCAT GATCGTGGTC
GACGGGCGGC AGGCCGACTG GAGCAGCGGG GTGACTCTGG AGGACCTGCA GGTGACGCTG
ATCAAGTTCG GAGTGATCGA CGCCTACAAC CTGGACGGCG GCGGATCGAG CGTGTTCGTG
TTCGGCAACC AGATCCTGAA CCGCCCCTCG GACGGCCGGG AGCGGGTGGT GGCCACGAAC
ATTGTGGTTT TGCCGTAG
 
Protein sequence
MRRVNLFFFF LAAPLIGVLI AVSSFSRGAQ GLELPVAELQ PAVGLLADGV YQLRYTVGSV 
HESLEEQQQR YEEQQELLRT LAAKSAEHKQ LSDDIYEQHI LDKLGPPVRV HRSARVEVKI
FELKGIGYRG YIAKVKPFDP GVLRVTYREG PGETTSEAVR RTGAVLGVNG GGFYRAPVDG
LMHTLPIGNT MVDGKLVGGF QPPREDLFFA GFDGRGRLVG GIFNDRTALL GTGARQGVSF
VPILIKDRQP VPIPEKWRNQ RQPRTILGEY ANGDLIMIVV DGRQADWSSG VTLEDLQVTL
IKFGVIDAYN LDGGGSSVFV FGNQILNRPS DGRERVVATN IVVLP
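
The relationship between the two sequences can be spot-checked by translating the gene sequence. The following is a minimal sketch rather than the method used to produce this record: it translates the first and last blocks of the gene sequence with a hand-built codon table. Translation table 11 assigns codons to amino acids the same way as the standard code and differs only in the set of permitted start codons, which does not matter here because the gene begins with ATG. The `translate` helper is hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First and last blocks of the gene sequence, copied from the record above.
first_block = "ATGCGCCGCG TTAACCTGTT CTTCTTCTTT CTCGCAGCGC CCCTGATCGG GGTGTTGATT"
last_block = "ATTGTGGTTT TGCCGTAG"

print(translate(first_block))  # prints: MRRVNLFFFFLAAPLIGVLI
print(translate(last_block))   # prints: IVVLP*
```

The first output matches the opening 20 residues of the protein sequence, and the second matches its final five residues followed by the stop codon.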