Gene Dde_2022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDde_2022 
Symbol 
ID3757030 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio desulfuricans subsp. desulfuricans str. G20 
KingdomBacteria 
Replicon accessionNC_007519 
Strand
Start bp2061633 
End bp2062931 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content60% 
IMG OID637782910 
Productheptosyltransferase family protein 
Protein accessionYP_388514 
Protein GI78357065 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0859] ADP-heptose:LPS heptosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.259554 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAG CACTGGTGAT TCAGCTTGCC CGATTCGGGG ACATTATACA GACCAAGCGG 
CTCATTCTGA CGCTGCTCGG CCGCTATGAC GAGGTGCATC TGGCTGTGGA CAGTTCGCTG
GTTCCGCTTG CCCGTCTGGT ATATCCTCAG CTCTGCGTGC ACGGCCTTGC GGCACACAGA
GGAGCGGGTT CTCCGGCGGA AGTGTTTGCC GCAAGCAGAA TGCTGTTCGG CAGGCTGGCT
TCAGAGCGTT TTGACGCTGT ATATAATCTT AATTTTTCGC GTCTTAATTT TTCGCTGTCC
GCTTTGTTTG ATGCTGACAG TGTGCGCGGG TATGCCATGC GCAGCGGGCA GCCTGTGGTG
CCACTGTGGG CAAGACTGGC CTTTCGCTGG ACAAGAAAAC GCCGCATAGC ACCGCTCAAT
CTTGTTGACT TCTGGGCATA TTTTGCGCGC AACCCTGTGT GTGCAGGAGA AGTTAATCCC
GTGGCGGCAC GCGGGGGAGG CGGTATAGGC GTGGTGTTGG CCGGACGTGA TTCACGCCGC
TCGTTGCCGC CTGCAGTGCT TGCCGCCTGT GTGCAGGCGG TTTTTGAGGG AACCGGAGGT
CCGGCAATAA CCCTGCTGGG CACCGCTGCG GAAAAAACGC TGGCCAGACA GCTGATGCGC
CACTTTTCCG GCCCCATGGT GGAGCGTACC GTAACGCTGG CGGGCAAAAC AGGGTACACG
GACCTTGCCG AAGTTGTGGG TACGCTGGAT ACGCTGATAA CTCCCGATAC CGGCACCATG
CATCTTGCTG CTCATCTGGG TACGCCCGTG CAGGCTTTTT TTCTTTCTTC CGCCTGGTGC
TTCGAGACGG GCCCTTACGG TATGGGGCAT AGAGTCTGGC AAGGACTGGT TCAGTGCTCC
CCGTGCGAGG AAAACCGGCC GTGTCCTGTG TCCGTGGCCT GCCTCGCTCC CTTTGCCGAC
AAACGGTTTC TTCAGGTGCT GGCCGGTAAG GCCGCAGAAG CGTATCCGGA GCATCTGTCC
GGTCTTGTCA GCATGTTTGA TGCCGTGGGT GTGACGTACC TGCCCGTGTT CGGAGAGGAT
ATGGCCGCGC CGGAGCGGCG GGAGCTGAGG CGGCTGGTTG CGGAGCATGT GTGCCCGGAA
ACGGCCAAAG CAGGGGCCGG GCTCATAGGC ATGAGCGCCA TGCAGGATAT GGCACTGCAG
GACAGTCTGG GCCGTTTTCT TTTCAGCGAG AGCGACTGGA TGCTGGACGC GGCGGGGCGG
GACTTTCCGG ATGTCGCGGA CGTACCGGGT TTCTGCTGA
 
Protein sequence
MKKALVIQLA RFGDIIQTKR LILTLLGRYD EVHLAVDSSL VPLARLVYPQ LCVHGLAAHR 
GAGSPAEVFA ASRMLFGRLA SERFDAVYNL NFSRLNFSLS ALFDADSVRG YAMRSGQPVV
PLWARLAFRW TRKRRIAPLN LVDFWAYFAR NPVCAGEVNP VAARGGGGIG VVLAGRDSRR
SLPPAVLAAC VQAVFEGTGG PAITLLGTAA EKTLARQLMR HFSGPMVERT VTLAGKTGYT
DLAEVVGTLD TLITPDTGTM HLAAHLGTPV QAFFLSSAWC FETGPYGMGH RVWQGLVQCS
PCEENRPCPV SVACLAPFAD KRFLQVLAGK AAEAYPEHLS GLVSMFDAVG VTYLPVFGED
MAAPERRELR RLVAEHVCPE TAKAGAGLIG MSAMQDMALQ DSLGRFLFSE SDWMLDAAGR
DFPDVADVPG FC