Gene Dbac_2203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDbac_2203 
Symbol 
ID8377877 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfomicrobium baculatum DSM 4028 
KingdomBacteria 
Replicon accessionNC_013173 
Strand
Start bp2534781 
End bp2537054 
Gene Length2274 bp 
Protein Length757 aa 
Translation table11 
GC content60% 
IMG OID645001424 
Productglycosyl hydrolase BNR repeat-containing protein 
Protein accessionYP_003158701 
Protein GI256829973 
COG category[R] General function prediction only 
COG ID[COG4447] Uncharacterized protein related to plant photosystem II stability/assembly factor 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAGAGA GAAAGCATGG CGAATGCGCA GCGTATGGCC TGATGTTCTG GGCCGTGTTC 
GTCATCATGG GCATGTCCGC GCCGTCATCG TTTGCAACGG TGAATACGGG AGTGTTTTCC
CCGCGTGGCA TTGGCGGCGG GGGCGGAATG TACGTCCTCT CCATCTCTCC ATACGACGAG
GGGCTCATGT TTCTGGTCAC CGACATGGGC GGCGCGTACC GCTCGGAAGA CGGCGGCGAG
CGGTGGGAAC TGATCCATTA CACCCAGGGC TTTCGCTTCA TGCAGTTCTC CACGCCCCCG
GTGTTCTTCA AGGATCGCAT CTATTGGCGG TCGGGGAAGA CCGCACTGCG GGTCAGCACG
GACCAGGGAC GCTCGTGGCC CAAGGTGGAC GGCATGCCCT GGGGGAAAGA AGCGATTTTG
CACCTGACCG CGATTCCCGG CTCGCCGGAT GTTCTTCTCG TCGGCACGAA GGGCGGGCTG
TGGCGTACTG ATGACGACTG CGCGACCTGG AAAATAGTGC TGGACCGGCA AACAACCGAG
ACCGTCATCC TTGGCGAGGT CCTCTGCGCC ATGGCGGATC CGGATGTTGT CGCACGCAGC
CGTGACCGGG GCCAGACGTG GACGACGTCT CCCGTGGTCG TCGACGGGCG GGCAATCAAG
GGGCACCCTG CCATGGGGCT TACTGGCGCG GTGTCGGAAA AAGGTTCGCT CATGGTGGCC
AGTCTGCACA AGTTCGGCAT CATCCGCTCC ACGGACCAAG GGTCGAGCTG GGCCCTCGCG
CATTCACCCT ATGGGGGCGA AAATCACCTT GTCATGGCCC CCGGGCAGAT CGATGTCGTC
TATGCCGCCC AGACCGGATC AAACGTCAAC ATCAACCTGC TCCGTTCCAT TGACGGCGGT
CAAAGCTGGA GCCCGAGTTT CAGGATGGCC GATTTCGCCC GCAAATACGG CTGGAAGGCG
AATGTGGAAC CGACCTGGAT TCAGGACTCC CTGAAATGGA GTTATCTGAT CTCGCCCAAG
GGTTTTGCGG TCAATCCGCG AAAACCCGAG GAAGCCTTTC TCGTCACGCA GGGCGAACTG
TATCGCACCC GGGACGGAGG AGAGACATGG CATCCGCGCA TGGCCACAGA GGTTGCCGTT
GGGCCGGAGA AGTCGGCGCA TTACCAGAGT ATCGGGCTGG AGGTGACCAG CGCCTGGGGC
TATCATTTCG ACCCCCATCA CCCGGCCAGA CATTACATCA CCTACACGGA CATCGGGTTC
GCCAGGTCGC CGGACAGCGG AAAAAGCTGG CAATGGACGG CGCAGGGTTC GCCCTGGAGG
AACACGTTCT ATGATCTGGC CATCGATCCG GATGTGTCGG ACGTACTGTA CGCGGCGGTG
AGCGCCCGTC ACGACATTCC GCACCACTCC AATCTTTCGG TTACCAAATC GGGATATCGG
GGGCATCAGG GCGGGGTGGT CAGATCCACG GACGGCGGCC TGAGCTGGCA GGTTCCCTAC
AAGCCGGGCA GCTCCTCCGG GTTGCCCAAG CAGGTCTGCA CTACGGTGCT TCTCGATCCG
AAGTCGCCTA CGGACAGGCG GATTCTTTAC GCCGGAATCT ACGGGGAAGG CGATGATGAC
GAAGCGGGTG TGTACAAGTC CGTTGATGGA GGAACGTCGT GGGCAAAAAT TTCGCCGGGC
CCGGGCGTTG CGCCAAATCT GCATATCTAC AGGTTACGCA TGCATCCGAA GAGCGGAAAT
TTGTACTGCC TCATCACAGG GCTGCGCTCC AAGAATAATT ATTTCACGGC TCCAGGCGGC
GTCTGGAAGT CGACGGATGG TGGCGAAACA TGGAAGCCAA TCAGTAGTGG CGTCAACCTT
GTCTGGTGGA CCACGAATTT CGCCTGGGAC CCTGAAAACG AAGACGTGAT GTACGTCTCG
GCCGGGTCTT CGGAAGGGCA CTGGATGCAG GGCGGCATAT ACAAGACGAC GGACGGCGGA
GGCTCCTGGG CGCATGTCCT GACCGACGAG ATGATTCAAA AAGCGGCCCG GGGGGAGAGC
TATGAACAGA CCATGGCTGT GGCCCTTCAT CCTCAAAATT CCCGGTTGGT TTATGCCGGT
ACTTCCAGGA ACGGATTGCT CTACAGCCAG GATGGCGGGG CGACTTGGCG GCATTATTCG
GAATTTCCTG CTGCAACGGT GCAAAGCATT AATTTTGATC CTGCGGATAT GACACGGATC
ATTGTCACGA CCTTTGGACA GGGGGTGTTT GAAGGTCCCT ATCTTCCGCA ATGA
 
Protein sequence
MEERKHGECA AYGLMFWAVF VIMGMSAPSS FATVNTGVFS PRGIGGGGGM YVLSISPYDE 
GLMFLVTDMG GAYRSEDGGE RWELIHYTQG FRFMQFSTPP VFFKDRIYWR SGKTALRVST
DQGRSWPKVD GMPWGKEAIL HLTAIPGSPD VLLVGTKGGL WRTDDDCATW KIVLDRQTTE
TVILGEVLCA MADPDVVARS RDRGQTWTTS PVVVDGRAIK GHPAMGLTGA VSEKGSLMVA
SLHKFGIIRS TDQGSSWALA HSPYGGENHL VMAPGQIDVV YAAQTGSNVN INLLRSIDGG
QSWSPSFRMA DFARKYGWKA NVEPTWIQDS LKWSYLISPK GFAVNPRKPE EAFLVTQGEL
YRTRDGGETW HPRMATEVAV GPEKSAHYQS IGLEVTSAWG YHFDPHHPAR HYITYTDIGF
ARSPDSGKSW QWTAQGSPWR NTFYDLAIDP DVSDVLYAAV SARHDIPHHS NLSVTKSGYR
GHQGGVVRST DGGLSWQVPY KPGSSSGLPK QVCTTVLLDP KSPTDRRILY AGIYGEGDDD
EAGVYKSVDG GTSWAKISPG PGVAPNLHIY RLRMHPKSGN LYCLITGLRS KNNYFTAPGG
VWKSTDGGET WKPISSGVNL VWWTTNFAWD PENEDVMYVS AGSSEGHWMQ GGIYKTTDGG
GSWAHVLTDE MIQKAARGES YEQTMAVALH PQNSRLVYAG TSRNGLLYSQ DGGATWRHYS
EFPAATVQSI NFDPADMTRI IVTTFGQGVF EGPYLPQ