Gene Dbac_1954 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDbac_1954 
Symbol 
ID8377627 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfomicrobium baculatum DSM 4028 
KingdomBacteria 
Replicon accessionNC_013173 
Strand
Start bp2249280 
End bp2250293 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content58% 
IMG OID645001179 
ProductApbE family lipoprotein 
Protein accessionYP_003158458 
Protein GI256829730 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.355281 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCATG GGACCCACAC TCAAGACCGC CGTGATTTTC TGAAAAAACT GGCTGTGCTC 
GCCGGTGGAG CAGCACTGGC TCCTGCGTTG CGCGTGCTTC CGGCCATGGC TGCGAGCGGC
TTGGTCATGA CCACCGAAAA GCGCATGCTC ATGGGAACGA TTGTCGGCAT GACCGTCATG
GCCCCAAGTA AGAATCAGGG TCAAGAAGCC ATTGGCCGCG CTTTTGATGA AATGAATCGT
TTGATCGGCA TTTTGAGCCG ATTTGATTCC AATACCGCCT TGTCCGCCCT GAATGTTCAC
GGACGCCTTT CCGGATCTCC GCGGGAACTG CTGGACGTCC TGGCTCACGG AAGCACGCTG
CACCGTCAAT CCGGTGGACG CTTCGACATG ACTGTGGCAC CTGTTGTCAA CCTCATGGAA
CGCACCAAGG GGCAGCCTGA CGCAAAGGAA CTTCAAGAGG CCCTGGCTCT GGTTGATTCC
ACCCAAGTGC GGCAGAGCGG ATCGGATTTG AAGTTCACCA CATCCGGGAT GAGCGCGACT
CTTGACGGAA TAGCCAAAGG ATACATTGCC GACAAAGCGG CAGAAATGCT GGGCGCGCTC
GGAGTTGCTC ATTACATGGT CGATGCCGGC GGAGACATTC GCGTCCAGGG CTCGCCCAAA
GGTGACGGTC GTCCGTGGCG CATTGCCATC GAAGATCCAA ACAAGCAGGG CGATTATCCT
GCCGTCATCG AAATGCGTTC GGGCGCCGTG GCAACATCCG GCGGTTATGA AGTCTTTTTT
GATTCTTCCC GCAAATCGAC TCACCTGATC AACCCCGAGA CCGGCGCTTC CCCGCAGTAC
ATCAGAAGCG TGAGCGTCCA GGCTCCCACG GTTATGCAGG CTGACGGCCT GGCCACGTCG
CTGAGTCTCA TGTCACCGCG CGAGGCTTTG CGTCTGACCT CATCGCTGCC CGGTCATTCC
TGTCTGCTGG TGACCTCCTC CGGTGCGCGC CTTGCTTCTC CTTTATGGAG CTAA
 
Protein sequence
MKHGTHTQDR RDFLKKLAVL AGGAALAPAL RVLPAMAASG LVMTTEKRML MGTIVGMTVM 
APSKNQGQEA IGRAFDEMNR LIGILSRFDS NTALSALNVH GRLSGSPREL LDVLAHGSTL
HRQSGGRFDM TVAPVVNLME RTKGQPDAKE LQEALALVDS TQVRQSGSDL KFTTSGMSAT
LDGIAKGYIA DKAAEMLGAL GVAHYMVDAG GDIRVQGSPK GDGRPWRIAI EDPNKQGDYP
AVIEMRSGAV ATSGGYEVFF DSSRKSTHLI NPETGASPQY IRSVSVQAPT VMQADGLATS
LSLMSPREAL RLTSSLPGHS CLLVTSSGAR LASPLWS