Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dbac_2826 |
Symbol | |
ID | 8378517 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfomicrobium baculatum DSM 4028 |
Kingdom | Bacteria |
Replicon accession | NC_013173 |
Strand | - |
Start bp | 3203984 |
End bp | 3205081 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 645002057 |
Product | protein of unknown function DUF21 |
Protein accession | YP_003159317 |
Protein GI | 256830589 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGGGG ATTTTTTTCT GCTGGTCGTC GTTGTTTTTT TGGCCTTGGG TTTGTCGTTT TTGTGTTCAG TGGCCGAGGC CGTGCTTTTA AGCATCACCC CGTCCTATAT CGCCAGTTTG CGTGAAAGGA ATCCCGCACG GGCGGAGGTC CTGAAGAAAC TGCGTCTGGA GAAAGTGGAT CAGTCCCTGG CCGCGATTTT GACCCTGAAC ACCATCGCCC ACACGGTGGG CGCCATTGTG GCAGGCGCTC AGGCTCTCGT GGTTTTCGGC AATGCGTGGA TCGGCCTTTT TTCGGCGGTG ATGACGGCGC TTATCCTGTT TTTGTCCGAG ATCGTACCCA AGACCATCGG TGCCGTGTAC TGGCAGGCCT TCGTCGGAGT GACGGCGCAT TTCGTCAACA TGTTGATCAC GGTTCTCTAT CCCCTTGTCT GGCTGTCCAA TGGGCTGACC AAGCTGATAT CCCGAGGGAA AAAAGCGCAT GTCTTCAGCC GCGAGGAGTT TATCGCCATG GCCGGAATCG GGGAACAGTC CGGACATCTG GAGGAGCATG AATTCCGAAT CATCCGCAAC ATCTTCCGCT TCGGGTCCGT AAACATCACC GCCGTGATGA CTCCGCGCAC GGTCATGACG GCCCTGCAGC AGGACATGAC CATTGCCGAC TCCCTGCCTT TTGTCACCAA GACCCCTTTT TCCAGACTGC CCGTCTACGG CGCGGATCTG GATGACATCA CCGGCGTCGT GCTCAAGGAC GAGGTGCTGA TCTGCATGTC CCGGGGCGGC TGCGAGGGCT CTTTGGAATC CTTGAAGCGC CAGATACTTT CCGTGCCTGA CAGCCTGTCC CTTTCTGATC TGTTGGAGTT TTTTCTTGAC CAGCGTCAGC ACTTGGCCAT CGTCTTGGAC GAATACGGTG GGACTCGGGG ATTGGTTACC CTGGAGGATG TGGTGGAGAC CCTCTTTGGC ATGGAGATCG TGGATGAGAT GGACAGCGTG GCCGACATGC AGGCCCTGGC CCGGCAGCAA TGGAAAAAAA GAGCTCAGTC CCTTGGCATT TTCGAACAGG ACGAGGACGT TGAACCCGGT AAAAGCATTC GTTCGTGA
|
Protein sequence | MSGDFFLLVV VVFLALGLSF LCSVAEAVLL SITPSYIASL RERNPARAEV LKKLRLEKVD QSLAAILTLN TIAHTVGAIV AGAQALVVFG NAWIGLFSAV MTALILFLSE IVPKTIGAVY WQAFVGVTAH FVNMLITVLY PLVWLSNGLT KLISRGKKAH VFSREEFIAM AGIGEQSGHL EEHEFRIIRN IFRFGSVNIT AVMTPRTVMT ALQQDMTIAD SLPFVTKTPF SRLPVYGADL DDITGVVLKD EVLICMSRGG CEGSLESLKR QILSVPDSLS LSDLLEFFLD QRQHLAIVLD EYGGTRGLVT LEDVVETLFG MEIVDEMDSV ADMQALARQQ WKKRAQSLGI FEQDEDVEPG KSIRS
|
| |