Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dhaf_4447 |
Symbol | |
ID | 7261476 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfitobacterium hafniense DCB-2 |
Kingdom | Bacteria |
Replicon accession | NC_011830 |
Strand | + |
Start bp | 4715420 |
End bp | 4718161 |
Gene Length | 2742 bp |
Protein Length | 913 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643564368 |
Product | Ig-like, group 2 |
Protein accession | YP_002460888 |
Protein GI | 219670453 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4632] Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000209553 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAGAT TAAGAAAAAT TCCGACCCTG TTTCTTGCTG CGTTTTTAAG TGTGACGGCG CTTCCCAACA CGGTAATGGC CCAATTGCCC GTCCTCTACC AGGATCACTC CCAACAAACC GTTACGGACG GCGTAACCCT GGAGAACATA TCCCGTTTTA CCACCGGCGG CTGGTTGAAT ATCAATGTGC TGCGTGTGGA TATGACTAAC CCTTATGTGA AAATTGACAC CCTAAGCAAT GACTCGATCA CCGATGACCT GGTCAGCATT TCCGCCCTGG CCGAAAAGGA AGGGGCTGTG GCTGCTGTCA ACAGCAGTTT CTTTAATCCG TTCACCGCCG GCAAAGGCTA TGCAGACGGC CCCACCGTCC GGGCCGGCGA TCTTTTGTCC ACCTCCGCCT GGTATAACCG GAGCAAAAAT GAAATGGCCT CCCTTTCCGT TGACTATGCC AATCAGCTGC TTTTTCACTA TTGGAAAAAT GACCTTACCC TCATCACCGG AAACGATACC GCCTTTGCCG TTACCCAGTA CAACCAGCCG AGCAGGCAGG ACTACCAAGA CCTTACGGTC CTGGATCGCA AGTGGGGACC TGTGGCTATC GGCGCCACCG AAAACTGCCC CGATCTGGTT GAAATCGTCG TAAGCGGGAA CACAATCCTG GAGATCCGGG AAGGACAGCC TGCCGCCGAG ATTCCCGAGG AAGGTTTCGT CGTCGTCTCC CGAGGCGAGC AGGCGGCCAA ACTTCTGGAA CAGGCCGCCC CGGGCGAACA GCTCCAATTC CAAGTCACCT CCACGCCGGA CTGGAACGAC CTGAAGATGT CCACCACCGG AACCTCCCTT CTTATCCAAG ACGGAGAAAT CCCCGCCACT TTTTCTTACA GCACCGCCAG CTTTAATCAA CGGAACCCCC GGACCATGGC CGGGAGTACT GAAGATGGGA GCGAACTCAT CCTGGTTACA GTTGACGGAA GGCAGGATAA CAGCATCGGA TTGACCCAGC AGGAATCCGC CGAGCTGATG CTGGAATTAG GGGCGTATCA GGCTATTATG TTTGACGGCG GCGCATCCAC CACCATGGCC GCCCGACAGC CCGGGGCCTT TTCCCCTGAT GTAGTCAACC TTCCTTCCGA AGGCATCCTA AGAAATGTGG CATCAGGCAT CGGGATTTTC TCCGCAGCCC CCGCCGGCCG GCTGGACCGC CTGATCATTG AGACTGAAGA CACCAACATA TTTGCCGATA CTTCGAGAAC TTTTTCCCTT AAGGGCATCG ACCGCTATGC CAATCCGGTG GAGCTCGACC CAAGAGAAGT TCAGTGGGAT GTCAGCGGGA TCGACGGCAG CTTTCAGGAC AATATCCTCT ATCCGGCTTC CCCAGGCCCA GGGAAAGTAA CAGCCACCCT CGACGGACTG GAGGCGGAGC TGGACATCCA AGTCTTAAGC GCCCCGGTGC AGCTTACCCT CAATCCGGCC AAATTCGACC TTCCTTTGTA CCAAAATAAA GCCATCCGGG TAAAGGGTCT CGACCCCCAA GGTTATGCCG CGGTCATCGA AGGCCGGGAT GTTCAATGGA ACGTTGACGG CGACATCGGT AGTTGCGAAG CCGGCGTCTT CACCCCCGTC ACCACCGGTA CCGGCACTAT CCGGGCCGCC GTCGGCGACA CCTATGCCTA CAGCGCCGTT GCCGTTACTT TGGATTCCAT GGAGCTCCTG CATCCCTTCG AAAGCTCCGG CCGGTCGGAT GAGCCACGCT TCCAAGCCGA CCCGCAAACA GGCCAGGGTT ATTTCGAACT CAACGCTGAT CAATTCTACG CCGGGGAGTC TTCCGGACAG CTGGTGTATG ATTTCTTATA TGGTAACTCT GAGGACCAGA TCTCTGTCCT CTTTGCCGAG CAGGGGCTCC CCCTTGATCC GGCCACTACC GGGTTATCCC TGTGGCTTTA TAATATCAGC CCCAACTCGA ATCGCATCAA AGGAGAAGTC ATAGACTCCG CCAATGTGAA ACACACCGTC GAATTCGCCT CCGACCTCAA TTGGACCGGC TGGAAAGAGA GCACAGCGTC CTTATCCGGG ATCGACTCTC CCGCTTATCT CACCAGACTC TACATAGAAA ACACCGATCC CGCCAACCCT TGGGGGAAGA TCTATTTTGA CGACTTGTCG GCTCTGATCC AGAAGCGCCC GACTATCGAT CCGAGCAGCA TTCCCGCAGA TACCCTTCCC AAAGACGAAG CGAACCGGCA GGCTGATTTC ACCTCCGGTC CCGACAATTT CCTGTTTTCC TTGCTCACCG GCGGCAATGC ACCGGGGGTT TACACCGCTC CCATCACCCA AGTGGGAGCC GGGGATGCCC AAACACCTAT ACTGGCCGCC GGTTCAGGGT ATCAATCCTC TACTTATCAA AACAGCACCT TCCTCCAGCT GGATGTAAGC CAGGGCGGGC TGAGGAGAAG CGACCCACAG CAATGGACCC GGCTGTTCAC AGCCCTGGAT GAAATCCAGA GCGCCAATGT CTTTCTCCTC ATGTCCCTCA GCCCGGCCGA TTTCATCAAT GCCAAAGAAG CACAGCTCCT GAAAGATACG CTGGCCAATT GCCGGGAAGC AACCGGCAAG AACATTTGGG TCCTTTTCCC TGGTCCTGCG GATCAAAGCG AGCTTGACCG CGGCGTGCGC TATATCAGCG TTGCCCAATC CCTGCAAGTG ACGATGCTGG GAGAGGACAT TTCCTATGAG TTCGAGCCCT AG
|
Protein sequence | MKRLRKIPTL FLAAFLSVTA LPNTVMAQLP VLYQDHSQQT VTDGVTLENI SRFTTGGWLN INVLRVDMTN PYVKIDTLSN DSITDDLVSI SALAEKEGAV AAVNSSFFNP FTAGKGYADG PTVRAGDLLS TSAWYNRSKN EMASLSVDYA NQLLFHYWKN DLTLITGNDT AFAVTQYNQP SRQDYQDLTV LDRKWGPVAI GATENCPDLV EIVVSGNTIL EIREGQPAAE IPEEGFVVVS RGEQAAKLLE QAAPGEQLQF QVTSTPDWND LKMSTTGTSL LIQDGEIPAT FSYSTASFNQ RNPRTMAGST EDGSELILVT VDGRQDNSIG LTQQESAELM LELGAYQAIM FDGGASTTMA ARQPGAFSPD VVNLPSEGIL RNVASGIGIF SAAPAGRLDR LIIETEDTNI FADTSRTFSL KGIDRYANPV ELDPREVQWD VSGIDGSFQD NILYPASPGP GKVTATLDGL EAELDIQVLS APVQLTLNPA KFDLPLYQNK AIRVKGLDPQ GYAAVIEGRD VQWNVDGDIG SCEAGVFTPV TTGTGTIRAA VGDTYAYSAV AVTLDSMELL HPFESSGRSD EPRFQADPQT GQGYFELNAD QFYAGESSGQ LVYDFLYGNS EDQISVLFAE QGLPLDPATT GLSLWLYNIS PNSNRIKGEV IDSANVKHTV EFASDLNWTG WKESTASLSG IDSPAYLTRL YIENTDPANP WGKIYFDDLS ALIQKRPTID PSSIPADTLP KDEANRQADF TSGPDNFLFS LLTGGNAPGV YTAPITQVGA GDAQTPILAA GSGYQSSTYQ NSTFLQLDVS QGGLRRSDPQ QWTRLFTALD EIQSANVFLL MSLSPADFIN AKEAQLLKDT LANCREATGK NIWVLFPGPA DQSELDRGVR YISVAQSLQV TMLGEDISYE FEP
|
| |