Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dtur_1118 |
Symbol | |
ID | 7083069 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dictyoglomus turgidum DSM 6724 |
Kingdom | Bacteria |
Replicon accession | NC_011661 |
Strand | - |
Start bp | 1137172 |
End bp | 1139232 |
Gene Length | 2061 bp |
Protein Length | 686 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 643458210 |
Product | hypothetical protein |
Protein accession | YP_002353011 |
Protein GI | 217967505 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4632] Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.699386 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTAAAC TTGTATTCTT TATATGCCTA TTATTGTTTA TCTCTCCATC TTTTACTGTC GAGAAATTCA ATATAGCCCC TAACGCTACT TACGAAAAGG TGGTAAGGGA TAAATTAGTC TACCATATAA CTAAGGTAGA TATGGATCCC CTTTTGGAGA TAGAAACAAT TATATCTCAA AACAGATTAT CTGAGATTCT CAAATATAAA AACTACGATC TTATAATAAA TGGTAATTTT TTTGATCCAA AAACCTTTGA ACCTGTAGGA TTAGTAATTA AAAACGGAGA ATTAATACAC CTTCCTATAA AAAGAGGAAT CTTTGGTTTG ACCTTTGATA ATAAACCTAT AATTGATATA TTTAACATTA ATTTAAAGAT AAAAGTTGAT GAGAATATAA TTCCTATAAA CGCTATTAAT TCTCCAAGGG GTATAGACGG TATAAATATA TTTACAAGAT ATTTTGGAAA AGAGACTCAA ATTAGAGAAA ATGCCTCGGC TGGAGTTGAT ATCGAGGTAG CCCTTGAAGA TAAGATTCCC TCCTTAGGTA AAACAAGTGG AGTAATAACA AATATCTACT ATGGGGTTAA AAGAACTCCT ATAAAAGAAA ATACATGTAT TATATCTCTT GGAGGTACCT CTTTAAAATA TCTTCCCCTA TTTTCTGTAG GAAAAAAACT TGAAATCATC AGTGAATGCA CTCCCCAAAT ACCCTTAAAA GAAGCCATAG GCGGAGGACC AATATTGCTC AAGAACAAAG AAATAGTACT TGGAAAGACT GAGGAATTAC CTTTTGATGA CAATATTGTA AATTCAAGAC ACCCAAGGAC CATCATAGGT ACAAAAGAAA ATAGTATTTA TTTTATTGTT ATTGAGGGAA GAAAAGAAAG CTCCATAGGT GTAACCCTTA AAGAGTCATG TGAAATTCTT AAGGAAATGG GGATTTCTGA TGCTATAAAT ATGGACGGCG GAGGATCAAG TCAGAAGTTA ATATGGGGAA AATTGATGAA TCAAGAAATA GAAAGACCAA TACCTGTAGC CTTAGGGATT AGAAATGTCT ATCCTTATAC ACAGCCTAAA TACTTAGCTT TTACAGAAAA TGATGATATT TATATCAAAA AAGGGGATAA GATAAAAGTA GAACTACTAC TTCAGGATGA AAATTATCAC CCTTACCCTA TTGATACTTC TATGCTAACT TGGACTTTTT CAAACCCTAT ATTAAATTTT GACATAAGTA ATATGACCTT AGAGGGAATA GATTTAGGAG AAAGCCTATT AACCTTGAAC TTAAATGGGT TAACTACTAC AAAAAAAATA TTCGTATGGG ATTTTGTAAA TTTAGAAATC AATACAGGAA GAGATCAATT CTTTCTTGGA GATACTTTTA TCCCTAAAGT GTACGTGGTA GATAACTTCA ATAGAAAAAT AGAACTTCCT ATTGAGAATT TAAAATTTGA CCCAAAATTT TTTATAAAAA ACAGAGAAAA ATTTACTGCC ATAAATGTTG GAAAGACCAA CTTAGAATAT ACTTTTAATA ATCTATGGGT TTCTGTTCCC ATTGAAATAT TTGATAAAAA GAATATAATC TTTGAGGACT TTGAGATTGA TAAAAATTGG ATTATAAAAG GTAAAAATTA TGATATAAGT TCTACCACTT ATACCCTAAC CAGCGATTCT TATTCCGGAA ATAGCGCCAT CTCAGTTACC TATTCTTCGC AGACAAATAA TTCCTTTATA TATTTGGAAC TAAACATTTT AGTACCACAA AATATAACAA AATTCTCTAT TTCCCTAAAA GGAAGTGGTG AGGGCTGGAT TAGAGCTTTA TTTTATGATA ACGATAATCA ACCCTGGGTT ATAGATATAA CAAATACCTC AAGTTTTGAT TTTCCTAAAT GGACTGTTGT AGAAAGAGAC TTAAAAGATT TAAAACCCTT AACAAGTAAA GTGCTTGTCT CACCCGTGTT TCCAATAAAA TTGGAAAGTA TTTATGTTGT AGGGCTAAAT CAAATGGGAG TAAAGGGCAA ACTGATAATT GATACTATCA GATTTTATTA G
|
Protein sequence | MFKLVFFICL LLFISPSFTV EKFNIAPNAT YEKVVRDKLV YHITKVDMDP LLEIETIISQ NRLSEILKYK NYDLIINGNF FDPKTFEPVG LVIKNGELIH LPIKRGIFGL TFDNKPIIDI FNINLKIKVD ENIIPINAIN SPRGIDGINI FTRYFGKETQ IRENASAGVD IEVALEDKIP SLGKTSGVIT NIYYGVKRTP IKENTCIISL GGTSLKYLPL FSVGKKLEII SECTPQIPLK EAIGGGPILL KNKEIVLGKT EELPFDDNIV NSRHPRTIIG TKENSIYFIV IEGRKESSIG VTLKESCEIL KEMGISDAIN MDGGGSSQKL IWGKLMNQEI ERPIPVALGI RNVYPYTQPK YLAFTENDDI YIKKGDKIKV ELLLQDENYH PYPIDTSMLT WTFSNPILNF DISNMTLEGI DLGESLLTLN LNGLTTTKKI FVWDFVNLEI NTGRDQFFLG DTFIPKVYVV DNFNRKIELP IENLKFDPKF FIKNREKFTA INVGKTNLEY TFNNLWVSVP IEIFDKKNII FEDFEIDKNW IIKGKNYDIS STTYTLTSDS YSGNSAISVT YSSQTNNSFI YLELNILVPQ NITKFSISLK GSGEGWIRAL FYDNDNQPWV IDITNTSSFD FPKWTVVERD LKDLKPLTSK VLVSPVFPIK LESIYVVGLN QMGVKGKLII DTIRFY
|
| |