Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dfer_5520 |
Symbol | |
ID | 8229135 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dyadobacter fermentans DSM 18053 |
Kingdom | Bacteria |
Replicon accession | NC_013037 |
Strand | + |
Start bp | 6647569 |
End bp | 6650607 |
Gene Length | 3039 bp |
Protein Length | 1012 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 644933367 |
Product | hypothetical protein |
Protein accession | YP_003089876 |
Protein GI | 255039255 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATTTCC TGAGTGCTGC ATATTTCGAT AGTACGATAA CCGTGGACGC CAAAGTCCGC GCGAGTAGTA CGTATGCCCT TCCGGCCGTA AATACCAGCA ACGTTGCATA TTCCAGCGGC GGGTTTGAAT CGTATGTCGC AAATGATCAG TACACGGCGG GCGCTCGCCC TGGCTACGGG TTTCATGCGG CCGGAAACTT CGGGGTATTT CTATACGCAC AGTCAGGATC AGAGCTCCGC ATTCGCAGCA ATACCGGCAT TGATAATACG CTTTGGCACT CAGGCAACGC CCGTTCTGAT TCGCAGAACG ACACGAGGTA TTCTCAAATC GGGCACACGC ATGTAATCGC TGATGTAACG GGTCTGCAAA CCGCTTTGGA TGGCAAGGTT TCCCGCAGCC CGTCGCAATT TGACTGGCAC GGATTCTCAT CGATCGGACT GGCGACTTTC AATACTGCAT CGACAAACGG TCCGGCTGGC GCTGGTGTCT ACCATGGTCT TTTCATCCCG CATGCAACAG GATCATCTTA CGGCACAAAC ATCGCTTTCC GCAATGGCAA TTTCTACATC AAAAGCCTGG AAAATGGCGC TTGGGGGAGC TGGATTAAAA TTGCCTCCGA AACCTATGTT AATAGCCAGG GGTTTGTAAC ATCGGCGGGC CTCACTGGAT ATGTCCAGAA CACAAGGCAG ATCAACACAG GGGCGGGACT GCTCGGCGGC GGACCGCTCA GCGCGGATTT GAGCCTTACC TTTGATACCA CGTTTGGCGA TGCCCGGTAC TCGCTTTCAT CTCATACACA CACCGCGGCT CAGATTACCG ACTTTTCAAC TGCTGGCCGG GCACTTTTCT CTGTCTCAAA TGGCATCTCC TACACCAGCG GGACCGGAGC ATTTGCGCTC ACCTACGGAA CGGCGGCTAA TACCGTGGCC CAGGGTAACG ATTCTCGTAT TATCAACGGC CAGGCCGCGC ACATGCGCTG GGTCAGTACG CCCGGCGAGT TCTCGGACAT CAACAACATC AATAAAGGAG GGGTTGCGAC TTATGGAACA CCGGCAGCGG GCAGGCCGAC TACTTACGGC ACGACTTTCA CTTTTGTTGG CAATAGCAGC ACAGGAGACT CGGCCGGCGG AGCGTATCTG AATCAAATTT TGGTTGGCAC CACGGCTGAT TGGTATGTCC GTCATGGATT AGGAGCAACA TGGGGAGCTA CCTACCAGAT CTGGACAGCC AAACAGTTCA ATATTGCCAA TTATGCGACT ACTGCCAGTC TTTCGGATTA TGTGATGAAT AACACGACCG GGTTGAGCTG GACAACAGCC CATTCGGACG GCAAGCATAA GTTTCACGGC AGCTCTACCA ACAGCCCGAC AGGCTTTTAC CATGTCGGAT TTACAGCCAC TTCGCCTGAT GCACAAACGG CCGCTTCGCT GGCATTCAGG AACGGCAATG GTTATTTCAG AACGGTTGAA GCTGGCGTGG TTGGATCGTG GCAACAGTTT CTTACATCCG CAAATCTTTC CGGGTATGTG CCGACTTCCC GGACCATAAC TATCAATGGA GTAACAAAGA ACTTATCTGC AAATCAGGAT TGGGGTACCA TTGGGGGAAG TGGGATACCA GGGAGTGTTA GCGATCGGAC CATCCCACGG TATGAATTGT CGTCCGGATC TTTTGTAAAC AGCTCCATTG ATGAAACTGT TAGCACTGTC GACATCAGAA AATCCACTAC GATTAACAAG ACTGGCAGCA GCTATTACAT TCAGATGGGG GGCTTGTCGA ATACACTTGG CATCAGCCTC TATAATTCCA CCGGATCTAA CTCTTTTACG TTCAGTCGGT ACAATGACGA ATTCAGCTTC TCATCTGCGA CCGGTGTAAC AATCTTTGAA GTAAAGTCGG ATAGTCTGGT CAATTTTGTG ATGCCGAATG GTGTAGCGCC ATTCAAAGTC AATAGCTCTA CTCTGGTAAC GAGTTTGAAC GCGGATCTGC TGGATGGTCA GCACGCATCG GCATTTGCGT CGGCATCGCA CACGCATACC GCGTCCCAGG TTACCGACTT TACCAGCGCG GCCAGGGCTG CTATCTCTGC AACGGGAAGT ATTTCTTACA ACAGTGCAAC CGGTGTTATT TCTGGCGGTG GCTCTGACAC GCGATGGATA TCAACCCCGG TAAATTATTC GGATATAAAT ACCATTACTA GTTCCGGTGT TTTCTCGTGG GGCAACTCGG CAGCAAACCG GCCGTCGGCT TTTGGGACAA TGCTAAGTTT TATCGGTGCA TCTCCCGACG GGAACTTAGG TTCTAACATG TGGATGTCGC AATTGATGTG CGGGACAGAT GCAAACTGGT TTGTCAGGTA CGGGCAGGCT GGTACTATTT ACCAGGTGTG GACTTCAAAA CAGTTTAACA TCGCAAATTA TGCGACAACC TCCTACGTCG ACTCTAATAC GTATACCAGG GGGTACATTG ATTCCGAATT AAGCATCAGG GCGCTTAGCG GTATTTCATT AACCGGCCAG CTCAGTATAA CCGGCGGGGG GACACTGACG GCAAGTAGAA CTTACCAACT GGTCAATGAT AGCGGCAGTC CCGGGGCAAA TAAACTTTAC GGGACTAATG GTTCCGGGGT GAAGGGATGG TATGATCAAC CTTCGGGCGG CGGCGGCGCC TACTCCGCTG GAAGTGGTAT CGGGCTATCT GGATCAACCT TCTTTGTGAA CGGCGGTACC GGACTTAATC AGGATGGCGA TGGGCTTTCG CTCGATGTCG GCTGGACGGA CGGGAGATAT TTGCGAGGAA GCGGCACTCC ATTTCGAGTG GCATATTGGG GCGCAGGGAA CACCCTCACT TCCAGTAGCA TCATGACCGA TGGAGGGAAT CAAATTCAAA TTAGCGGACA TCTGGGCATA AGCAATGGCA TGTTAATCCT TCCGCCATTT GCCAGCGAAC CTGTGGCGCC CGTTGCTGGC TCTATGTACT TCTCCACAGG AAACTACCGA CCGAGATACT ACAATGGCAC CGTATGGGTA AATATTTAA
|
Protein sequence | MDFLSAAYFD STITVDAKVR ASSTYALPAV NTSNVAYSSG GFESYVANDQ YTAGARPGYG FHAAGNFGVF LYAQSGSELR IRSNTGIDNT LWHSGNARSD SQNDTRYSQI GHTHVIADVT GLQTALDGKV SRSPSQFDWH GFSSIGLATF NTASTNGPAG AGVYHGLFIP HATGSSYGTN IAFRNGNFYI KSLENGAWGS WIKIASETYV NSQGFVTSAG LTGYVQNTRQ INTGAGLLGG GPLSADLSLT FDTTFGDARY SLSSHTHTAA QITDFSTAGR ALFSVSNGIS YTSGTGAFAL TYGTAANTVA QGNDSRIING QAAHMRWVST PGEFSDINNI NKGGVATYGT PAAGRPTTYG TTFTFVGNSS TGDSAGGAYL NQILVGTTAD WYVRHGLGAT WGATYQIWTA KQFNIANYAT TASLSDYVMN NTTGLSWTTA HSDGKHKFHG SSTNSPTGFY HVGFTATSPD AQTAASLAFR NGNGYFRTVE AGVVGSWQQF LTSANLSGYV PTSRTITING VTKNLSANQD WGTIGGSGIP GSVSDRTIPR YELSSGSFVN SSIDETVSTV DIRKSTTINK TGSSYYIQMG GLSNTLGISL YNSTGSNSFT FSRYNDEFSF SSATGVTIFE VKSDSLVNFV MPNGVAPFKV NSSTLVTSLN ADLLDGQHAS AFASASHTHT ASQVTDFTSA ARAAISATGS ISYNSATGVI SGGGSDTRWI STPVNYSDIN TITSSGVFSW GNSAANRPSA FGTMLSFIGA SPDGNLGSNM WMSQLMCGTD ANWFVRYGQA GTIYQVWTSK QFNIANYATT SYVDSNTYTR GYIDSELSIR ALSGISLTGQ LSITGGGTLT ASRTYQLVND SGSPGANKLY GTNGSGVKGW YDQPSGGGGA YSAGSGIGLS GSTFFVNGGT GLNQDGDGLS LDVGWTDGRY LRGSGTPFRV AYWGAGNTLT SSSIMTDGGN QIQISGHLGI SNGMLILPPF ASEPVAPVAG SMYFSTGNYR PRYYNGTVWV NI
|
| |