Gene Information |
Locus tag | Daud_0020 |
Symbol | |
ID | 6025578 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Desulforudis audaxviator MP104C |
Kingdom | Bacteria |
Replicon accession | NC_010424 |
Strand | + |
Start bp | 24564 |
End bp | 25625 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641592873 |
Product | polysaccharide biosynthesis protein CapD |
Protein accession | YP_001716221 |
Protein GI | 169830239 |
COG category | [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1086] Predicted nucleoside-diphosphate sugar epimerases |
TIGRFAM ID | |
|
|
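The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 24564, End bp 25625.
length_bp = gene_length(24564, 25625)
print(length_bp)                  # 1062
print(protein_length(length_bp))  # 353
```

Both results agree with the Gene Length (1062 bp) and Protein Length (353 aa) rows of the record.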
Plasmid Coverage Information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGAAGA TGGTGGTTCA ATTGAAAAAA ACTGATTGGT ACGACCGCTT TTTTAGTCAG CGCAAAATAC TGGTCACCGG TGGCACGGGT TCCATCGGAT CGGAACTGGT AAAAAGCCTG TTGCATCATG GCGTTACCCG AGTGGTAGTC CTGAGTAAAG ACGACAGCAA GCAGTACATG ATGAAACAGA GGCTGATCTC CAAGGAAAAC ATTTCCTTTA TCCTGGGAGA TGTTAGGGAA TACGAAACTT TGAAGGAAGC GACCAGGGGT ATGGACCTGG TTTTTCATAC AGCCGCCTTG AAGCAAATCG GCATCTGTGA AGATAATCCC AAGGAAGCGA TACTGACCAA TACCATGGGC ACTGTCAATA TTATTAGAGC GTGCCTGGAG AACAGGGTTC AGAAACTGAT TAATATCAGT ACCGATAAGG CAGTACATCC CACCAGTATC ATGGGCGCCA CCAAGTTCTT ATCGGAGAAG CTGATCCAGG AGGCCTCCCG CAGGCCGGAC GTGCAAACCA GATTCTGCTC GATACGCTTC GGCAATGTCC TTAACTCCAG GGGTTCGGTT ATCCCCTTGT GGCTTGAACA GTACCGCTCT GGTCAACCGC TGCAAGTTAC TGATTTGGCC ATGACCCGTT TTGCCATGAC CATCCCACAG GCGGTTGAGT TAATCAAAGA ATCTGCTTTC CTTTGTCAGG GAGGAGAGAC ATTTATCTTT AAAATGAAAA GCGTGCGCTT GGGTGACTTG GTGCAAGCCA TGAAAGCCGT GCTTTCAGAT TGCAACATGA ACGGAGCCCA GTTCGTGCTA ACCGGACCGC GGCCGGGAGA AAAGAGCTTT GAACAATTGC TTTTTGCAGA CGAAGCAAAA CGCTTGTTTG AAAATGAACA GCTATACGTG GTTTTGCCCC ATCAGAGCAG CAGTACCCCC CAGGCCCGCT TTAAACGGGC ACGTCACAAT GAGTACCGCT CTGATTTGGC TCAAAAATGG AGCGTACCTG AATTAGTGGC CTTACTCCGT CCGGTAATAA AACAGTCCCT GGGAGATGAA ACAAATGGAT GA
|
Protein sequence | MQKMVVQLKK TDWYDRFFSQ RKILVTGGTG SIGSELVKSL LHHGVTRVVV LSKDDSKQYM MKQRLISKEN ISFILGDVRE YETLKEATRG MDLVFHTAAL KQIGICEDNP KEAILTNTMG TVNIIRACLE NRVQKLINIS TDKAVHPTSI MGATKFLSEK LIQEASRRPD VQTRFCSIRF GNVLNSRGSV IPLWLEQYRS GQPLQVTDLA MTRFAMTIPQ AVELIKESAF LCQGGETFIF KMKSVRLGDL VQAMKAVLSD CNMNGAQFVL TGPRPGEKSF EQLLFADEAK RLFENEQLYV VLPHQSSSTP QARFKRARHN EYRSDLAQKW SVPELVALLR PVIKQSLGDE TNG
|
| |
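The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a field could be cleaned, summarized, and translated follows. It is not the database's own tooling: the helper names are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares (table 11 differs only in the start codons it permits). Only the first three blocks of the gene sequence are used here as sample input.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G);
# translation table 11 uses the same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(blocked: str) -> str:
    """Remove the whitespace between the 10-character display blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence field.
sample = clean_sequence("ATGCAGAAGA TGGTGGTTCA ATTGAAAAAA")
print(len(sample))        # 30
print(translate(sample))  # MQKMVVQLKK
```

The translated sample matches the first block of the protein sequence field. Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare against the GC content row.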