Gene Information |
Locus tag | Daud_1203 |
Symbol | |
ID | 6026682 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Desulforudis audaxviator MP104C |
Kingdom | Bacteria |
Replicon accession | NC_010424 |
Strand | - |
Start bp | 1255659 |
End bp | 1258280 |
Gene Length | 2622 bp |
Protein Length | 873 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 641594018 |
Product | CBS domain-containing protein |
Protein accession | YP_001717346 |
Protein GI | 169831364 |
COG category | [J] Translation, ribosomal structure and biogenesis; [R] General function prediction only; [T] Signal transduction mechanisms |
COG ID | [COG0617] tRNA nucleotidyltransferase/poly(A) polymerase; [COG0618] Exopolyphosphatase-related proteins; [COG3448] CBS-domain-containing membrane protein |
TIGRFAM ID | |
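The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated in the record. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(length_bp: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is an unspliced, in-frame sequence; if has_stop is True,
    the final codon is a stop codon and encodes no residue.
    """
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = length_bp // 3
    return codons - 1 if has_stop else codons


# Values taken from the record above.
length = gene_length(1255659, 1258280)
print(length)                           # 2622
print(expected_protein_length(length))  # 873
```

Under these assumptions the result agrees with the listed Gene Length (2622 bp) and Protein Length (873 aa).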
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.452152 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGTGA TCACAACCCA TACCAACACC GATTTCGACG GTCTGGCGGC GATGGTGGCT GCGCAGAAGC TGCATCCGGA CGCTGCACTG GTGCTACCGG GCAAGATGGC TCAGAACGTA GAGGAGTTTG TCGCACTTCA CGTGGATATC CTCGAAATCC GCAAACCGAG TGAGATCAGG TTTTCCGACA TTGACTTCCT GATCGTGGTG GACACCATGC AGCCCGGCCG TTTGGGGCGC CTCGCGGAAA TTCTGAAGAA CCCCAGGCTC CGGGTTCACG TTTACGACCA TCATCCCCGG GTGGAGGGCG ACCTCAAGGG TGAATTGGAG ATCGTTGAGC CCGTGGGCGC CACGACCACG CTTCTGGTCG AGGAGATCAA ACGCCGGGGG GTCCCGATCA CACCCTACGA GGCCACGGTG CTGGCGCTGG GCATTTACGC CGATACAGGG TGTCTGATTT TCTCCAGCAC CACCGACCGG GACGCGGAAG CGATTGCTTT CCTGCTGGCG CAGGGAGCGA ATTTGGGCGT GGTAGCCAAT TTCCTGGGAC GTCCCCTGAC CAACGAGCAA AAAAAGCTCC TGCGGGACCT CTTGATTTCG GCCGAGAGGC ACTATGTCAA CGGAGCCCGG ATTCTGGTTG CGCGTACACG AGTCGATGAG TTCGTCGGCG GTTTGGCGCT CTTGAGCCAT AAGCTGGCCG AGCTTGAGCA GTTGGATGCC GTTTTTTGCG TGGTGGCCAT GGAGGACCGG ACGCACCTTG TGGGACGCAG CACCCTTCCG GAGGTGAACG TGCGGGATAT CCTGGCCCAC TTCGGCGGCG GGGGGCACCA TGCGGCCGCC TCGGCCACCA TCAAGAACAC CGACGTGGAA GCGATCGCCG CCGAACTCCT GCAGGTGGTG AGGGAAATGG TGGTCCCCCC GCTTTTGGCC GGGGATATCA TGACCTCCCC CGTGAAAAGC GTGTCCCCTG AGATCACTGT TTCCGAAGCC AACCGGATAA TGCTCCGCTA CGGCCACCGC GGGATGCCGG TGGTGTCGGA CGGCTCTCTG GTCGGGGTCA TCTCCCGGCG GGACGTGGAG AAGGCCTTGC GGCACAATCT GGGTCACGCT CCGGTCAAGG CTTATATGAG CAAAAACGTG ATGACCGTTT CGCGAGACAC GCCGGTCACC GAAGTGCAGG CCGTAATGAT CGAAAACAAT ATCGGCCGGC TTCCGGTCGT GGATAACGGA TACCTGGTCG GGATCGTTTC CCGGACCGAC ATTCTGAAAA CGCTGCATCC CCAGTTCAAA CCGCGGTTTT CCACGCTGTA CGTTAAGACC CGCGTGCCGT CCTACTACCG GGACGCGGCG GAGCTGATAC GCCGGAACCT GAAGCCCGAA CAGGTAGACC TTTTAACGGT CGCCGGGGAG GTTGGTTCGG CGACCGGTTG CGCCGTCTAC CTGACCGGGG AAATGGTCAG GGACGTGTTT CTGGGCTCCC CGGGAGGCAA CCCGGAGTTG GTGGTGGAGG GCAACGGGCC GGCCTTTGCC GAAGCACTGA CCCGGAAGGT CGGGGGGCGG CTGCACTCGG ATGATCGGTC AGAGTATACA ATCGCCCATA ACGGGCGGAA AATAACGATC GTGGCCCTGC GGACCGGGTT TTCGGAACAC GGTTCGGAAC TGCCCAATGA CAAAACCTCT GCCCTGCGGC ACGAGCTCTA CCGGCGAGAT TTCACGGTCA ATGCCCTGGC CGTCGGTTTG AACCCGGACC GGTTCGGTGA GGTGATCGAC TATTTCGGCG GCCGTGACGA CCTGCACCAC 
GGGGTGGTGC GGGCCTTACA CAGCCACAGC TTCGAGGAAG ACCCGGTGCG GCTTCTGCAG GCGGTCCAAC TGGAGCAGCG GTTCGGGTTT AACATCGAAC GGGAGACGCT GAAGCTGATC CGGGAAGCGG TGCGGGATAG GCTTTTGTCC CAGGCTCCGC CCGACAGAAT CTGGGCCGAG TTCCGGCGCC TGCTGAAGGA GCCGCGGGTT CCGGCCACAC TGGCCCGCCT GGCCCAACTC AATCTCTGGC CGTCTCTGTT CCCGGACATC CTGTACTGGG AAGTCCAGCC GATGATCTCG GGAATACCCA AAGCCCTTGA CAGTCTTGGT TCCTGGAAAG TGCCGCAACC GGCGGAGCCC TGGCTCTGCT ACTTCATCGC CATCCTGCAC TGGAAAAACC TGGTCGTCGT GGACGAACTC TGCCGGTCCT ATCGTCTACG GAAGAGCCAG ACCGACAAAA TCCTGTACAC GATCGCTGGC TGGCGGTCCG CGGTATCCAG GCTTTCGGCG CCGAAACCGC CGGCGGTCCG GACCGCGGCC TTGAGTCTCG TGAACCTGCC GCGTGAGGGC TACCCGATCG TGCTCCTCAT GCTCGAGAAG AAGGCCTGGA AAGAACGGTT CCGCGAAGCA CTGGTCACCT TGTACGAACA TAAACCGGTC ATCACGGGCA AAGATATTAG GAATCTGGGC TATCGGGAAC AAACCGGCAT GAAAAGGATA CGGGAAGCGG TGTGGCGGGC TAAACTGCGC GGTGAGGCGA CCACCCGGGA GGAAGAAATG CGCTTGGCTC GGGAGGCAAT GAATTGGGAG GGGGAACGCT GA
|
Protein sequence | MKVITTHTNT DFDGLAAMVA AQKLHPDAAL VLPGKMAQNV EEFVALHVDI LEIRKPSEIR FSDIDFLIVV DTMQPGRLGR LAEILKNPRL RVHVYDHHPR VEGDLKGELE IVEPVGATTT LLVEEIKRRG VPITPYEATV LALGIYADTG CLIFSSTTDR DAEAIAFLLA QGANLGVVAN FLGRPLTNEQ KKLLRDLLIS AERHYVNGAR ILVARTRVDE FVGGLALLSH KLAELEQLDA VFCVVAMEDR THLVGRSTLP EVNVRDILAH FGGGGHHAAA SATIKNTDVE AIAAELLQVV REMVVPPLLA GDIMTSPVKS VSPEITVSEA NRIMLRYGHR GMPVVSDGSL VGVISRRDVE KALRHNLGHA PVKAYMSKNV MTVSRDTPVT EVQAVMIENN IGRLPVVDNG YLVGIVSRTD ILKTLHPQFK PRFSTLYVKT RVPSYYRDAA ELIRRNLKPE QVDLLTVAGE VGSATGCAVY LTGEMVRDVF LGSPGGNPEL VVEGNGPAFA EALTRKVGGR LHSDDRSEYT IAHNGRKITI VALRTGFSEH GSELPNDKTS ALRHELYRRD FTVNALAVGL NPDRFGEVID YFGGRDDLHH GVVRALHSHS FEEDPVRLLQ AVQLEQRFGF NIERETLKLI REAVRDRLLS QAPPDRIWAE FRRLLKEPRV PATLARLAQL NLWPSLFPDI LYWEVQPMIS GIPKALDSLG SWKVPQPAEP WLCYFIAILH WKNLVVVDEL CRSYRLRKSQ TDKILYTIAG WRSAVSRLSA PKPPAVRTAA LSLVNLPREG YPIVLLMLEK KAWKERFREA LVTLYEHKPV ITGKDIRNLG YREQTGMKRI REAVWRAKLR GEATTREEEM RLAREAMNWE GER
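The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched as follows. This is a minimal illustration, not the database's own method: it assumes the gene sequence is shown 5' to 3' on the coding strand (it begins with ATG even though the gene lies on the minus strand), and it relies on translation table 11 assigning codons to amino acids in the same way as the standard code. The helper names are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; table 11 shares these assignments with the
# standard code and differs only in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last blocks of the gene sequence shown above.
print(translate("ATGAAAGTGA TCACAACCCA TACCAACACC"))  # MKVITTHTNT
print(translate("GGGGAACGCT GA"))                     # GER
```

Passing the full gene sequence to `translate` and `gc_percent` gives values that can be compared against the Protein sequence and GC content fields.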
|
| |