Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_0604 |
Symbol | |
ID | 5538067 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 804144 |
End bp | 805694 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640892765 |
Product | Dak phosphatase |
Protein accession | YP_001430751 |
Protein GI | 156740622 |
COG category | [R] General function prediction only |
COG ID | [COG1461] Predicted kinase related to dihydroxyacetone kinase |
TIGRFAM ID | [TIGR03599] DAK2 domain fusion protein YloV |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00036597 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.583532 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGGCG CGTGGAATGG CGAGCATCTG CTGGAAGCGT TACGCGCGGC GTCGCGCGAT CTGGAGCGTC ACGCCGCCTC GCTCAATGCA TTGAATGTGT TCCCTGTTCC CGACGGCGAT ACCGGGACGA ACATGGCGCT GACATTGAGC AGCGCATTGC GCGACATTAC GCCGCATCCT TCGTGCGGCA CAGTGGCGGA GCAGGTTGGC TACTGGGCGA CGATGCGCGG GCGCGGCAAC TCAGGCATCA TTCTGTCGCA AATCCTGCGT GGTGTTGCCG CGGCGCTTGC CGGACATCAC CTGATGAGCG GGCGCGAAAT GGCGGTGGCC CTGACGCATG GCAGCACGCG CGCCTACGAA GCCGTGTTGC GTCCGGTCGA AGGAACGATG TTGACGGTCA TCCGCTGCGC CGGTGAGGCG GCGCAGCGCG CCATCGCCGC AGGTGAAGCA TCGTTGAGCG CCGTGCTCGA GGCAGCCGTG CGCGAAGCGC GCGCCGCCGT GGCGCGCACG CCGCAGTTGC TGGCGACCCT GCGCGACGCA GGCGTGGTCG ATGCGGGCGG GCAGGGTTTG CTGGTCCTGC TCGAAGCGCT GCTGCGCTAT GCCCGCGGCG AAGCCAGTGA TTCGCATGCC CCAACCGTGA CACCCACCGC AACCGTTGAT GATCATGCCG AGAGCGCAGG GTACTGCACC AGTTTTGTCA TCCATCACGC AACCGCACCA CCGGAAACGC TCCGACGAGT CTTTGCGGCG CTCGGCGAAT CGCTGGTGAT CGCCGGAGAT CGCGCGCTAG TCAAAATACA TCTTCACACT CCACGACCGG GCGACGCGCT CAATCAGGCG TTAGCGTATG GCATTCTCGA TCAGATCGAA GTCGTGAACA TGGATCTGCA ACGCATGGCG CACCATTCGG GTGCGGCGCT TTCCGACACT CAACCGGATA CACCGGCGAA CCCTGCGCCG GGAATCATTG CACTGGCGCC AGGCGCCGGA TACGCAGCCA TCCTGCGCGA CCTGCGCGCC GATCTGGTGT GGGAGACGAA TACGCCGCCG ACCATCGACG AGTGGCGCGC AGCCTTTGAG CGCCTGCCGC AGCAGGAGAT CATTGTGCTG CCCAATGATC CGCAGGCGGC GGAAACTGCG CAGGCAACCG CACCGTTGTT CGCCAGGCGC ATTGCTATCG TGCCGGCAAC CTCGCCGCCA CAGGGCATTG CCGCGCTGCT GGCGCTGAAC TTCCAGGCAG ACGTCGATCA GAACATTCGG GCAATGACAG CAGCAGCAGA ACGGGTGCGG GTTATCACCT TCGATGGACA GCGTCGCAAC GAGATGGAGA CGCCTGCAGA AGCGGTGCAA GATGCGTATA ATGTGTGCCA TACACTTCAG CAGATGGGCG CGAACGCTGC CGAGGTCGTC ACGCTCTACT ATGGACAGGC TGTTGACCAG ACGCATGCGG AGCGACTGGC GCAGGAGATT CGGGTTGCTT TCCCGATGCT GCACGTCGAA GTTCATGCTG GCGGTCAACC GGGCAGTGGC GTCGCCATTG CCCTCGAATA A
|
Protein sequence | MTGAWNGEHL LEALRAASRD LERHAASLNA LNVFPVPDGD TGTNMALTLS SALRDITPHP SCGTVAEQVG YWATMRGRGN SGIILSQILR GVAAALAGHH LMSGREMAVA LTHGSTRAYE AVLRPVEGTM LTVIRCAGEA AQRAIAAGEA SLSAVLEAAV REARAAVART PQLLATLRDA GVVDAGGQGL LVLLEALLRY ARGEASDSHA PTVTPTATVD DHAESAGYCT SFVIHHATAP PETLRRVFAA LGESLVIAGD RALVKIHLHT PRPGDALNQA LAYGILDQIE VVNMDLQRMA HHSGAALSDT QPDTPANPAP GIIALAPGAG YAAILRDLRA DLVWETNTPP TIDEWRAAFE RLPQQEIIVL PNDPQAAETA QATAPLFARR IAIVPATSPP QGIAALLALN FQADVDQNIR AMTAAAERVR VITFDGQRRN EMETPAEAVQ DAYNVCHTLQ QMGANAAEVV TLYYGQAVDQ THAERLAQEI RVAFPMLHVE VHAGGQPGSG VAIALE
|
| |