Gene Information |
Locus tag | TM1040_0589 |
Symbol | |
ID | 4078627 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 629035 |
End bp | 629910 |
Gene Length | 876 bp |
Protein Length | 291 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 638005886 |
Product | HAD family hydrolase |
Protein accession | YP_612584 |
Protein GI | 99080430 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0647] Predicted sugar phosphatases of the HAD superfamily |
TIGRFAM ID | [TIGR01459] HAD-superfamily class IIA hydrolase, TIGR01459; [TIGR01460] Haloacid Dehalogenase Superfamily Class (subfamily) IIA |
|
|
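The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length counts the stop codon, which yields no amino acid; neither convention is stated in the record. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record (Start bp, End bp).
length_bp = gene_length(629035, 629910)
print(length_bp)                  # 876
print(protein_length(length_bp))  # 291
```

Under these assumptions both results agree with the Gene Length (876 bp) and Protein Length (291 aa) fields.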
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00195304 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCCAGA TCATCTCCGC CCTTTCCGAG GTTTCAGACC GATACAAGGC TCTGTTCGTG GATCTCTGGG GCTGCGTTCA CAACGGGATC ACGGCCTACC CGGACGCTGT CGCGGCCCTT CAGGCCTACC GCAAATCCGG CGGAGTGGTG GTACTGGTCA CCAACTCGCC CAAACCCCGC GCAGGGGTGG CAGAGCAACT GAGCCAGTTC GGCGTCCCAG ATGACGCCTA TGACACCATC GCTACCTCTG GCGATTCCGC GCGCGCGGCG ATGTTCACAG GTGCTGTCGG TGAAAAGGTC TACTTCATGG GCGAATGGGA GCGCGATGCC GGCTTTTTTG AGCCAATGAA GGTTATCCAC GAGCCCATTG AGATCACCCG CGTCCCGCTT AAGGAAGCCG AAGGGATCGT CTGCTGCGGT CCCTTCGACA CCTTGGCCGA CCCGGAGGTG AACCGCGCCG ATTTTCTCTA TGCCAAACAG ATGGGTATGA AGCTTCTTTG CGCGAATCCG GATATTATCG TCGACCGTGG CGAGGTACGC GAATGGTGCG CTGGGGCCTT GGCCAAACTT TACACCGAAA TGGGGGGCGA GAGCCTCTAT TTCGGCAAGC CGCATCCGCC GATCTACGAT CTCGCGCGTC GTCGCCTGAC GGAAATTGGC CACGATGTTT CGGACCGCGA CATTCTCGCA ATTGGCGACG GGCCGCACAC CGACATCTCA GGCGGCATGG GTGAAGGGGT GGACACGCTG TTCATCACCG GCGGCTTGGC CGCAAAAGAC ACCCAAACGG CTCATCAGCC CGAGCCTGCT GCACTGGAGG CGTATCTCGC GCAGGAACAG ATCGCGCCCA CCTACAGCAT CGGCTTTCTG CGCTAA
|
Protein sequence | MSQIISALSE VSDRYKALFV DLWGCVHNGI TAYPDAVAAL QAYRKSGGVV VLVTNSPKPR AGVAEQLSQF GVPDDAYDTI ATSGDSARAA MFTGAVGEKV YFMGEWERDA GFFEPMKVIH EPIEITRVPL KEAEGIVCCG PFDTLADPEV NRADFLYAKQ MGMKLLCANP DIIVDRGEVR EWCAGALAKL YTEMGGESLY FGKPHPPIYD LARRRLTEIG HDVSDRDILA IGDGPHTDIS GGMGEGVDTL FITGGLAAKD TQTAHQPEPA ALEAYLAQEQ IAPTYSIGFL R
|
| |
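The sequence fields can be checked against the Protein sequence and GC content fields with a short script. This is a minimal sketch, not the database's own method: it builds the standard codon table in NCBI order (translation table 11 uses the same amino-acid assignments as the standard code and differs in its permitted start codons), and it assumes the gene sequence is already given in the coding orientation, as its leading ATG suggests. The helper names `clean`, `translate`, and `gc_content` are hypothetical. Only the first 30 bases of the gene are used as the example input.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence from the record.
fragment = "ATGAGCCAGA TCATCTCCGC CCTTTCCGAG"
print(translate(fragment))  # MSQIISALSE
```

The translated fragment matches the first ten residues of the Protein sequence field. Passing the full Gene sequence value to `translate` and `gc_content` would allow the same comparison against the complete protein and the reported 60% GC content.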