Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_4662 |
Symbol | |
ID | 4246316 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 7169309 |
End bp | 7170043 |
Gene Length | 735 bp |
Protein Length | 244 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 638109527 |
Product | HAD family hydrolase |
Protein accession | YP_724103 |
Protein GI | 113478042 |
COG category | [R] General function prediction only |
COG ID | [COG0546] Predicted phosphatases |
TIGRFAM ID | [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00548431 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00277532 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGTAGCTA TTAACTGTGG AGGTCTAATT TTCCATGATA TTCAAGCAAT TATTTTTGAT AAAGATGGTA CCCTAGAAGA CTCTCAAGAG TTTTTAAGAA ATTTAGGACA AAAGCGAGCT CGCCTTATTG ATGCTAAAAT TCCTGGGATA GGAGAACCAC TACTCATGGC TTTTGGGATT AATGACAATA ATATTAACTC TACAGGATTA ATGGCGGTTG GCAGTAGACG AGAAAACGAA ATTGCAGCAG CAGCTTATAT TGCTGAGACT GGTCGGGGTT GGTTAGAATC ATTAACTATT GCCAAAGAAG CTTTTACAGA AGCTGACCAA ATACTGCCAA AGTCTACTCC TGGTTCTCTG TTTAAGGGTA GCCAAGAAGT ATTAAAGTAT CTCTCTCAAG CAGGTATACA ACTGGGTATA TTGTCTGCAG ATAATACAAC AGCAGTGCAA AATTTTGTTA AATACTATCA ACTTAACGAT TATATCCAAC TAGAAATGGG GGTTGATGAT GGTCTTAGTA AACCAGATCC AGAACTATTT TTACAAGCTT GTCATAAATT AGGGTGTAAA CCAGTTAGGA CATTAATTGT GGGAGATTCT CCCACAGATA TAGAAATGGG GAGAAAAGCA GGAGCAGCAG GTTGTATTGG TATTTTTTTT GGGAAAAGCG AAGCTGAGCA TCTACAACTA GCAGATGTGG CGATCGCTAA GTTAGAAGAA ATTAAAGTGT TGTAA
|
Protein sequence | MVAINCGGLI FHDIQAIIFD KDGTLEDSQE FLRNLGQKRA RLIDAKIPGI GEPLLMAFGI NDNNINSTGL MAVGSRRENE IAAAAYIAET GRGWLESLTI AKEAFTEADQ ILPKSTPGSL FKGSQEVLKY LSQAGIQLGI LSADNTTAVQ NFVKYYQLND YIQLEMGVDD GLSKPDPELF LQACHKLGCK PVRTLIVGDS PTDIEMGRKA GAAGCIGIFF GKSEAEHLQL ADVAIAKLEE IKVL
|
| |