Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_1635 |
Symbol | |
ID | 4242324 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 2495998 |
End bp | 2496975 |
Gene Length | 978 bp |
Protein Length | 325 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 638106774 |
Product | putative DNA-binding/iron metalloprotein/AP endonuclease |
Protein accession | YP_721384 |
Protein GI | 113475323 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.417453 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAACGG TTTTAGCTAT AGAAACAAGT TGTGATGAAA CATCTGTGGC AATTGTTAAA AATCGTCAAG TTTTAAGTAA CATTGTGAAG TCACAAATTA ATATCCATAG CTTTTATGGA GGAGTAGTTC CAGAGGTAGC TTCACGACAA CATTTAGAAA TAATTAATCA GGCGATCGCT CAAGCTTTCA GAGAAGCAAA TTTAGACTGG CCAGACATTG ATGGTATTGG AGCTACTTGC GCCCCTGGTC TAGTTGGCGC TCTGTTAGTT GGCCTGACGG CGGCCAAAAC CTTAGCTATT GTCCATGAAA AGCCCTTTGT GGGAGTCCAT CACTTAGAAG GTCATATTTA TGCAACCTAC CTAAGCCAAC CCGAGTTAGT ACCACCTTTT CTTTGTTTGT TAGTTTCTGG AGGTCATACC AGTTTGATTT ATGTCAAAAA TTGTGGGGAA TATGAAACTT TAGGGCAAAC AAGAGATGAT GCAGCAGGAG AAGCTTTTGA CAAAGTAGCT AGATTATTAA AACTTGGCTA TCCCGGTGGA CCTGTAATTG ATAAATTAGC TGCAGAGGGT AATAAATTAG CATTTTCTCT CCCTGAAGGG AAAATTTCTT TGCCTGGAGG AGGCTATCAT CCTTATGATT TTAGTTTCAG TGGACTCAAA ACTGCAGTAT TGCGATTAGT CCAGAAAATA GAACAAGAAG GTAATGAATT ATTATCAGGG TCTTCAGTAA CTAAAGACAT TGCAGCCAGC TTTCAAGAAA CTGTAGCAAA AGGGCTAACA AAAAGAGCTA TCGCTTGTGC ATTAGACTAT AGCTTAAACA CAATTGCTAT TGGCGGTGGT GTAGCAGCTA ATAGTAGTTT GCGACAGCAC TTAAGTAGAG CAGTAGAAAG TCATAACTTG CAAGTACTAT TTCCCTCACT CAGATTTTGT ACGGATAATG CAGCCATGAT AGGTTGTGTG CGGCTCGTTG GCCACTAA
|
Protein sequence | MTTVLAIETS CDETSVAIVK NRQVLSNIVK SQINIHSFYG GVVPEVASRQ HLEIINQAIA QAFREANLDW PDIDGIGATC APGLVGALLV GLTAAKTLAI VHEKPFVGVH HLEGHIYATY LSQPELVPPF LCLLVSGGHT SLIYVKNCGE YETLGQTRDD AAGEAFDKVA RLLKLGYPGG PVIDKLAAEG NKLAFSLPEG KISLPGGGYH PYDFSFSGLK TAVLRLVQKI EQEGNELLSG SSVTKDIAAS FQETVAKGLT KRAIACALDY SLNTIAIGGG VAANSSLRQH LSRAVESHNL QVLFPSLRFC TDNAAMIGCV RLVGH
|
| |