Gene Information |
Locus tag | Tery_4570 |
Symbol | |
ID | 4246224 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 7030421 |
End bp | 7031815 |
Gene Length | 1395 bp |
Protein Length | 464 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 638109443 |
Product | cytosine deaminase-like protein |
Protein accession | YP_724019 |
Protein GI | 113477958 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
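The coordinates and lengths in the table above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
# Values copied from the Gene Information table.
START_BP = 7030421
END_BP = 7031815
GENE_LENGTH_BP = 1395
PROTEIN_LENGTH_AA = 464


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print("span length (bp):", span_length(START_BP, END_BP))
    print("expected protein length (aa):", expected_protein_length(GENE_LENGTH_BP))
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields reported for Tery_4570.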
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.882556 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCCGTC AAAAGTTAAT ATTAAAAGGT CAAAAGTCAA AATTCTCTAT ATATTTTATG AACCATCAAC AATTTTCAGT CATGCCTGAG ACTTCTAACT ATTGGCTAAA AAATGCTCAT GTACCTGCAT GTCTCATAGA AAGAGAATTA GAGATATCAG ATCAAACAAG AGTTGGTTTG TCTTTAGTAG ATATAGAAAT TAAAGAGGGA GTCATAACAC AAATTGTCCA CTCGTTAGCC GACTTGCCTT CTCTGAACAA TGGGGACATT CCTAGAGTTG ACCTCAAAGG TGGTATGGTG TGGCCATGTT TTGTAGATAT GCATACCCAT TTAGATAAAG GCCATATTTG GGAGCGATCG CCTAACCCAG ATGGTAGTTT TAATGGCGCT TTAGATGCAG GTGCTAGGGA TGGGGAAAAA TATTGGAATG CTGAAGACAT TTACCGTCGA ATGGAATTTG GCCTGAAATG CAGTTATGCT CATGGCACAA AAGCTATTCG GACTCATCTT GACTGTTTTG CACAACAGGC AAATATTAGT TTGGAAGTAT TTCAAACATT ACAAAAAAAG TGGGAAAGTA GATTAATATT ACAACCTGTT TCTTTGGTTA CTGGAGATTA TTATCTTACT GAAGCTGGAG AAAAATTAGC CGATCAAATT GCTGATATTG GTGGGATATT AGGAGGGGTT CCATTTATAA ATTCAGACTT AGATAACCAA TTAGATAGAT TATTTAGTAT AGCCAAAGAA CGTCAATTAG ATTTAGACTT ACATACTGAT GAAAATGGAG ACCCTAATTC CCGGGTTTTA CACAAAGTTG CTGAGAAAGC AATTAGTCAT CAATTTTATG GCAAAATTAT CTGCGGACAT TGTTGTAGTT TAGCTGTACA AACCCCTGAA AAAGTAACAG AAACTATTAA GTTAGTTAAG GAAGCTGGTA TTAGTATTGT TAGTCTGCCA ATGTGTAATC TTTATCTCCA AAATCGCCAA GAAAATTATA CTCCTAGATG GCGGGGTGTG ACTTTATTAC ATGAATTAAA AAAACAGGGA ATTTCAGTTG CTATTGCCAG TGATAATTGT CGCGACCCGT TTTTTGGATT TGGTGACCAT GATGTTTTAG AAGTCTTTAA TATGTCAGTG AGAATTGCTC ATTTAGATAT GCCTTATGGG GATTGGCCTT GTGCTGTTAC TAAGACTCCT GGAGATTTAA TTGGTTTGCC AAATGTGGGA ATAATTAAAG TTGGATTACC CGCAGATTTA ATTCTATTTA AAGGTCGGAA TTTTAGTGAA CTTTTATCAC GGTCTCAGCA TGATAGAGTA GTATTAAGAA ATGGTCTAGT TATTGATACA ACTTTACCTG ATTATTCTGA ATTAGATAGC TTGTTTTACG GATAG
|
Protein sequence | MGRQKLILKG QKSKFSIYFM NHQQFSVMPE TSNYWLKNAH VPACLIEREL EISDQTRVGL SLVDIEIKEG VITQIVHSLA DLPSLNNGDI PRVDLKGGMV WPCFVDMHTH LDKGHIWERS PNPDGSFNGA LDAGARDGEK YWNAEDIYRR MEFGLKCSYA HGTKAIRTHL DCFAQQANIS LEVFQTLQKK WESRLILQPV SLVTGDYYLT EAGEKLADQI ADIGGILGGV PFINSDLDNQ LDRLFSIAKE RQLDLDLHTD ENGDPNSRVL HKVAEKAISH QFYGKIICGH CCSLAVQTPE KVTETIKLVK EAGISIVSLP MCNLYLQNRQ ENYTPRWRGV TLLHELKKQG ISVAIASDNC RDPFFGFGDH DVLEVFNMSV RIAHLDMPYG DWPCAVTKTP GDLIGLPNVG IIKVGLPADL ILFKGRNFSE LLSRSQHDRV VLRNGLVIDT TLPDYSELDS LFYG
|
| |
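The gene and protein sequences above can be related to each other with a short script. This is a sketch only, not the database's own pipeline: it assumes the gene sequence is already given on the coding strand (it begins with ATG even though the gene lies on the minus strand of the replicon), and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code, differing only in which alternative start codons are permitted. The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); shared by tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that separate the 10-base groups and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon.

    The first codon is translated as-is; an alternative start codon such as
    GTG or TTG would need special handling to be reported as M.
    """
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First three 10-base groups of the gene sequence above.
    gene_start = "ATGGGCCGTC AAAAGTTAAT ATTAAAAGGT"
    print(translate(gene_start))
    print(round(gc_content(gene_start), 2))
```

Pasting the full gene sequence into `translate` should reproduce the listed protein sequence, and `gc_content` gives a value to compare against the reported GC content field.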