Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_4535 |
Symbol | |
ID | 4246189 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 6997007 |
End bp | 6998191 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 638109412 |
Product | aldo/keto reductase |
Protein accession | YP_723988 |
Protein GI | 113477927 |
COG category | [R] General function prediction only |
COG ID | [COG1453] Predicted oxidoreductases of the aldo/keto reductase family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0927734 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00892549 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGATGCAAT ATCGACGCTT TGGGCGCACA GAATTATCAA TACCAGTGTT TTCCTGTGGC GGAATGAGGT ATAAATATAA ATGGCAAGAT GTTCCTAAAA ATGAAATCCC ACTTAATAAT CAACAAAACT TAGAAAATAC AATCCGTCGC TCTCTAGAAT GTGGGATTAA CCACATAGAA ACGGCCCGTA ATTATGGCAC ATCTGAAATG CAACTGGGAG AAATTTTACC TCAACTACCA CGGGAAAAAT TGATTATCCA AACAAAAATT AGCCCAAGCG TTGACTCCCA AGAATTCAAA TCCAAGTTTG ATCAGTCTCT CCATTTTTTA CAACTAGAAT ATGTTGACTT GCTGGCCATA CATGGGATTA ACACGTTAGA ACGTCTTGAC TATTCTATTA GACCAGGAGG TTGTTTAGAT ATAGTTAGAA AATTACAAGA GCAGGGAAAA GTCAGGTTTG TTGGTTTCTC TACTCATGGG CCAACTGATG TAATAATTAA AACTATAGAA ACCAATCAAT TTGACTATGT TAACCTACAC TGGTACTACA TTAATCAGGA GAATTGGTCC GCAATAGAGG TTGCTAATAA GTTTGATCTG GGAGTATTTA TTATTAGTCC TTCTGATAAG GGTGGTAAAT TGTATCAACC GCCACAAAAA TTAATAGATT TGTGCTATCC ATTAAGCCCA ATGGTGTTTA ATAATCTATT TTGTTTGAGT CATCCCCAAG TTCATACATT GAGTTTGGGA GCTTCAAAAC CAACAGATTT TGATGAGCAC TTAAAAACAT TGGAATTTTT AGAGAAACCA GATGAGATAT TACAACCAAT ATTAAATAGT CTAGAAAAAG AAGCGATCGC TAAACTAGGA GAAAATTGGT ACCAAACTTG GCATATTGGT TTGCCTACTC CAGAAAATAC TCCAGGAAAT ATTAATATTC CTGTGATTTT ATGGTTAAGA AATTTAGCGA TCGCCTACGA TATGTGGGAA TATGCTAAAG TACGCTATAA CTTATTGGGC AATGGTAGTC ATTGGTTTCC TGGTGCAAAT GCTGAACAAG TAGAAAAATA TAACTTGAGT AAATTTCTTG TGAATAGTCC TCATGCTGAT AAAATTCCAG ATATTCTTCA AGATGCTCAT CAATTACTGG TAGGGACTCC AGTAGAACTT TTGTCTAGCA CTTAA
|
Protein sequence | MMQYRRFGRT ELSIPVFSCG GMRYKYKWQD VPKNEIPLNN QQNLENTIRR SLECGINHIE TARNYGTSEM QLGEILPQLP REKLIIQTKI SPSVDSQEFK SKFDQSLHFL QLEYVDLLAI HGINTLERLD YSIRPGGCLD IVRKLQEQGK VRFVGFSTHG PTDVIIKTIE TNQFDYVNLH WYYINQENWS AIEVANKFDL GVFIISPSDK GGKLYQPPQK LIDLCYPLSP MVFNNLFCLS HPQVHTLSLG ASKPTDFDEH LKTLEFLEKP DEILQPILNS LEKEAIAKLG ENWYQTWHIG LPTPENTPGN INIPVILWLR NLAIAYDMWE YAKVRYNLLG NGSHWFPGAN AEQVEKYNLS KFLVNSPHAD KIPDILQDAH QLLVGTPVEL LSST
|
| |