Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_1926 |
Symbol | |
ID | 4242675 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 2984383 |
End bp | 2985750 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 638107047 |
Product | aldehyde dehydrogenase |
Protein accession | YP_721654 |
Protein GI | 113475593 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.515635 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.086374 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGATCG CTACAGTTAA TCCAGCAACA GGAGAAGTCC TGAAAACTTT TGAACAAATT ACAGATACAC AAATAGAGGC TAAACTAGAG TTAGCAGAAA AAACTTTTCG TGCCTATTGT CAAACTTCTA TAACTCAACG TGGAGAATGG TTGTTAGCAG CAGCAGACAT TTTAGAAAAA AATGCTGAGA AATTTGGTAA GATAATGACT CTAGAGATGG GTAAAACTAT AAGTGGAGCG ATCGCCGAAG CTAAAAAATG TGCTCTAGTC TGTCGTTACT ATGCAGAAAA GGCTACTGAG TTTCTGGCTG ATGTTCCTGC ACAAACTGAT GCTAGTAAAT CATTTGTTCG TTATCAACCA ATTGGTCCAG TGCTAGCGGT TATGCCCTGG AATTTTCCTT TTTGGCAGGT TTTCCGTTTT GCAGCACCAG CTTTAATGGC AGGAAATGTG GGTTTATTGA AACACGCTTC TAATGTTCCT CAATGTGCTT TGGCTATTGA GGAAATTTTT CAAGAAGCAG GTTTCCCAGA AGGTGTATTT CAAACTCTTT TGATCAGCTC AGATAAAGTG TCTGGTATTA TGATGGATGA CCGGGTCAAA GCAGGAACTT TAACTGGCAG TGAACCTGCG GGTGCAAGTT TAGCGGCAAC AGCAGGTAGA GCTATTAAAA AAACGGTCTT GGAACTTGGG GGTAGTGACC CTTTTATAGT ATTAGAAAGT GCTGATTTAG AAACAGCAGT TACAACAGCA GTTACAGCTA GGATGCTAAA TAATGGTCAA TCTTGTATTG CAGCTAAACG TTTTATTTTG GCAGATGCGA TCGCTGATCA ATTTCAAGAG GGTTTGGTAG AAAAATTTGA GGCTTTAAAA GTAGGAGACC CTATGTTGCC AGATACTAAT ATTGGTCCTT TGGCAACTCC ATCTATTCTT GAAGAGTTAA ATGCTCAAGT GGAAGCTTCT GTGGAGAAAG GAGCGAAAAT TCTCACAGGT GGTCATCTTT TATCTGACCT TCCTGGAAAT TTTTACCCTC CGACAATTTT AGCTGAGATA CCAATAAGTT CTCCTGCTTA TCAGGAAGAA TTTTTTGGTC CGGTAGCTTT AGTCTTTCGC GTTGCTAATA TTGATGAAGC AATAAATTTG GCAAATAATA CACCTTTTGG TTTAGGTGCA AGTGCATGGA CTAAAGATAC GGGAGAGACG GAAAGATTAA TCTCAGAATT AGAGGCTGGT GCTGTTTTTA TCAACGGTTT AGTTAAGTCT GATCCACGTC TGCCTTTTGG TGGAATTAAA CGTTCTGGTT ATGGTCGGGA ACTGAGCAGG GAAGGAATTT TGGAATTTGT CAATATTAAG ACTGTTTGGG TTAAATAA
|
Protein sequence | MQIATVNPAT GEVLKTFEQI TDTQIEAKLE LAEKTFRAYC QTSITQRGEW LLAAADILEK NAEKFGKIMT LEMGKTISGA IAEAKKCALV CRYYAEKATE FLADVPAQTD ASKSFVRYQP IGPVLAVMPW NFPFWQVFRF AAPALMAGNV GLLKHASNVP QCALAIEEIF QEAGFPEGVF QTLLISSDKV SGIMMDDRVK AGTLTGSEPA GASLAATAGR AIKKTVLELG GSDPFIVLES ADLETAVTTA VTARMLNNGQ SCIAAKRFIL ADAIADQFQE GLVEKFEALK VGDPMLPDTN IGPLATPSIL EELNAQVEAS VEKGAKILTG GHLLSDLPGN FYPPTILAEI PISSPAYQEE FFGPVALVFR VANIDEAINL ANNTPFGLGA SAWTKDTGET ERLISELEAG AVFINGLVKS DPRLPFGGIK RSGYGRELSR EGILEFVNIK TVWVK
|
| |