Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_0275 |
Symbol | |
ID | 4241642 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 425178 |
End bp | 426698 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 638105615 |
Product | glucose-methanol-choline oxidoreductase |
Protein accession | YP_720230 |
Protein GI | 113474169 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0236777 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTATTG ACGACCAACA TTATGATGTA ATTATTGTCG GTACAGGAGC AGGCGGTGGT ACTTTAGCAT ATAAACTTGC TGCCACTGGC AAAAAAATTC TGATTCTGGA AAGAGGTGAT TTTATGCCTC TAGAAATCCA AAATCGAGTT AATGTTGATA TCTTTAAGAA AGAACTTTAC CGTGCCCATG AAAATTGGTA TAATGATGCA GGGGAAGCAT TTTCTCCTCA GACAAATTAT GCTGTTGGTG GTAATACAAA AATTTATGGA GCAGCCCTAA TTCGGATGCG GGAAAAAGAC TTTGAGGCAG TGGAACATCA AGAAGGAATT TCTCCAGAGT GGTGTCTCAA ATACAAGGAC TTTGAACCTT ATTACACTGA AGCAGAACAA CTCTATAAAG TTCATGGTCA CTCAAGCAAT AATTCTAATG AACCTCATCA TAGTCAAGAA TATCCTTATC CCGCAGTTGA CCACGACCCA CAAATTCAAA CAGTTGTCAA CGCTATCGCC GGCCAAGGTC TACATCCAGA AAACATACCA TTAACTCTGA CTCGGGAACA GGAAGACCCC ACAGGAGACT CAGAAGTATT TGGTATTGCA CCACTGCTCA ATTCTCCTAA CATTACTATT AAAACTAATG CTAAAGTAGT CTGTTTACTT ACCAACTCTT CTGGTAATAC TGTCAAAGCA GTAGAAGTTG AAATTGACGA ACGATCATTT CTATTTTTTG GGGATATTGT TGTAGTTGCT TGTGGAGCTG TAAATTCAGC AGCATTGTTA TTACGTTCTG CTAACGAAAA ACATCCTCAT GGTTTAGCAA ACAGTTCTGG ACAAGTTGGG CGCAACCTGA TGAAACATCA AATGACTGCT GTAGTGCAAC CTAGTCCAAA ACCTAATTCT GGCAACTTTC TCAGAAGCGT TTGTGTAAAT GACTTTTATT GGGGAGATGA AAATTATTCT TATCCCATGG GTCACATTCA GAATACAGGT GGGTTACTTC AAGATATTAC TTTTGCTGAA TCTCCACCTA TCTTGTCTAT TTTAGCAAAA GCAATGCCTG AGGTGGGTTT GAAGCGGTTA GCATTACGTT CTGTTGGTTG GTGGACATAT TCGGAGGTAT TACCAGATCC TAATAACCGC ATTGAAGTTA AGGGAGACAA ACTTTTCTTC CATTACACTC CCAATAACTT GGAAGCACAC GATCGCCTAG TTCATCGTTG GATAGAGGTA TTAAAGTCAG TAGACAAAGC GGTTGGAAAT TCTTTCTTTG CAAGATCAGG AGGTAATGTT TACCCTCGTG GAGAAGCTCC TATCGAAGTA GTTGCCAACC AGTGCGGTAC TTGTAAGTTT GGGGAAGATC CAGCTACATC AGTTTTAGAT ATTAACTGTC GTACCCACGA TGTAGATAAT TTATATGTTG TAGATAGCAG TTTTTTCCCA TCAAGTTCTG CTATAACTCC TGCTTTAACA ATTATTGCTA ATGCTTTACG AGTAGGGGAA CATTTAAAGG AGCGTCTATA G
|
Protein sequence | MIIDDQHYDV IIVGTGAGGG TLAYKLAATG KKILILERGD FMPLEIQNRV NVDIFKKELY RAHENWYNDA GEAFSPQTNY AVGGNTKIYG AALIRMREKD FEAVEHQEGI SPEWCLKYKD FEPYYTEAEQ LYKVHGHSSN NSNEPHHSQE YPYPAVDHDP QIQTVVNAIA GQGLHPENIP LTLTREQEDP TGDSEVFGIA PLLNSPNITI KTNAKVVCLL TNSSGNTVKA VEVEIDERSF LFFGDIVVVA CGAVNSAALL LRSANEKHPH GLANSSGQVG RNLMKHQMTA VVQPSPKPNS GNFLRSVCVN DFYWGDENYS YPMGHIQNTG GLLQDITFAE SPPILSILAK AMPEVGLKRL ALRSVGWWTY SEVLPDPNNR IEVKGDKLFF HYTPNNLEAH DRLVHRWIEV LKSVDKAVGN SFFARSGGNV YPRGEAPIEV VANQCGTCKF GEDPATSVLD INCRTHDVDN LYVVDSSFFP SSSAITPALT IIANALRVGE HLKERL
|
| |