Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_07041 |
Symbol | |
ID | 4717407 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 627727 |
End bp | 629367 |
Gene Length | 1641 bp |
Protein Length | 546 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640078417 |
Product | glucose-methanol-choline (GMC) oxidoreductase:NAD binding site |
Protein accession | YP_001009097 |
Protein GI | 123968239 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGATATAA GTCCTTATGA TGCAATTGTT GTTGGTTCTG GAGCTACAGG AGGAATAGCA GCACTTACAT TGGCAGAACA AGGAATCAAA GTTTTAGTAA TAGAAGCAGG GCCTCAAGTT AAAAGGCATG AAGCTAGTAA TGATGAGCCA AAAAGTACAT TCAAAAGATT ATCAGGAGTT TTAACAAAAA AACATGCCAA TCAATGCCAA CATCCTGGTT ATTGGAAAAA TAATCCTGAC TTATATTCAA ATGAATTGAA GCATCCTTAT GACTTCCCAA CAAAAAAGCC ATTTCTTTGG ACCCAAGGTA AACAATATGG GGGGAGATCA TTAACTTGGG GAGGCATAAC ATTAAGACTT TCCTCAGAAG ACTTTCATCC TGCTAAAAAA GACGGATTCG GACCAAACTG GCCTATTTCA TACGATGAAC TATCCCCTCA CTATGATTTC ATTGAAAATT TTTGCGGCAT CTATGGACGA AAAGATGACA TTAAAGAAGT CCCAAACGGT AAATATATTG GAGAAATACC TCTTACAGAA AACGAAAATG TTTTTGGTAA CAAAGTAAAA TCAAAATTAA ACTATCCATT TATCCAATCA AGAGGATTTG ACCGTAATTC ATCAGTAAAA GAAAAAAAAT GGCCAAAGTC CTCTAGCTTA GGAAGCTCTT TAAAAAAAGC TTTAGATACT GGAAATGTAC AAATAATCTC TAATTACCTA GTGGAGTCTT TTGAGATTAA CAAGGCAACA GAGCTTGCCT CAAAACTAAC GATCGTAAAC CTAGAAAATG GACAAAAAGA AGTCTTGAAT TGTGATTTAA TTTTTCTCTG CGCGTCAACA ATTTCAACAC TCAGAATACT ACTAAACTCA GAATATAAAA CAAATTCCTC AGGGTTTAAA GATAATTCTG GGAAATTAGG CAAATACCTC ATGGATCACA TATCTATCTG TAGATTTTTT TCAGTCCCAA AAACAAAAAA CTCAGATAAA CCAGTAGATA ATCCACCCGA TCTTTCTGGA GCAGGCAGCT TCTTTATTCC ATTTGGTTCA AATTTACCAG AAATTGACGA CATAAATTTC CATAGAGGTT ATGGAATCTG GGGGGCAATT GATCGATTAG GGATTCCTAA ATTTTTGCAA AAAGACACAA ACACATCCAT TGGCTTTCTT ATCGCCCATG GCGAAGTCCT TCCTAGAGAG AAAAACTCAG TTTCTCTCTC ACGAAAAACA GATGAATGGG GTATCCCAAT TCCCTTCATT GAATTCGAAT GGAGCAAAAA TGAATTAAAT ATGGCTAAAC ATATGGAAAA CACAATACGT AAATCAATCA CAGCTGCTAA TGGAGAAATA AAAAATATTA ATGAACTAAT TAATATCCCA TTAGGGAGTC TATTTACAAA AAATTTGATC GCACTTTCAG ATAGTCCTCC TCCTCCTGGA TATTACATTC ATGAAGTAGG GGGGGCACCA ATGGGGATAG ATGAAGAAAA TAGCGTAGTT GATAAATTTA ATAGATTATG GAGATGCAAG AATGTACTTG TATTAGATGG AGCATGCTGG CCCACATCAT CTTGGCAAAG CCCTACACTT ACGATGATGG CCTTGAGTAG AAGAGCCTGC TTAAATATTA AAAAGACTTA G
|
Protein sequence | MDISPYDAIV VGSGATGGIA ALTLAEQGIK VLVIEAGPQV KRHEASNDEP KSTFKRLSGV LTKKHANQCQ HPGYWKNNPD LYSNELKHPY DFPTKKPFLW TQGKQYGGRS LTWGGITLRL SSEDFHPAKK DGFGPNWPIS YDELSPHYDF IENFCGIYGR KDDIKEVPNG KYIGEIPLTE NENVFGNKVK SKLNYPFIQS RGFDRNSSVK EKKWPKSSSL GSSLKKALDT GNVQIISNYL VESFEINKAT ELASKLTIVN LENGQKEVLN CDLIFLCAST ISTLRILLNS EYKTNSSGFK DNSGKLGKYL MDHISICRFF SVPKTKNSDK PVDNPPDLSG AGSFFIPFGS NLPEIDDINF HRGYGIWGAI DRLGIPKFLQ KDTNTSIGFL IAHGEVLPRE KNSVSLSRKT DEWGIPIPFI EFEWSKNELN MAKHMENTIR KSITAANGEI KNINELINIP LGSLFTKNLI ALSDSPPPPG YYIHEVGGAP MGIDEENSVV DKFNRLWRCK NVLVLDGACW PTSSWQSPTL TMMALSRRAC LNIKKT
|
| |