Gene P9515_07141 details


Gene Information

Locus tag: P9515_07141
Symbol:
ID: 4719530
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9515
Kingdom: Bacteria
Replicon accession: NC_008817
Strand:
Start bp: 648917
End bp: 650557
Gene Length: 1641 bp
Protein Length: 546 aa
Translation table: 11
GC content: 35%
IMG OID: 640080392
Product: glucose-methanol-choline (GMC) oxidoreductase:NAD binding site
Protein accession: YP_001011030
Protein GI: 123965949
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2303] Choline dehydrogenase and related flavoproteins
TIGRFAM ID:
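
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 648917
end_bp = 650557
gene_length_bp = 1641
protein_length_aa = 546

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end_bp - start_bp + 1

# A CDS is a whole number of codons; the last codon is assumed to be the stop.
codons, remainder = divmod(span, 3)
coded_residues = codons - 1

print(span, codons, remainder, coded_residues)  # 1641 547 0 546
```

Under these assumptions the span matches the reported gene length, and the codon count minus the stop codon matches the reported protein length.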


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.915082
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGATATAA ATCCTTATGA TGCAATCGTA GTAGGTTCAG GTGCCACAGG AGGAGTAGCT 
GCACTTACTT TAGCCGAACA AGGAATAAGA GTGCTAGTAA TAGAAGCTGG TCCTCAAATT
AAAAGAACTG AGGCTAGCAG TAATGAGCCA AAAGATACCT TAAACAGATT ATCAGGAATA
ATATCAAAAA AACACGCTAA TCAATGTCAG CATCCCGGTT ATTGGAAAAA TAATCCTAAT
TTATATAAAA ACGAATTAAA ACATCCTTAT GTTCAACAAA AAAACAAGCC ATTCCTTTGG
ACTCAAGGAA ACCAATATGG AGGAAGATCA TTAACGTGGG GAGGTATTAC CTTAAGATTT
TCTAGAGAGG ATTTTCATCC ATCAAAAAAA GATGGATATG GGCCAGATTG GCCTATTTCC
TACGACGAAT TATCACCTCA TTATGATTTT ATAGAAAATT TTTGTGGAAT ATATGGACAT
AAAGATAACA TCAAAGAAGT CCCAAATGGT AAATATATTG GGAAGATACC TCTTACAAAA
ATTGAAAGTA TTTTTGGCAA TCAAGTCAAA TCGAAATTAA ACTATCCCTT TATTCAATCT
AGAGGATTCG ATCGTAATTC TTCAGTAAAA GAGGAACAAT GGCCAAAATC TTCGAGTGTA
GGGACCACAT TCAAAAAAGC ACTAGAGACT GGAAATGTCC AAATACTCTC TAATCACTTA
GTAGAATCAT TCGAAACGGA TAAAATAACA GAGCTTGCTT CAAAAATAAT CATCGTAAAT
GTTGAAAATG GTAAAAGAAA ATCATTAAAT TGCGATTTAA TTATTCTATG TGCATCAACA
ATCTCAACAT TGAGGATACT ACTTAACTCT GAAACCAAAT CAAATTCTTC TGGTTTTAAA
GATACATCTG GAAAATTAGG AAAATTTCTA ATGGATCATA TTTCTGTCTG TAGGTTTTTT
TCAGTTCCAA ATACAACTCA AAGAAACAAT ATATCTAATT CGTATCCTGA TCTTTCCGGA
GCTGGGAGTT TTTTCATACC CTTTGGGACA AATCTGCCCA AACCTGAAAG TATTAATTTT
TTACGGGGTT ATGGAATTTG GGGAGCAATT GACCGTCTAG GAATACCAAA GTTTTTGCAA
AAAGACTTAA ATTCTTCTAC AGGTTTTCTA ATCGCTCATG GTGAGGTCCT ACCAAGAGAA
GAAAATTCAG TCTCCCTTTC TGACAAAACA GACCGATGGG GGATTCCGGT CCCTCATATT
GAATTCGAAT GGAGTGAAAA TGAATTAAAT ATGGCTAAAC ATATGGAGAG AACGATGCGA
GATTCAATAA AAGCTGCAGA TGGAAAAATC AAAGGGATCG ATGAACTTAT AAAAATCCCA
TATGCAGGGT TGTTTACAAA AAAATCAATT GCTCTTTCAG GAAATCCGCC ACCCCCGGGA
TACTATATCC ATGAAGTAGG AGGAGCACCA ATGGGATTTA GAGAGGAAGA TAGTGTTGTA
AATAAATCAA ATAGACTTTG GAGATGTAAG AATGTTCTTG TATTAGATGG TGCATGTTGG
CCCACTTCTT CATGGCAGAG TCCAACTTTA ACAATGATGG CTATAAGCAG AAGAGCTTGT
TTAAAAGTTA AAAAGACTTA G
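
The GC content field of this record can be recomputed from the gene sequence. The following is a rough sketch, assuming the sequence is pasted as it appears here, in space-separated blocks of ten bases; `gc_fraction` is a hypothetical helper name, not part of any database tool. Only the first sequence line is used as input, so the printed value describes that 60-base window and not the whole gene.

```python
def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First line of the gene sequence in this record (60 bases).
first_line = (
    "TTGGATATAA ATCCTTATGA TGCAATCGTA GTAGGTTCAG GTGCCACAGG AGGAGTAGCT"
)
print(f"{gc_fraction(first_line):.3f}")
```

Passing the full 1641-base sequence instead of the first line gives the value to compare with the reported GC content.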
 
Protein sequence
MDINPYDAIV VGSGATGGVA ALTLAEQGIR VLVIEAGPQI KRTEASSNEP KDTLNRLSGI 
ISKKHANQCQ HPGYWKNNPN LYKNELKHPY VQQKNKPFLW TQGNQYGGRS LTWGGITLRF
SREDFHPSKK DGYGPDWPIS YDELSPHYDF IENFCGIYGH KDNIKEVPNG KYIGKIPLTK
IESIFGNQVK SKLNYPFIQS RGFDRNSSVK EEQWPKSSSV GTTFKKALET GNVQILSNHL
VESFETDKIT ELASKIIIVN VENGKRKSLN CDLIILCAST ISTLRILLNS ETKSNSSGFK
DTSGKLGKFL MDHISVCRFF SVPNTTQRNN ISNSYPDLSG AGSFFIPFGT NLPKPESINF
LRGYGIWGAI DRLGIPKFLQ KDLNSSTGFL IAHGEVLPRE ENSVSLSDKT DRWGIPVPHI
EFEWSENELN MAKHMERTMR DSIKAADGKI KGIDELIKIP YAGLFTKKSI ALSGNPPPPG
YYIHEVGGAP MGFREEDSVV NKSNRLWRCK NVLVLDGACW PTSSWQSPTL TMMAISRRAC
LKVKKT
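
The protein sequence follows from the gene sequence under translation table 11, where the gene's TTG start codon is read as methionine. The following is a minimal sketch of that translation, assuming the standard codon assignments in NCBI's T, C, A, G ordering. The set of alternative start codons is written from memory as an assumption and should be checked against the NCBI genetic code reference; `translate_cds` is a hypothetical helper name.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: initiator codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str, is_start: bool = True) -> str:
    """Translate a DNA fragment codon by codon, ignoring whitespace.

    If is_start is true and the first codon is an allowed initiator,
    it is translated as M regardless of its usual meaning.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    if is_start and codons and codons[0] in START_CODONS:
        residues[0] = "M"
    return "".join(residues)


# First 30 bases and last 21 bases of the gene sequence in this record.
print(translate_cds("TTGGATATAA ATCCTTATGA TGCAATCGTA"))          # MDINPYDAIV
print(translate_cds("TTAAAAGTTA AAAAGACTTA G", is_start=False))   # LKVKKT*
```

The two outputs agree with the first ten and last six residues of the protein sequence, with the trailing `*` marking the stop codon that the protein listing omits.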