Gene P9303_12251 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_12251 
Symbol 
ID4776371 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1065563 
End bp1067218 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content53% 
IMG OID640086734 
Productglucose-methanol-choline (GMC) oxidoreductase:NAD binding site 
Protein accessionYP_001017239 
Protein GI124022932 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATTCAGC ACCCTTATGA GGTGATCGTG ATCGGCTCTG GTGCTACTGG AGGGGTTGCT 
GCTCTAACCC TTGCAGAAGC TGGTGTACGT GTGCTCGTGG TAGAAGCTGG GCCAGATTTG
TCTGCTCAAA AAGCCCTGGG CTCAGAACCT GGAAACACCC TTAGACGTTT GGATGGTTTA
TGTAGCGGCA AGCATCGATC TCAGGCTCAA CATCCTGGCT ATTGGAAAGC GAATCCGTTG
CTCTACGCGA ATGAAAAGGA GAATCCCTAT ACCTATCCTT CTGAACACCC CTTCATCTGG
ACCCAGGGTC GTCAGGTGGG GGGGCGCAGC CTCACTTGGG GTGGAATCAC TCTTCGTCTT
TCAGATCAGG ACTTAAAGGC TTCGCGCAGA GATGGTTATG GGCCTGAATG GCCACTGCAA
TACAGCGAGT TAGCTCCTCA TTATTCCGCC CTAGAAGAGC GCTTGAAGGT TCATGGTCAT
GTGGATGGGT TGGAACAGTT ACCGGATGGC AACTACATCG CCCCATTACC GTTCACAGCT
AGTGAACAAC AGTTCGCTAG CGCCGTTGAT ACTGAACTTG GTTATCCAGT CATTCATTCA
CGAGGGTTCG GACCTCACCA GCCTTCAGTT GATGGACCTT GGCCTCGTTC AAGCAGTCCG
GGCAGCACCT TACAGATGGC ACTTGCCACA GGCAAAGTAG AGATCCTCAG CAACCACAAG
GCTGAGCGGT TGCTGATGCA TCCAGATCAT GAAGCAGCCC GAGGGGTGCT GGTGATTGAT
CAGCGCAACG GCAACCGACA AGAGCTCCAT GGTGAGCTTG TGGTGCTTTG TGCATCGACA
ATTCAGAGTC TTCGACTCCT GCTGAGTTCC GAAGTGAGCC ATCACAGCGC GGGGTTTACT
GACCCCTCGG GCAACCTCGG TTGCTATTTG ATGGACCACG TCTCCACCTG TCGTTTCTTT
GCTCTGCCAC GTAGCCAAGT GAAGCAGGTG TCTGAGACTG ACTCAACGGC GAATGTGCTT
TCTGGAGCTG GCAGTTTTTT TCTTCCTTTC GGTGCTTGCT TAGAGCCTAA AAATCAGTTG
AAGTTCTTGC GGGGTTATGG ACTCTGGGGA GGGATTGATC GCTTTGAACC CCCAGATTGG
TTGAAACGTA AACCAGACAC AGCTACAGGT TTCCTGATTG GGCATGGTGA AGTGTTGCCT
TCACCTCACA ACAAAGTGAC GCTGTCAAGC ACTTTGGATC GCTGGGGTGT TCCTGTGCCA
CATATCGATT GTCGATGGGG AGAGAACGAG CAAGCCATGG TTGATCACAT GCAAGACACG
ATCAAGACAG CGATCCAGTC AGCTGGGGGA ACAATGTTGC CGCTCAAGGA GCTGATTAAT
TTGATGTTTC TCGAACCCCT TCTCGATGGT GCGCTAGCTC TCAGTGAAAC GTCTCCACCG
CCGGGGTATT ACATCCATGA AGTTGGCGGT GCTGCGATGG GAGAACGTGA AGATTGCAGT
GTGGTGGATC GTTGGAACCG TCTTTGGCGA TGTCCAAATG TGCTTGTTGT GGATGGAGCG
TGTTGGCCAA CATCCGCCTG GCAGAGTCCC ACTCTGACAA TGATGGCGAT TACAAGAAGG
GCTTGTCTAC AAGCCCTTAA GCCTCGGCGT GGCTGA
 
Protein sequence
MIQHPYEVIV IGSGATGGVA ALTLAEAGVR VLVVEAGPDL SAQKALGSEP GNTLRRLDGL 
CSGKHRSQAQ HPGYWKANPL LYANEKENPY TYPSEHPFIW TQGRQVGGRS LTWGGITLRL
SDQDLKASRR DGYGPEWPLQ YSELAPHYSA LEERLKVHGH VDGLEQLPDG NYIAPLPFTA
SEQQFASAVD TELGYPVIHS RGFGPHQPSV DGPWPRSSSP GSTLQMALAT GKVEILSNHK
AERLLMHPDH EAARGVLVID QRNGNRQELH GELVVLCAST IQSLRLLLSS EVSHHSAGFT
DPSGNLGCYL MDHVSTCRFF ALPRSQVKQV SETDSTANVL SGAGSFFLPF GACLEPKNQL
KFLRGYGLWG GIDRFEPPDW LKRKPDTATG FLIGHGEVLP SPHNKVTLSS TLDRWGVPVP
HIDCRWGENE QAMVDHMQDT IKTAIQSAGG TMLPLKELIN LMFLEPLLDG ALALSETSPP
PGYYIHEVGG AAMGEREDCS VVDRWNRLWR CPNVLVVDGA CWPTSAWQSP TLTMMAITRR
ACLQALKPRR G