Gene P9211_07591 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_07591 
Symbol 
ID5730639 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp663866 
End bp665515 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content36% 
IMG OID641285122 
Productglucose-methanol-choline (GMC) oxidoreductase:NAD binding site 
Protein accessionYP_001550644 
Protein GI159903300 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00259994 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGACAAATA CTTACGAAGC AATTGTAATT GGATCAGGTG CCACTGGGGG GGTAGCTTCT 
CTTACCCTTG CGAAAGCAGG AGTAAGAGTT CTTTTAGTAG AGGCTGGGCC TCTTTTAACT
GCGAAACAAG CACTAGGAAC AGAGCCTTTA AATACATTCA AAAGATTATT ATCAATATTT
AATGGAAATC ATAGAAAACA AATTCAACAT CCTGGTTATT GGAAGGCAAA TCCTTCTTTA
TATATAAATG AGAAAGAGAA CCCATATATA TATCCAAAAG AAAAGCCATT TATTTGGACA
CAAGGAAGAC AAGTTGGAGG TAGAAGTCTT ACTTGGGGGG GGATAACATT AAGACTTTCA
AATAGTGATC TAAAAGCAGC AACTAAAGAT GGATATGGTC CCGAATGGCC TATAAGCTAT
TCAGATCTTG AACCTCATTA CAATATTTTA GAAAGATTTT TCAAAGTGCA TGGTTGTAAT
GATGGACTAA CACAATTACC TGACGGATAT TTTATTAAAA ACCTACCTTT TACTAATTCT
GAGTCTCTAT TTGCTAGCGA AGTAAAAAGA AAGCTCGGTT ATCCAATAAT CCATTCAAGA
GGTTTTGGTC CCCATGCTCA TGTCAATGAT GAAGAATGGC CCAGATCAAG TAGCCCTGGG
AGTACGCTTA AAGTTGCTTT GGCCACAGGA AAAGTTAATT TGCTTCCTGA GCATATAGCA
GAAACATTAA TAATAAATAA ATCGAATTCA ATTGCCGAAG GAATAATAGT TATTGACAAA
GAAACAGGGG CTCGTAAAGA ACTTAAGAGT AAATTAATCG TACTATGTGG CTCTACAATA
CAAAGCCTTA GACTTCTACT AAATTCACAA GAAAAGTATA ATAATAAAAG ATTAATAGAT
CTTTCAGAAA GTCTAGGATG TAATTTAATG GATCATATTT CCACCTGTAG ATTTTTTGCA
ATGCCAACAA AAACTGAGTC TAAAGACTCT AAATTTTTAA CAGCCCAAAA GTTATCAGGT
GCAGGTAGCT TCTTTATTCC ATTTGGCAAT AAGTTAGATT CAACAGATGA TGTTGATTTT
CGAAGAGGCT ATGGTATTTG GGGAGCAATA GATCGATTTG AGCCTCCTGG AATACTAAAA
AGAAAACCAA ATTCAAAAAT AGGTTTTTTA ATAGGACATG GTGAAGTACT TCCCTATAAA
GAGAACAAGG TCACTCTTTC TACTCAACTA GATAAGTGGG GTGTTCCAAT ACCAAGTATT
GAATGCGAAT GGAAGACTAA CGAGATAAAG ATGGTTGCTC ATATGAATAA AACAATTCAA
AAATGTATTT CCGCAGCTGG AGGTGAAATA CTCCCGCTAA AAGAACTAAT AAAAATGCCT
TTTGTAGAAC CTATTATTAA CAGTGCAATG GCAATACAAG ACAAAGCTCC TCCGCCTGGG
TATTACATAC ATGAAGTAGG TGGAGCTCCT ATGGGGTATT GTCCAGAATC TAGCGTTTTA
GACCCATTAA ATAGATTATG GGCCTGTCCA AATGTATTAG TAGTAGATGG ATCTTGTTGG
CCAACCTCTT CTTGGCAAAG TCCAACTTTG ACAATGATGG CTATATCAAG AAGAGCTTGC
TTAGGAGCTA TTAAGAATCA GAGAGCTTAA
 
Protein sequence
MTNTYEAIVI GSGATGGVAS LTLAKAGVRV LLVEAGPLLT AKQALGTEPL NTFKRLLSIF 
NGNHRKQIQH PGYWKANPSL YINEKENPYI YPKEKPFIWT QGRQVGGRSL TWGGITLRLS
NSDLKAATKD GYGPEWPISY SDLEPHYNIL ERFFKVHGCN DGLTQLPDGY FIKNLPFTNS
ESLFASEVKR KLGYPIIHSR GFGPHAHVND EEWPRSSSPG STLKVALATG KVNLLPEHIA
ETLIINKSNS IAEGIIVIDK ETGARKELKS KLIVLCGSTI QSLRLLLNSQ EKYNNKRLID
LSESLGCNLM DHISTCRFFA MPTKTESKDS KFLTAQKLSG AGSFFIPFGN KLDSTDDVDF
RRGYGIWGAI DRFEPPGILK RKPNSKIGFL IGHGEVLPYK ENKVTLSTQL DKWGVPIPSI
ECEWKTNEIK MVAHMNKTIQ KCISAAGGEI LPLKELIKMP FVEPIINSAM AIQDKAPPPG
YYIHEVGGAP MGYCPESSVL DPLNRLWACP NVLVVDGSCW PTSSWQSPTL TMMAISRRAC
LGAIKNQRA