Gene P9301_06751 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_06751 
Symbol 
ID4912804 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp602142 
End bp603782 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content36% 
IMG OID640160256 
Productglucose-methanol-choline (GMC) oxidoreductase:NAD binding site 
Protein accessionYP_001090899 
Protein GI126696013 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGATATAA GTCCTTATGA TGCAATTGTT GTTGGTTCTG GAGCTACAGG AGGAATAGCA 
GCACTTACAT TGGCAGAACA AGGGATCAAA GTTTTAGTAA TAGAAGCAGG GCCTCAAGTT
AAAAGGGATG AAGCTAGTAA TCATGAGCCA AAAAGTACAT TAAAAAGATT ATCAGGACTA
ATAACAAAAA AAAATGCCAA TCAGTGCCAA CATCCTGGTT ATTGGAAAAA TAATCCTGAC
TTATATTCAA ATGAATTGAA GCATCCTTAT GACTTCCCAA AAAAAAAGCC ATTTCTTTGG
ACACAAGGTA AACAATTTGG GGGGAGATCA TTAACCTGGG GAGGCATAAC TTTAAGACTT
TCCTCAGAAG ACTTTCATCC AGCTAAAAAA GACGGATTCG GGCCAAACTG GCCTATCTCG
TACGAAGAAA TCTCCCCGCA CTATGATTTT ATTGAAAATT TCTGCGGAAT TTATGGCCGA
AAAGATGATA TCAAGGAGGT CCCAAACGGT AAATATATTG GTGAGATTCC TCTTACAGAA
AACGAAAATG TTTTTGGCAG CAAAGTTAAA TCAAAATTAA ACTATCCATT TATCCAATCA
AGAGGATTTG ACCGTAATTC ATCTGTAAAA GAAAAAAATT GGCCCAAGTC CTCTAGCTTA
GGAAGCTCTT TTAAAAAAGC TTTAGATACT GGAAATGTAC AAATAATCTC TAATTACCTA
GTGGAGTCTT TTGAGATTAA CAAGATAACA GAGCTTGCCT CAAAACTAAC GATTGTAAAC
CTAGAAAATG GATACAAAGA AGTATTGGAT TGTGATTTGA TTCTTCTTTG CGCATCAACA
ATTTCAACAT TGAGAATACT ATTAAACTCA GAATACAAAT CAAATTCCTC AGGGTTTAAA
GATAATTCTG GGAAATTAGG TAAATATCTA ATGGATCACA TATCTATATG TAGATTTTTT
TCAGTCCCAA AAGCAAAAAA CTCAGATAAA TCACTAGATA ATCCTCCCGA TCTTTCTGGA
GCAGGCAGCT TCTTTATTCC TTTTGGTTCA AATTTGCCAG AAATTGATGA CATAAATTTC
CATAGAGGTT ATGGGATCTG GGGGGCAATT GATAGATTAG GGATACCTAA ATTTTTACAA
AAAGACGTAA ACAAATCCAT TGGCTTTCTT ATCGCCCATG GTGAAGTCCT TCCAAGAGAG
AAAAACTCAG TTTCTCTCTC AAGAAAAACA GATGAATGGG GAATACCAAT TCCCTACATT
GAATTCGAAT GGAGCGAGAA TGAGTTAAAT ATGGCAAAAC ATATGGAAAA AACAATACAA
AGATCAGTCA AAGCTGCAAA TGGGAAAATA AAAAATATTG ATGAACTAAT GAATATTCCA
CTAGGGAGTT TATTTACAAA AAATTTGATA GCACTTTCAG ATAGTCCTCC TCCTCCTGGA
TATTATATTC ATGAGGTAGG GGGAGCACCG ATGGGGTTAA ATGAAGAAAA TAGCGTAGTT
GATAAATTTA ATAGATTGTG GAGATGTAAG AATGTACTGG TACTAGATGG AGCATGCTGG
CCCACATCAT CTTGGCAAAG CCCCACACTT ACAATGATGG CCTTGAGTAG AAGAGCCTGT
TTAAATATTA AAAAGACTTA G
 
Protein sequence
MDISPYDAIV VGSGATGGIA ALTLAEQGIK VLVIEAGPQV KRDEASNHEP KSTLKRLSGL 
ITKKNANQCQ HPGYWKNNPD LYSNELKHPY DFPKKKPFLW TQGKQFGGRS LTWGGITLRL
SSEDFHPAKK DGFGPNWPIS YEEISPHYDF IENFCGIYGR KDDIKEVPNG KYIGEIPLTE
NENVFGSKVK SKLNYPFIQS RGFDRNSSVK EKNWPKSSSL GSSFKKALDT GNVQIISNYL
VESFEINKIT ELASKLTIVN LENGYKEVLD CDLILLCAST ISTLRILLNS EYKSNSSGFK
DNSGKLGKYL MDHISICRFF SVPKAKNSDK SLDNPPDLSG AGSFFIPFGS NLPEIDDINF
HRGYGIWGAI DRLGIPKFLQ KDVNKSIGFL IAHGEVLPRE KNSVSLSRKT DEWGIPIPYI
EFEWSENELN MAKHMEKTIQ RSVKAANGKI KNIDELMNIP LGSLFTKNLI ALSDSPPPPG
YYIHEVGGAP MGLNEENSVV DKFNRLWRCK NVLVLDGACW PTSSWQSPTL TMMALSRRAC
LNIKKT