Gene A9601_05891 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_05891 
Symbol 
ID4717289 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp514532 
End bp515572 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content37% 
IMG OID640078301 
Productdehydrogenase 
Protein accessionYP_001008982 
Protein GI123968124 
COG category[R] General function prediction only 
COG ID[COG5322] Predicted dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.735878 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTGGGT TAATAGGCCA CTCAACCAGT TTTGAAGATG CAAAAAGAAA AGCTTCGATG 
CTAGGCTTTG ATCACATTGC TGATGGCGAC TTGGATGTTT GGTGTACTGC TCCTCCTCAG
CTTGTTGAAA ATGTAGAAGT TAAGAGTGCA ACTGGAATAT CTATTGAAGG TTCTTATATA
GATTCTTGCT TTGTTCCAGA AATGCTTTCT AGGTTTAAAA CCGCTAGAAG AAAAGTACTA
AATGCTATGG AACTAGCTCA GAAAAAAGGG ATTAATATTA CAGCTTTAGG AGGATTTACT
TCTATTATTT TTGAGAATTT TAATCTTCTA CAGCATAAAC AAATTAGAAA TACTTCATTA
GAGTGGGAAA GATTTACTAC TGGCAATACT CATACCGCCT GGGTTATTTG TAAGCAACTA
GAAATAAATG CTCCTCGCAT TGGGATAGAC CTTAAAAAAG CAACTGTTGC TGTAATTGGT
GCTACAGGTG ATATTGGTAG CGCTGTTTGT AGGTGGCTTA TCAATAAAAC TGGGATTTCA
GAACTCCTTA TGGTAGCAAG ACAACAAGAA CCTCTAGCGC TGTTACAAAA AGAATTAGAT
GGTGGCACCA TAACAAGTTT GGATGAGGCA TTGCCTCAGG CGGACATTGT TGTGTGGGTT
GCAAGTATGC CTAAAACTAT TGAAATTAAT ACTGACAACT TACAAAAACC ATGTTTAATG
ATTGATGGTG GATATCCCAA AAATCTTGAT GAGAAATTTC AGGGTGAAAA TATTTATGTT
TTAAAAGGAG GTATAGTAGA GTTTTTCAAT GATATTGGTT GGAATATGAT GGAACTTGCG
GAAATGCAAA ACCCTCAGCG AGAGATGTTT GCTTGCTTTG CAGAAGCTAT GATTTTAGAA
TTTGAGAAGT GTCATACAAA CTTTAGTTGG GGAAGAAATA ACATTTCCCT TGAAAAGATG
GAATTTATTG GAGCAGCTTC TTTAAAGCAT GGTTTTTCCG CCATTGGACT TGATAAGCAG
CCTAAAGTAT TAACTGTCTA A
 
Protein sequence
MFGLIGHSTS FEDAKRKASM LGFDHIADGD LDVWCTAPPQ LVENVEVKSA TGISIEGSYI 
DSCFVPEMLS RFKTARRKVL NAMELAQKKG INITALGGFT SIIFENFNLL QHKQIRNTSL
EWERFTTGNT HTAWVICKQL EINAPRIGID LKKATVAVIG ATGDIGSAVC RWLINKTGIS
ELLMVARQQE PLALLQKELD GGTITSLDEA LPQADIVVWV ASMPKTIEIN TDNLQKPCLM
IDGGYPKNLD EKFQGENIYV LKGGIVEFFN DIGWNMMELA EMQNPQREMF ACFAEAMILE
FEKCHTNFSW GRNNISLEKM EFIGAASLKH GFSAIGLDKQ PKVLTV