Gene A9601_14101 details


Gene Information

Locus tag: A9601_14101
Symbol:
ID: 4718131
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 1182007
End bp: 1183119
Gene Length: 1113 bp
Protein Length: 370 aa
Translation table: 11
GC content: 33%
IMG OID: 640079131
Product: hypothetical protein
Protein accession: YP_001009801
Protein GI: 123968943
COG category: [R] General function prediction only
COG ID: [COG0673] Predicted dehydrogenases and related proteins
TIGRFAM ID:
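
The reported lengths can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Both are assumptions, although they agree with the values in this record. The variable names are illustrative only.

```python
# Coordinates copied from the record above.
start_bp = 1182007
end_bp = 1183119

# Assumption: 1-based, inclusive coordinates, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")      # 1113 bp
print(protein_length, "aa")   # 370 aa
```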


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTATGCGA AATTCAAACC TATCACTAAT AGAAAAATTA AAGTTGGAAT AGTTGGTTGC 
GGTCGAATTT TTAAAAAGCA TCTTGAGGCA ATTACAAATA ATTTTGAAAG AATAGAATTA
GTTGCAATTT GCGATGAAAA TAATGATTCT TTAGAAAAAG CTAATGAATT TATTAAAGAT
GTTTGTTCAA AAATAAAAAA TTTTTCAAAT AATCCCAAAA GGTTTTTTTC CTATAAAATA
TTGTTGGATT ATTGCTCTCA AAATCCTAAT TTCATTGATT TAATTGTATT AGCAACACCA
AGTGGTTTGC ATCCAAGTCA AGTAATTAGT GCTGCTAAAT GTGGTCTAAA TGTTATGACT
GAGAAGCCAA TGGCTACGAA ATGGGCTGAC GGGCTATCTA TGGTTAAAGC CTGCGATGAT
GCTGGTGTAA GATTATATGT CATAAAGCAA AACAGATTTA ATAGAACTCT TCAGTTACTT
AAAAAGCAAA TTGTAAATGG TAGGTTTGGA AGAATAGCAA TGGTAACTTC TAATGTTTTT
TGGCAAAGAC CTCAATCTTA TTACGATCAA GATTCGTGGC GAGGTACCTG GGAGTTTGAT
GGTGGTGCTT TAATGAATCA AGCTAGCCAT TATGTTGATT TAATGGAATG GTTGGTTGGC
CCAATTGCAT CGGTTAATGC TTCAATTGCA ACTGTTGGAC GCAATATTGA AGTTGAGGAT
ACAGCAACTT TAAATTTGAG ATGGCGAAAT GGTGCGCTAG GTTCTATGTC TGTTACCATG
CTTACTTATC CTAAAAATTT AGAGGGCTCA ATAATTGTGT TGGGTGAAAA TGGTTCAGTA
AAGGTAGGGG GTGAAGCTGT CAATAAAATA GAATTTTGGG AATTCAAAGA CAATCATCCT
GATGATAAAA ATGTTGAAAT TAACAACTAT GAAGTTAAAA GTGTTTATGG CTCAGGACAT
TCATTATTTT ATTCAAATAT TCTTGATCAT TTTCAAGGAA AAAATGTTGA TGTTTGTGAT
GGAAGAGAAG GTTTAAAAAG CCTTGAATTA TTAATAGGAG CTTATAGGTC TGCTAGAGAT
GGTAAGAATA TTTATTTGCC CTTAGACTAC TGA
 
Protein sequence
MYAKFKPITN RKIKVGIVGC GRIFKKHLEA ITNNFERIEL VAICDENNDS LEKANEFIKD 
VCSKIKNFSN NPKRFFSYKI LLDYCSQNPN FIDLIVLATP SGLHPSQVIS AAKCGLNVMT
EKPMATKWAD GLSMVKACDD AGVRLYVIKQ NRFNRTLQLL KKQIVNGRFG RIAMVTSNVF
WQRPQSYYDQ DSWRGTWEFD GGALMNQASH YVDLMEWLVG PIASVNASIA TVGRNIEVED
TATLNLRWRN GALGSMSVTM LTYPKNLEGS IIVLGENGSV KVGGEAVNKI EFWEFKDNHP
DDKNVEINNY EVKSVYGSGH SLFYSNILDH FQGKNVDVCD GREGLKSLEL LIGAYRSARD
GKNIYLPLDY
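
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It builds the codon table from the standard NCBI ordering, whose sense-codon assignments translation table 11 shares. Table 11 differs in which start codons it permits, and that is not modelled here. The function name `translate` and the two sample fragments (the first and last lines of the gene sequence above) are choices made for this illustration, not part of the record.

```python
import itertools

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same codon-to-amino-acid mapping.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    # Drop the spaces and line breaks used to group the sequence in blocks.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence listed above.
first_line = "ATGTATGCGA AATTCAAACC TATCACTAAT AGAAAAATTA AAGTTGGAAT AGTTGGTTGC"
last_line = "GGTAAGAATA TTTATTTGCC CTTAGACTAC TGA"

print(translate(first_line))  # MYAKFKPITNRKIKVGIVGC
print(translate(last_line))   # GKNIYLPLDY
```

Both outputs match the start and end of the protein sequence listed above. Every full line of the gene sequence holds 60 bases, a multiple of three, so each line can be translated on its own without shifting the reading frame.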