Gene A9601_19091 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_19091 
SymbolaroG 
ID4718648 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp1646278 
End bp1647345 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content34% 
IMG OID640079644 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_001010299 
Protein GI123969441 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.352834 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGACAT CATCCAATAA TTCAGCTTTA GAAAAGACAT CAGATTTACA TGTTGTTGAA 
ACACGTCCAT TAATACCTCC AAGCAGATTA CATAATGATA TACCTTTAGA TCACGCCTCT
GCTAATACAG TATCTAAAAC AAGAAGATCG ATACAAAATA TTTTGCATCA TAATGATAAG
AAGCTTCTAG TAATCGTGGG CCCATGTTCA ATTCATGATC TTGAGGCGGC AAAGGAATAT
TCAAAATATA TTCAAAAATT CCGAGAAATG TATAACGATA AATTAGAAAT AATTATGAGA
GTATATTTTG AAAAACCAAG GACAACTATT GGTTGGAAGG GATTGATAAA TGATCCTCAT
CTAGATGATT CTTATGATAT TAATACTGGT TTAAGAAGGG CAAGAAGTTT GCTTTCATAT
TTAGCAACTC GAGGCATACC TTCTGCTACA GAATTACTAG ATCCAATTGT TCCTCAATAC
ATTGCCGATT TAATAAGTTG GACAGCCATA GGTGCGCGGA CCACGGAAAG TCAAACTCAT
AGAGAAATGG CATCAGGATT ATCAATGCCT ATAGGCTTTA AAAATGGAAC GGATGGTTCT
TTTACTACTG CAATTAATGC AATGCAGTCA GCTTCAAAAT CCCATCACTT CTTAGGTGTA
AATGAAAATG GAATGGCTTC TATAGTTAAT ACTACAGGAA ATCCAGATGG ACATATAGTT
TTAAGGGGCG GTTCAAAAGG CCCAAATTTT GAAAGTGATC ATGTACAAAG AATTTCAGCA
GAATTGAGGC AGTATAATCT TCCCCATAAA GTGATGATTG ATTGTAGTCA TGGAAATTCC
AATAAAGATT TCCGAAAACA GTCAGAAGTG CTAAAAAATG TAGCTTCTCA AATTAGTAAT
GGTGAAAAAA ATATTTTAGG AGTTATGCTT GAAAGTCATT TGAAGGAAGG AAATCAAAAA
CTTTTAAAAA AAGAAGATCT CCAGTTTGGA AGAAGCATTA CAGATGCATG TATAGATATA
GAAACAACAA AAGAATTAAT CGCTATTTTA TACGATTCAC TTAGCTAG
 
Protein sequence
MTTSSNNSAL EKTSDLHVVE TRPLIPPSRL HNDIPLDHAS ANTVSKTRRS IQNILHHNDK 
KLLVIVGPCS IHDLEAAKEY SKYIQKFREM YNDKLEIIMR VYFEKPRTTI GWKGLINDPH
LDDSYDINTG LRRARSLLSY LATRGIPSAT ELLDPIVPQY IADLISWTAI GARTTESQTH
REMASGLSMP IGFKNGTDGS FTTAINAMQS ASKSHHFLGV NENGMASIVN TTGNPDGHIV
LRGGSKGPNF ESDHVQRISA ELRQYNLPHK VMIDCSHGNS NKDFRKQSEV LKNVASQISN
GEKNILGVML ESHLKEGNQK LLKKEDLQFG RSITDACIDI ETTKELIAIL YDSLS