Gene P9301_18901 details


Gene Information

Locus tag: P9301_18901
Symbol: aroG
ID: 4912543
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9301
Kingdom: Bacteria
Replicon accession: NC_009091
Strand:
Start bp: 1618274
End bp: 1619341
Gene Length: 1068 bp
Protein Length: 355 aa
Translation table: 11
GC content: 33%
IMG OID: 640161496
Product: phospho-2-dehydro-3-deoxyheptonate aldolase
Protein accession: YP_001092114
Protein GI: 126697228
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase
TIGRFAM ID: [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase
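
The coordinate and length fields above are mutually consistent, which a few lines of arithmetic can confirm. The sketch below assumes 1-based, inclusive coordinates and an unspliced CDS whose final codon is a stop codon; the function names are illustrative and do not belong to any database API.

```python
# Values copied from the record above.
START_BP = 1618274
END_BP = 1619341


def gene_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "1068 355"
```

Both results match the reported Gene Length (1068 bp) and Protein Length (355 aa).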


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.667922
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGACAT CATCAAATAA TTCAGCTTTA GAAAAGACAT CAGATTTACA TGTTCTTGAA 
ACACGTCCAT TAATACCTCC AAGCAGATTA CATAATGATA TACCTTTAGA TCACGACTCT
GCTAATACAG TATCTAAAAC AAGAAGATCG ATACAAAATA TTTTGCATCA TAATGATCAG
AAGCTTTTAG TCATTGTGGG TCCATGTTCA ATTCATGATC TTGAGGCGGC AAAGGAATAT
TCAAAATATA TTCAAAAATT CCGAGAAATG TATAAAGATA AATTAGAAAT AATTATGAGA
GTATATTTTG AAAAACCAAG AACAACTATT GGCTGGAAGG GATTGATAAA TGATCCTCAT
CTAGATGATT CTTATGATAT TAATACTGGT TTAAGAAGAG CAAGAAGTTT GCTTTCATAT
TTAGCAACTC GTGGTATACC TTCTGCTACA GAATTACTAG ATCCAATTGT TCCTCAATAC
ATTGCCGATT TAATAAGTTG GACAGCCATA GGTGCGCGGA CTACAGAAAG TCAAACTCAT
AGAGAAATGG CATCAGGATT ATCAATGCCT ATAGGCTTTA AAAATGGAAC GGATGGTTCT
TTTACTACTG CAATTAATGC AATGCAGTCA GCTTCAAAAT CCCATCACTT CTTAGGTGTA
AATGAAAATG GAATGGCTTC TATAGTTAAT ACTACAGGAA ATCCAGATGG ACATATAGTT
TTAAGGGGTG GTTCAAAAGG CCCAAATTTC GAAAATGATC ATATACAAAG AATTTCAGCA
GAATTGAGGC AATGTAGTCT TCCCCATAAA GTGATGATTG ATTGTAGTCA TGGAAATTCC
AATAAAGATT TCCGAAAACA GTCGGAAGTG CTAAAAAATG TGGCTTCTCA AATTAGTAAT
GGTGAAAAAA ATATTTTAGG AGTTATGCTT GAGAGTCATT TGAAGGAAGG AAATCAAAAA
CTTTTAAAAA AAGAAGATCT CCAGTTTGGT AGAAGCATTA CAGATGCATG TATAGATATA
GAAACAACAA AAAAATTAAT AGCTATTTTA TACGATTCAC TTAGCTAG
 
Protein sequence
MMTSSNNSAL EKTSDLHVLE TRPLIPPSRL HNDIPLDHDS ANTVSKTRRS IQNILHHNDQ 
KLLVIVGPCS IHDLEAAKEY SKYIQKFREM YKDKLEIIMR VYFEKPRTTI GWKGLINDPH
LDDSYDINTG LRRARSLLSY LATRGIPSAT ELLDPIVPQY IADLISWTAI GARTTESQTH
REMASGLSMP IGFKNGTDGS FTTAINAMQS ASKSHHFLGV NENGMASIVN TTGNPDGHIV
LRGGSKGPNF ENDHIQRISA ELRQCSLPHK VMIDCSHGNS NKDFRKQSEV LKNVASQISN
GEKNILGVML ESHLKEGNQK LLKKEDLQFG RSITDACIDI ETTKKLIAIL YDSLS
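
The protein sequence can be rederived from the gene sequence, and the GC content recomputed, with a short script. This is a minimal sketch, not the database's own pipeline: `clean`, `translate`, and `gc_fraction` are hypothetical helper names, and the codon table is the standard one, whose codon-to-amino-acid assignments translation table 11 shares (the two tables differ in which start codons they allow).

```python
# Standard codon table in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in tens."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence above.
first_line = "ATGATGACAT CATCAAATAA TTCAGCTTTA GAAAAGACAT CAGATTTACA TGTTCTTGAA"
print(translate(first_line))  # prints "MMTSSNNSALEKTSDLHVLE"
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` allows the same comparison against the complete protein and the reported GC content.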