Gene PMN2A_1307 details


Gene Information

Locus tag: PMN2A_1307
Symbol:
ID: 3606702
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL2A
Kingdom: Bacteria
Replicon accession: NC_007335
Strand:
Start bp: 1816060
End bp: 1817130
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 38%
IMG OID: 637688184
Product: phospho-2-dehydro-3-deoxyheptonate aldolase
Protein accession: YP_292500
Protein GI: 72383145
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase
TIGRFAM ID: [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase
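
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based, inclusive positions (the record does not state its convention) and that the final codon is a stop codon that contributes no amino acid (the gene sequence on this page ends in TAA). The variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1816060
end_bp = 1817130
reported_gene_length = 1071    # bp
reported_protein_length = 356  # aa

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases. Assumption: the last codon is a
# stop codon, so it is not counted in the protein length.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1071 0 356
```

Under these assumptions the computed values agree with the reported Gene Length (1071 bp) and Protein Length (356 aa).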


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCACCT CATCATATTT TTCAATGGTG GATAGCACCT CCGACCTTCA TGTGGTCGAG 
ACTCGTCCAT TAATGTCACC AGCATTAATT CATAGAGATT TGCCTTTAGA TAAGGCATCC
TCTGGAGTTG TCTCTACTAC TCGCAATAAG ATTCAATCAA TTCTTCATGG TAATGATCCA
AGAATTTTGG TCATTGTTGG ACCGTGTTCG GTTCATGATG TTGATGCGGC CATTGAATAT
GCAAATCGTT TAGCCCCATT GAGGGAGAGA TATAGTCAAA AGCTTGAGAT TGTTATGCGT
GTTTACTTTG AGAAACCACG CACAACTGTT GGCTGGAAAG GACTTATTAA TGACCCTCAT
CTTGATAATT CTTACGATAT TAATACTGGC TTAAGAAAGG CAAGAGGTCT ATTACTTGAT
TTAGCCAAAG CAGGAATGCC GGCTGCAACT GAATTACTTG ATCCAGTTGT TCCTCAATAT
ATTGCTGATT TAATTAGTTG GACTGCTATT GGAGCAAGAA CGACAGAGAG TCAGACTCAT
CGTGAAATGG CGTCTGGATT ATCAATGCCG GTTGGTTATA AGAATGGTAC TGATGGGACA
GCTACGATTG CGATTAATGC AATGCAAGCG GCCTCAAAAC CTCATCATTT TTTAGGAATT
AATCATGATG GTCATGCCTC AATAGTTAGT ACTACAGGTA ATCCAAATGG TCATCTTGTT
TTAAGAGGTG GTAAGAATGG AACTAACTAC CATCTTGATG CAATTAATTT AATTGCAGAT
GAATTAGCAC AATTTAATAT GCCTGGAAAA GTGATGGTTG ATTGTAGTCA TGGTAATTCT
AATAAAGATT TTCGTAGACA ATCAGAAGTT TTAAAAGACG TAGCAGCACA GATAAGAGGT
GGATCAAAGA ATCTAATGGG CGTAATGATA GAAAGTCATC TTGTTGAGGG TAATCAGAAA
TTAAATTCAG ATTTGTCAAA ACTTACCTAT GGGCAAAGTG TTACGGATGC ATGCATAAAC
TTTTCTACAA CTGAAATTTT ATTAGAAGAA CTAGCTGAAT CGGTCAAATA A
 
Protein sequence
MSTSSYFSMV DSTSDLHVVE TRPLMSPALI HRDLPLDKAS SGVVSTTRNK IQSILHGNDP 
RILVIVGPCS VHDVDAAIEY ANRLAPLRER YSQKLEIVMR VYFEKPRTTV GWKGLINDPH
LDNSYDINTG LRKARGLLLD LAKAGMPAAT ELLDPVVPQY IADLISWTAI GARTTESQTH
REMASGLSMP VGYKNGTDGT ATIAINAMQA ASKPHHFLGI NHDGHASIVS TTGNPNGHLV
LRGGKNGTNY HLDAINLIAD ELAQFNMPGK VMVDCSHGNS NKDFRRQSEV LKDVAAQIRG
GSKNLMGVMI ESHLVEGNQK LNSDLSKLTY GQSVTDACIN FSTTEILLEE LAESVK