Gene P9515_18901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9515_18901 
SymbolaroG 
ID4720540 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9515 
KingdomBacteria 
Replicon accessionNC_008817 
Strand
Start bp1680567 
End bp1681634 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content34% 
IMG OID640081591 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_001012204 
Protein GI123967123 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGACTT CATCAAATAA TCAATCTTTA GAAAAAACAT CTGATTTGCA TGTTGTTGAA 
ACACGTCCAT TGATACCTCC AAGCAAACTT CATAATGATA TACCTTTAGA TTATACCTCT
GCTGATACTG TCTCCAATAC GAGGAGATCG ATACAAAATA TTTTGCATAA TAATGATCCT
AGGCTATTAG TCATTGTGGG ACCATGCTCA ATCCACGATA TTAAAGCTGC TAAAGAGTAT
TCAGAATATA TTCAGGAATT TAGAAAAATC TACAATGATA AATTGGAAAT TGTAATGAGA
GTATATTTTG AAAAACCGAG AACTACAATC GGATGGAAAG GATTGATAAA TGACCCCCAT
TTAGATGGTT CCTACGATAT TAATACAGGT TTACGTAGAG CTAGAAGCTT GCTCTCCTAT
CTTGCGACTA GAGGGATCCC TTCAGCTACT GAGTTGTTGG ACCCCATTGT CCCTCAATAT
ATTGCTGATT TAATCAGCTG GACAGCCATT GGTGCAAGGA CAACTGAAAG TCAAACTCAT
AGAGAAATGG CTTCAGGATT ATCTATGCCA ATTGGTTTTA AAAATGGTAC AGATGGTTCT
TTCAGTACAG CTATTAATGC GATGCAGTCT GCATCAAAAT CTCATCACTT TTTAGGCGTT
AATGATCATG GTTATGCTTC TATTGTAAAT ACGACTGGCA ATCCCGATGG GCATATAGTT
TTAAGGGGTG GGTCTAAAGG AGTTAATTTT GAAAATCAAC ATGTAAAAGG CATATCTTCT
GAATTAAAAG CCAGTAATCT TCCTCATAAG GTTATGATCG ATTGTAGTCA TGGTAATTCT
AATAAAGACT TTAGGAAGCA ATCTGATGTT CTAGAAAACG TAGCAACTCA AATTAAGAAT
GGTGAAAAAA ATATTTTAGG AATTATGCTT GAAAGTCATC TTAAGGAAGG TAATCAAAAA
CTTTCAAATA ATAAAGATCT TGAATATGGG AGAAGTATTA CTGATGCTTG CATTAATATA
GACAAGACAA AAAATTTGCT AGAGAGTTTA TATGATTCAA TTTCTTAA
 
Protein sequence
MTTSSNNQSL EKTSDLHVVE TRPLIPPSKL HNDIPLDYTS ADTVSNTRRS IQNILHNNDP 
RLLVIVGPCS IHDIKAAKEY SEYIQEFRKI YNDKLEIVMR VYFEKPRTTI GWKGLINDPH
LDGSYDINTG LRRARSLLSY LATRGIPSAT ELLDPIVPQY IADLISWTAI GARTTESQTH
REMASGLSMP IGFKNGTDGS FSTAINAMQS ASKSHHFLGV NDHGYASIVN TTGNPDGHIV
LRGGSKGVNF ENQHVKGISS ELKASNLPHK VMIDCSHGNS NKDFRKQSDV LENVATQIKN
GEKNILGIML ESHLKEGNQK LSNNKDLEYG RSITDACINI DKTKNLLESL YDSIS