Gene P9303_29971 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_29971 
SymbolaroG 
ID4778538 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp2647903 
End bp2648985 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content53% 
IMG OID640088521 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_001018992 
Protein GI124024685 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.906124 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTCTG GCGAAATGAC CACCACCTCC GACTTGCATG TGGTGGATAC GCGGCCTTTG 
GTGTCACCCG TCTTGCTTCA TCAGGAGCTG CCCCTCGATC TAGTGGCTCT CAAAACCGTT
GCAGACACTC GTAGACGCAT CCAGGCAATT CTGCGTGGTG AGGATCCCCG CTTGCTGGTG
ATTGTTGGTC CCTGCTCGGT TCATGACATT GCCTCTGCAA GGGACTATGC CCGTCAGTTG
GAGCCATTGC GACAGCGATA TGCCGCCCAG TTGGAGGTGG TTTTGCGGGT CTATTTCGAA
AAACCTCGCA CCACCGTTGG CTGGAAAGGT CTCATCAATG ATCCCCATCT CGATGGTTCC
TATGACATCA ACACTGGCTT GAGGCGTGCG CGGTCGTTGT TGCTCGACCT TGCCCGCGAG
GGTATGCCGA CGGCCACTGA ATTGCTGGAT CCGGTTGTTC CTCAATACAT CGCTGATTTG
ATCAGTTGGA CGGCGATTGG AGCCAGAACC ACTGAGAGTC AGACCCATCG GGAGATGGCT
TCTGGATTGT CAATGCCCGT TGGTTACAAA AACGGTACCG ATGGCAGTGC CAAGATTGCG
ATCCATGCGA TGCAGGCAGC ATCTAGGCCG CATCATTTTC TAGGGATCAA TCGGCAGGGT
CAGGCTTCGA TTGTGCATAC CACTGGAAAC CCTGATGGCC ATCTCGTGTT GCGGGGAGGC
AATGGCTGCA CCAATTACCA TCCCGAAGCT GTGGAAGGGG TTGCAAAAGA ATTAGTGAAG
GCTGGCTTGG CTGATCGGTT GATGGTGGAT TGCAGCCATG ACAATTCGAA TAAAGATTTT
CGGCGACAGT CAGAGGTGCT GCAGGCTGTT GCTACTCAGG TACGCCAAGG ATCAACCCAC
CTGATGGGTG TGATGTTGGA AAGTCATCTT GTCGAGGGCA ATCAGAAGTT GCCTGAAGAC
CTCTCTACTC TTGTCTATGG TCAAAGCATT ACGGATGCTT GTATCGATAT AGAGACAACG
GCAACTCTCC TTGAGGATTT GGCGGCTGCA GTGGCTTCAG TGACGTTGTC ACCAATAACT
TGA
 
Protein sequence
MNSGEMTTTS DLHVVDTRPL VSPVLLHQEL PLDLVALKTV ADTRRRIQAI LRGEDPRLLV 
IVGPCSVHDI ASARDYARQL EPLRQRYAAQ LEVVLRVYFE KPRTTVGWKG LINDPHLDGS
YDINTGLRRA RSLLLDLARE GMPTATELLD PVVPQYIADL ISWTAIGART TESQTHREMA
SGLSMPVGYK NGTDGSAKIA IHAMQAASRP HHFLGINRQG QASIVHTTGN PDGHLVLRGG
NGCTNYHPEA VEGVAKELVK AGLADRLMVD CSHDNSNKDF RRQSEVLQAV ATQVRQGSTH
LMGVMLESHL VEGNQKLPED LSTLVYGQSI TDACIDIETT ATLLEDLAAA VASVTLSPIT