Gene A9601_07371 details

Gene Information

Locus tag: A9601_07371
Symbol: aroB
ID: 4717442
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 656255
End bp: 657346
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 30%
IMG OID: 640078451
Product: 3-dehydroquinate synthase
Protein accession: YP_001009130
Protein GI: 123968272
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0337] 3-dehydroquinate synthetase
TIGRFAM ID: [TIGR01357] 3-dehydroquinate synthase
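
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 656255
END_BP = 657346
GENE_LENGTH_BP = 1092
PROTEIN_LENGTH_AA = 363


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons hold for this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, the span 656255 to 657346 covers 1092 bp, which is 364 codons: 363 amino acids plus the stop codon.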


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAATAAGA GAAAAATATT AGTCCCATTA GGTGATAAGT CATACGAAGT AACTCTAGAA 
GCAGGGATAC TGAATAACAT TAGCGAAGAA CTCTTAAAAA TTGGAATAAC AAAGAAAAGA
AAAATACTTG TGATTTCAAA TGAAGAAATA TCAAATTTGT ATGGTGAGAA ATTCTTAAAT
AATTTAAAAG ATAATAAATT TCAGGCCAAA ATGTTCCTTA TCAAGGCTGG AGAATCATAT
AAAAACTTAA AAACCTTAAG TGAAATATAT GATGTAGCAT TTGAATTTGG CTTAGATAGA
AATTCAATAA TTATTGCCCT TGGAGGAGGA ATTGTTGGAG ATGTAAGTGG TTTTGCAGCT
GCTACTTGGC TTAGAGGTAT CGAATATATT CAGATTCCAA CAACATTATT ATCAATGGTT
GATTCATCTG TGGGAGGAAA AACAGGAGTA AATCATCCAA AAGGTAAGAA TTTAATTGGA
GCTTTCAATC AACCTAAAGC AGTTTTTATT GATCCAGAAA CTTTAAAAAG TTTGCCCAAA
AGAGAATTTA GTGCAGGCAT GGCTGAAGTA ATAAAATACG GAGTAATAAG AGATAAAGAA
CTTTTCGAAT ACTTAGAAAT TGAAAAAAAC AAAAATGAAC TTATAAATCT CAAAAATGAA
TATTTAATTA AAATAATTAA TAGTTCAATT AAAACAAAGT CTAATGTTGT TTCTCAAGAC
GAACATGAAA ATGGTGTTAG AGCAATATTG AATTATGGTC ATTCTTTTGG TCACGTTATT
GAAAATTTAT GTGGATACGG CAAATTTCTG CATGGTGAGG CAATATCAAT TGGTATGAAT
ATTGCGGGGA AAATAGCAAT TGAAAAAGGG TTATGGTCTA AAGAAGAATT AGAGAGACAG
CGAATTCTCT TAGAGAGTTA TGATCTTCCT ACCGAGATCC CCAAAATAAA TAAAGAAGAC
GTTCTAACAA TACTTATGGG TGATAAAAAA GTTCGTGATG GCAAAATGAG ATTTATATTA
CCGAAAGAAA TTGGTGCTGT TGATATATAT GATGACGTGG AAGATTCATT ATTTTTAAAG
TTTTTTTCTT AA
 
Protein sequence
MNKRKILVPL GDKSYEVTLE AGILNNISEE LLKIGITKKR KILVISNEEI SNLYGEKFLN 
NLKDNKFQAK MFLIKAGESY KNLKTLSEIY DVAFEFGLDR NSIIIALGGG IVGDVSGFAA
ATWLRGIEYI QIPTTLLSMV DSSVGGKTGV NHPKGKNLIG AFNQPKAVFI DPETLKSLPK
REFSAGMAEV IKYGVIRDKE LFEYLEIEKN KNELINLKNE YLIKIINSSI KTKSNVVSQD
EHENGVRAIL NYGHSFGHVI ENLCGYGKFL HGEAISIGMN IAGKIAIEKG LWSKEELERQ
RILLESYDLP TEIPKINKED VLTILMGDKK VRDGKMRFIL PKEIGAVDIY DDVEDSLFLK
FFS
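
The gene sequence begins with GTG, yet the protein begins with M. This is consistent with translation table 11, in which GTG can serve as an initiation codon and is then read as methionine. The following is a minimal sketch that translates the first 30 and last 12 nucleotides of the gene sequence above; the `translate` helper and its `first_is_start` parameter are hypothetical names used for illustration, and the codon table is built from the standard amino-acid string in T, C, A, G order.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (shared by tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)
# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str, first_is_start: bool = True) -> str:
    """Translate DNA codon by codon; '*' marks a stop codon.

    If first_is_start is True and the first codon is a table 11
    initiation codon, it is emitted as 'M'.
    """
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and first_is_start and codon in START_CODONS:
            protein.append("M")
        else:
            protein.append(CODON_TABLE[codon])
    return "".join(protein)


# First 30 nt and last 12 nt of the gene sequence in this record.
five_prime = "GTGAATAAGA GAAAAATATT AGTCCCATTA"
three_prime = "TTTTTTTCTT AA"

print(translate(five_prime))                          # MNKRKILVPL
print(translate(three_prime, first_is_start=False))   # FFS*
```

The two outputs match the first ten residues and the last three residues of the protein sequence, followed by the TAA stop codon.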