Gene PCC8801_4183 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_4183 
Symbol 
ID7104581 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp4387578 
End bp4388678 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content39% 
IMG OID643477170 
Product3-dehydroquinate synthase 
Protein accessionYP_002374269 
Protein GI218248898 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0337] 3-dehydroquinate synthetase 
TIGRFAM ID[TIGR01357] 3-dehydroquinate synthase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAATTA TTATTCCTGT TAAGTTACCC CATACTTCTT ACAACATTGC GATCGCCCCT 
GGTAGTCTTT CTCAGTTAGG AAGCCATCTA GAACCCCTCA AATTAGGTCA AAAAATCCTG
ATCATTTCTA ACCCTGAAAT TTTTAACTAT TATGGTGATG TAGTTGTCAA TTCCCTCAAG
AAATCAGGTT TTGAAGTATT TACCCATCTT ATTCCGGCCG GAGAAGCTTA CAAAACTCTA
GACTCCATCG CCCAAGTCTA TGATACCGCC TTAGAACATC GGTTAGAAAG GTCATCAACA
ATGATAGCCC TAGGGGGAGG GGTCATTGGG GATATGACGG GGTTTGCTGC GGCAACTTGG
CTAAGGGGAA TCAATTTTGT TCAGGTGCCC ACCTCTCTAT TAGCCATGGT AGATGCTTCT
ATTGGGGGTA AAACAGGGGT CAACCATCCC CAAGGAAAAA ACCTCATTGG AGCCTTTTAT
CAACCCCGTT TAGTGTTTAT TGATCCTTCG GTGTTAAAGA CGTTGCCTGT GCGGGAATTT
CGGGCAGGAA TGGCGGAAGT CATTAAATAT GGCATTATTT GGGATAAAGC GTTATTTGAG
CAATTAGAAC AAGCCAAAAC ACTCGATCAT CTTAATAGTT TAAATGATGA ATTATTGCAA
ACCATTATTA CCCGTTCTTG TCAAGCGAAG GTCGATGTTG TTAGCCAAGA TGAAAAAGAA
AGTGGTTTAA GAGCTATTTT GAATTATGGT CATACTATTG GTCATGCAAT AGAAAGTTTA
ACCGGATATG AAACCATTAA TCATGGTGAA GCGGTAGCAA TGGGGATGGT AGCTGCGGGA
AAAATCGCCA TTAAATTATC ATTATGGACA CAAGAAGAAA CCATTCGACA AGACCAGTTA
ATTGACAAAG TTGGATTAAT TTCTACCATT CCTAAGACGC TAGATATTGA TCAAGTGATT
GAGAGTTTAC AGAGCGATAA AAAAGTCAAA AGCGGAAAAG TTCGGTTTAT TCTCCCAACG
AGCATTGGTA AGGTTATTAT TAGCGATCAA GTTTCTTCGG AAATTATTAA ATCAGTTATG
ATTCATCAGG TTAATAAGTA A
 
Protein sequence
MSIIIPVKLP HTSYNIAIAP GSLSQLGSHL EPLKLGQKIL IISNPEIFNY YGDVVVNSLK 
KSGFEVFTHL IPAGEAYKTL DSIAQVYDTA LEHRLERSST MIALGGGVIG DMTGFAAATW
LRGINFVQVP TSLLAMVDAS IGGKTGVNHP QGKNLIGAFY QPRLVFIDPS VLKTLPVREF
RAGMAEVIKY GIIWDKALFE QLEQAKTLDH LNSLNDELLQ TIITRSCQAK VDVVSQDEKE
SGLRAILNYG HTIGHAIESL TGYETINHGE AVAMGMVAAG KIAIKLSLWT QEETIRQDQL
IDKVGLISTI PKTLDIDQVI ESLQSDKKVK SGKVRFILPT SIGKVIISDQ VSSEIIKSVM
IHQVNK