Gene Cyan8802_4222 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_4222 
Symbol 
ID8393573 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp4359331 
End bp4360431 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content39% 
IMG OID644982134 
Product3-dehydroquinate synthase 
Protein accessionYP_003139846 
Protein GI257061958 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0337] 3-dehydroquinate synthetase 
TIGRFAM ID[TIGR01357] 3-dehydroquinate synthase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.950152 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAATTA TTATTCCTGT TAAGTTACCC CATACTTCTT ACAACATTGC GATCGCCCCT 
GGTAGTCTTT CTCAGTTAGG AAGCCATCTA GAACCCCTCA AATTAGGTCA AAAAATCCTG
ATCATTTCTA ACCCTGAAAT TTTTAACTAT TATGGTGATG TAGTTGTCAA TTCCCTCAAG
AAATCAGGTT TTGAAGTATT TACCCATCTT ATTCCGGCCG GAGAAGCTTA CAAAACTCTA
GACTCCATCG CCCAAGTCTA TGATACCGCC TTAGAACATC GGTTAGAAAG GTCATCAACA
ATGATAGCCC TAGGGGGAGG GGTCATTGGG GATATGACGG GGTTTGCTGC GGCAACTTGG
CTAAGGGGAA TCAATTTTGT TCAGGTGCCC ACCTCTCTAT TAGCCATGGT AGATGCTTCT
ATTGGGGGTA AAACAGGGGT CAACCATCCC CAAGGAAAAA ACCTCATTGG AGCCTTTTAT
CAACCCCGTT TAGTGTTTAT TGATCCTTCG GTGTTAAAGA CGTTGCCTGT GCGGGAATTT
CGGGCAGGAA TGGCGGAAGT CATTAAATAT GGCATTATTT GGGATAAAGC GTTATTTGAG
CAATTAGAAC AAGCCAAAAC ACTCGATCAT CTTAATAGTT TAAATGATGA ATTATTGCAA
ACCATTATTA CCCGTTCTTG TCAAGCGAAG GTCGATGTTG TTAGCCAAGA TGAAAAAGAA
AGTGGTTTAA GAGCTATTTT GAATTATGGT CATACTATTG GTCATGCAAT AGAAAGTTTA
ACCGGATATG AAACCATTAA TCATGGTGAA GCGGTAGCAA TGGGGATGGT AGCTGCGGGA
AAAATCGCCA TTAAATTATC ATTATGGACA CAAGAAGAAA CCATTCGACA AGACCAGTTA
ATTGACAAAG TTGGATTAAT TTCTACCATT CCTAAGACGC TAGATATTGA TCAAGTGATT
GAGAGTTTAC AGAGCGATAA AAAAGTCAAA AGCGGAAAAG TTCGGTTTAT TCTCCCAACG
AGCATTGGTA AGGTTATTAT TAGCGATCAA GTTTCTTCGG AAATTATTAA ATCAGTTATG
ATTCATCAGG TTAATAAGTA A
 
Protein sequence
MSIIIPVKLP HTSYNIAIAP GSLSQLGSHL EPLKLGQKIL IISNPEIFNY YGDVVVNSLK 
KSGFEVFTHL IPAGEAYKTL DSIAQVYDTA LEHRLERSST MIALGGGVIG DMTGFAAATW
LRGINFVQVP TSLLAMVDAS IGGKTGVNHP QGKNLIGAFY QPRLVFIDPS VLKTLPVREF
RAGMAEVIKY GIIWDKALFE QLEQAKTLDH LNSLNDELLQ TIITRSCQAK VDVVSQDEKE
SGLRAILNYG HTIGHAIESL TGYETINHGE AVAMGMVAAG KIAIKLSLWT QEETIRQDQL
IDKVGLISTI PKTLDIDQVI ESLQSDKKVK SGKVRFILPT SIGKVIISDQ VSSEIIKSVM
IHQVNK