Gene PCC7424_5301 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC7424_5301 
Symbol 
ID7111189 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 7424 
KingdomBacteria 
Replicon accessionNC_011729 
Strand
Start bp5878424 
End bp5879527 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content40% 
IMG OID643483508 
Product3-dehydroquinate synthase 
Protein accessionYP_002380517 
Protein GI218442188 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0337] 3-dehydroquinate synthetase 
TIGRFAM ID[TIGR01357] 3-dehydroquinate synthase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones69 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATTCCC GTATTCCCGT TAACTTACCC CAAAACTCTT ACGAAATTGC CATCGCTTCT 
GGCAGTTTAT CCCGACTCGG TGCTGAAATG AGTACCTTGA ACCTAGGCAA AAAAGTTTTA
CTGGTTTCAA ACCCAGAAAT TTATGACTAT TATGGAGAAA TTGCGGTTAA TTCTCTCAAA
ACAGCCGGAT TTGAAGTAGC GACTCATTTG ATTCCCGAAG GAGAAATCTA TAAAACCTTA
GATTCTATCA CCCAAATTTA TGATACTGCG GTTGAAAATT TTCTAGAACG TTCCTCAACG
ATGATTGCTT TAGGAGGGGG AGTAGTTGGG GATATGACAG GGTTTGCCGC CGCCACATGG
TTGAGAGGGA TTTCTTTTGT GCAAGTTCCC ACCTCTCTTT TAGCCATGAT TGATGCTGCT
ATAGGGGGAA AAACAGGAGT AAATCATCCA AAAGGAAAAA ACTTAATTGG CGCTTTTCAT
CAACCTCGTT TAGTGTTAAT TGATCCTCAA GTTTTGAAAA CATTACCTCC TAGAGAATTT
CGCGCCGGGA TGGCAGAAGT GATTAAATAT GGAGTCATTT GGGATGCGAA TTTATTTAGC
CAATTAGAAG CGGCTAAAAA ATTAGATGAA TTTTCTGATC TCGATGAGGA TTTATTACAA
ACTATTATTA CTCGCTCTTG TCAAGCGAAA GCGGATGTCG TCAGTAAAGA TGAAAAAGAG
GCAGGATTAA GAGCTATTTT AAACTATGGT CATACGATTG GTCATGGGAT AGAAAGTTTA
ACCGGCTACA GTCAAATTAT TCACGGAGAA GGGGTAGCGA TTGGAATGGT TGCAGCCGGC
ACAATAGCGG TTAAATTGCA ATTGTGGAGT GAACAAGAGG CTAACCGTCA GGATGCTTTA
ATTAAAAAAG CAGGTTTACC CACAGAAGTT CCGCCTAATG TGAATATAGA GGCAATTATT
GAAGCGTTGC AAACTGATAA AAAGGTAAAA GCGGGTAAAG TTCGCTTTAT TTTACCGCAA
CAGATAGGAA CTGTAACGAT TACCGATCAG GTTACGTCTG ATGTTATTCG GGAAGTGTTA
GCTCAAATTC AAAGCAAAGC TTAA
 
Protein sequence
MYSRIPVNLP QNSYEIAIAS GSLSRLGAEM STLNLGKKVL LVSNPEIYDY YGEIAVNSLK 
TAGFEVATHL IPEGEIYKTL DSITQIYDTA VENFLERSST MIALGGGVVG DMTGFAAATW
LRGISFVQVP TSLLAMIDAA IGGKTGVNHP KGKNLIGAFH QPRLVLIDPQ VLKTLPPREF
RAGMAEVIKY GVIWDANLFS QLEAAKKLDE FSDLDEDLLQ TIITRSCQAK ADVVSKDEKE
AGLRAILNYG HTIGHGIESL TGYSQIIHGE GVAIGMVAAG TIAVKLQLWS EQEANRQDAL
IKKAGLPTEV PPNVNIEAII EALQTDKKVK AGKVRFILPQ QIGTVTITDQ VTSDVIREVL
AQIQSKA