Gene Cyan7425_1604 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan7425_1604 
Symbol 
ID7287527 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 7425 
KingdomBacteria 
Replicon accessionNC_011884 
Strand
Start bp1453187 
End bp1454245 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content57% 
IMG OID643584604 
Product3-deoxy-7-phosphoheptulonate synthase 
Protein accessionYP_002482335 
Protein GI220907024 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCATCG TGATAAAAAG CGGAACGCCA GATGTGGAAA TCGACCGGAT TAGTGCGGAG 
ATGACCAGTT TGGGGTTTAC ACCGGAAAAA ATTGTCGGTA AGCATAAGGT GGTCATTGGT
CTGGTCGGGG ATACCGCTGA ACTCGATCCG TTGCAAATTC AAGAAGCCAG CCCCTGGATT
GAACAGGTAT TACGGGTCGA ACAACCCTTC AAGCGGGTGA GCCGGGAATA CCGCCATGGG
GAAGCCAGTG AGGTCATTGT TCCTACCCCA AATGGGCCGG TTCATTTTGG CGAATCCCAT
CCCGTCGTGC TGGTAGCCGG TCCCTGTTCA GTCGAAAACG AAGCCATGAT TGTGGAAACG
GCGAAGCGGG TCAAAGCCGC CGGGGCCCAA TTCCTCCGAG GGGGGGCCTA CAAACCCCGG
ACCTCTCCCT ATGCGTTTCA GGGCCATGGT GAAAGTGCTT TAGAGTTACT GGCAGCGGCG
CGATCGGCGA CGGGACTGGG CATTATTACC GAAGTGATGG ACACTGCCGA TCTGGAAAAG
ATTGCTGAGG TGGCCGATGT CCTGCAGGTG GGAGCCCGCA ATATGCAGAA TTTCTCCCTG
CTGAAGCGAG TGGGGGCGCA GGAAAAACCA GTTTTATTGA AGCGGGGCAT GGCGGCGACG
ATCGAAGAAT GGCTGATGGC GGCTGAGTAT ATTCTGGCAG CAGGTAACCC AAATGTGATT
CTGTGTGAGC GGGGCATCCG CAGCTTCGAT CGCCAGTACA CCCGTAATAC GCTGGATTTA
GCGGCGGTCC CGGTCCTGCG CAGCCTTACC CATTTGCCGA TCATGGTAGA CCCCAGTCAT
GGGACGGGTT GGGCTCCCTA CGTTCCAGCT CTGGCCAAAG CAGCGATCGC CCTGGGGGTC
GATTCCCTCA TGATTGAAGT GCACCCTAAT CCCCCCAAAG CCCTGTCCGA TGGCCCCCAA
TCCCTTACCC CTGACCAGTT CGATCGGCTG GTCCCCGAAT TGGCAGTAAT CGGTGAAGCC
GTGCAGCGCT GGCCGCGTCA GGTTGCCGTC GTCGGTTAA
 
Protein sequence
MIIVIKSGTP DVEIDRISAE MTSLGFTPEK IVGKHKVVIG LVGDTAELDP LQIQEASPWI 
EQVLRVEQPF KRVSREYRHG EASEVIVPTP NGPVHFGESH PVVLVAGPCS VENEAMIVET
AKRVKAAGAQ FLRGGAYKPR TSPYAFQGHG ESALELLAAA RSATGLGIIT EVMDTADLEK
IAEVADVLQV GARNMQNFSL LKRVGAQEKP VLLKRGMAAT IEEWLMAAEY ILAAGNPNVI
LCERGIRSFD RQYTRNTLDL AAVPVLRSLT HLPIMVDPSH GTGWAPYVPA LAKAAIALGV
DSLMIEVHPN PPKALSDGPQ SLTPDQFDRL VPELAVIGEA VQRWPRQVAV VG