Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Synpcc7942_0626 |
Symbol | |
ID | 3775609 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Synechococcus elongatus PCC 7942 |
Kingdom | Bacteria |
Replicon accession | NC_007604 |
Strand | + |
Start bp | 618362 |
End bp | 620221 |
Gene Length | 1860 bp |
Protein Length | 619 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637799038 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_399645 |
Protein GI | 81299437 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.0546913 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.890987 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTCAGT ACCGATCGCG CACAACGACT TACGGCCGTA ACATGGCTGG CGCCCGAGCA CTTTGGCGTG CCACGGGTAT GAAGGACGAG GACTTTGAAA AGCCGATCAT TGCGGTCGCG AACTCCTTCA CCCAGTTTGT GCCGGGGCAC GTCCACCTCA AGGACTTGGG TCAACTGGTG GCGCGGGAGA TTGAGCGAGC CGGTGGTGTC GCCAAGGAAT TCAACACAAT CGCGGTCGAT GACGGCATCG CCATGGGCCA CGGCGGTATG CTCTACTCTT TACCATCGCG GGACTTGATC GCCGACTCGG TTGAGTACAT GGTCAACGCC CATTGTGCTG ATGCGCTGGT CTGCATTTCC AACTGCGACA AGATCACGCC GGGGATGCTG ATGGCGGCGC TGCGGCTCAA TATTCCCGCC GTGTTTGTCT CCGGTGGCCC GATGGAAGCG GGCAAAGTCA TCCTCAATGG TGAAGAGCGC CACCTCGACT TGGTCGATGC CATGGTTGTC GCCGCCGATG ATCGCGAGTC TGATGAAGAT GTGGCCACGA TTGAGCGATC GGCCTGCCCC ACCTGTGGCT CTTGCTCGGG CATGTTCACG GCTAACTCGA TGAACTGTCT GACGGAAGCG CTGGGCTTGA GTCTGCCGGG CAATGGTTCG TTGCTGGCAA CCCACGGCGA TCGCAAAGAG CTGTTCCTCG AAGCGGGTCG TTTGGCAGTC AAACTAGCGA AACAGTACTA CGAGCAGGAT GACGAGTCGG TCTTACCCCG CAGCATCGCC AGCTTCAAGG CCTTTGAAAA CGCGATCTGT CTCGACATTG CGATGGGCGG CTCGACCAAC ACGGTCTTGC ATCTACTAGC GGCAGCTCAC GAGGCGGGTG TGGACTTCAC GATGAAGGAC ATCGATCGCC TCTCGCGCAA AATCCCTAAC CTCTGCAAGG TCGCGCCCTC GACGCAGAAG TACCACATGG AGGACGTGCA TCGAGCGGGC GGTGTGATCG CCATCCTCGG GGAGCTCGAT CGCGCTGGGC TGTTGCATCG CGAAGTCCCA ACCGTCCATA GCCCCAGTCT GGGAGCAGCT CTCGATCAGT GGGATATCAA CCGAGAAACG GCGACGGAGG AAGCGAAGTC ACGCTATCTG GCGGCTCCGG GCGGGGTACC GACCCAGGAA GCTTTTAGCC AGTCGAAACG CTGGACCGCC TTGGATCTCG ATCGCGAGAA TGGCTGTATC CGCGACATCG AACACGCCTA CTCGCAAGAT GGGGGTCTGG CGGTGCTTTA CGGCAACTTG GCCGAGCAGG GCTGCATTGT CAAAACAGCC GGCGTCGATG AAAACATCCT TGTCTTCTCG GGGCCAGCGG TGGTTTGCGA AAGCCAAGAT GAAGCCGTCA ACTGGATTCT GAACGGGCGC GTCAAGGAAG GCGATGTTGT TCTGATTCGC TACGAAGGTC CGCGCGGTGG CCCCGGTATG CAGGAAATGC TTTACCCCAC CAGCTACCTC AAGTCGAAGG GCTTGGGTAA GGCCTGTGCA CTGATTACTG ATGGACGTTT CTCGGGCGGT ACGTCCGGCC TCTCGATCGG TCATGTTTCG CCGGAGGCGG CGGAAGGCGG TCTCATTGCG CTGGTAGAAC AGGGCGATCG CATCGAAATC GACATCCCGA ATCGCCGCAT TCATCTAGCG GTCTCGGAGG AAGAACTGGC GCACCGCCGT GCTGCTATGG AAGCGCGGGG TGACCAAGCT TGGACTCCGA AAGATCGTGA TCGCCCGATT TCCCAAGCGC TACAAGCCTA CGCAGCCATG ACGACCTCGG CGGCCCGGGG CGGCGTCCGC GATCTCAGCC AAATTCTCGG ATCTCGCTAG
|
Protein sequence | MPQYRSRTTT YGRNMAGARA LWRATGMKDE DFEKPIIAVA NSFTQFVPGH VHLKDLGQLV AREIERAGGV AKEFNTIAVD DGIAMGHGGM LYSLPSRDLI ADSVEYMVNA HCADALVCIS NCDKITPGML MAALRLNIPA VFVSGGPMEA GKVILNGEER HLDLVDAMVV AADDRESDED VATIERSACP TCGSCSGMFT ANSMNCLTEA LGLSLPGNGS LLATHGDRKE LFLEAGRLAV KLAKQYYEQD DESVLPRSIA SFKAFENAIC LDIAMGGSTN TVLHLLAAAH EAGVDFTMKD IDRLSRKIPN LCKVAPSTQK YHMEDVHRAG GVIAILGELD RAGLLHREVP TVHSPSLGAA LDQWDINRET ATEEAKSRYL AAPGGVPTQE AFSQSKRWTA LDLDRENGCI RDIEHAYSQD GGLAVLYGNL AEQGCIVKTA GVDENILVFS GPAVVCESQD EAVNWILNGR VKEGDVVLIR YEGPRGGPGM QEMLYPTSYL KSKGLGKACA LITDGRFSGG TSGLSIGHVS PEAAEGGLIA LVEQGDRIEI DIPNRRIHLA VSEEELAHRR AAMEARGDQA WTPKDRDRPI SQALQAYAAM TTSAARGGVR DLSQILGSR
|
| |