Gene Information |
Locus tag | PCC8801_0217 |
Symbol | |
ID | 7103551 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | + |
Start bp | 211308 |
End bp | 212714 |
Gene Length | 1407 bp |
Protein Length | 468 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 643473330 |
Product | isopropylmalate isomerase large subunit |
Protein accession | YP_002370476 |
Protein GI | 218245105 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0065] 3-isopropylmalate dehydratase large subunit |
TIGRFAM ID | [TIGR00170] 3-isopropylmalate dehydratase, large subunit |
|
|
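The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS length includes the stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length, assuming the CDS includes one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above (PCC8801_0217).
length = gene_length_bp(211308, 212714)
print(length)                     # 1407
print(protein_length_aa(length))  # 468
```

Both results agree with the Gene Length (1407 bp) and Protein Length (468 aa) rows of the record.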
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAAAG GAACGCTTTT CGATAAGGTT TGGGATGCCC ACACAGTGCA AATTTTACCT TCAGGACAAA CCCAACTCTT TATCGGACTC CATCTAATTC ACGAAGTCAC CAGTCCCCAA GCATTTGCCA TGTTGCGGGA AAGAGGACTC AAAGTATTGT ATCCCGATCG CACCGTCGCC ACGGTCGATC ACATCGTCCC CACAGAAAAC CAGGCCAGAC CCTTTGCGGA CTATCTAGCC GAAGAAATGA TGCAAGCGTT GGAGAAAAAC GCCCAAGATA ACGGGATTCG CTTCTGTCAT ATTGGATCAG GCGATCAAGG GATCGTCCAT GTGATCGCCC CCGAACAAGG GTTAACCCAA CCCGGAATGA CCATCGCCTG TGGAGACTCC CATACCTCCA CCCATGGGGC TTTTGGCGCG ATCGCCTTTG GTATCGGAAC CTCCCAAGTC AGGGATGTCT TAGCTTCCCA AACCCTGGCT TTAGCTAAGC TAAAAGTCCG TAAAATTGAA GTTAATGGAA CCCTAGCCCC TGGAGTCTAT GCTAAGGACG TGATTCTCCA TATTATCCGT AAATTAGGCG TGAAAGGCGG TGTGGGATAC GCCTACGAAT ACGCCGGAAC CACCTTTGAG GCCATGTCCA TGGAAGAGAG GATGACGGTG TGTAATATGG CCATTGAAGG GGGGGCTCGA TGTGGCTATA TTAACCCCGA CCAAGTGACC TTTGACTACC TCAAGGGGCG AGATTTTGCC CCCACAGGAG AGAATTGGGA TAAGGCAGTG GAATGGTGGC AAAGTATTCG GTCTGATGCC GATGCCGAGT ACGATGATGT CATAGTATTC GACGCTAAAG ACATCGAACC GACCGTAACA TGGGGCATTA CCCCCGGTCA AGGTATTGGA GTCAGTGAGG TCATTCCTAT CCCTGACAGT TTACCCGAAA GCGATCGCGC CATTGCTAAA GAAGCCTATG AGTATATGCA GCTTTCCCCT GGAGCCCCCA TTAAGGGGAC TAAGGTAGAT GTCTGTTTTA TTGGCAGTTG TACTAATGGA CGTATTAGTG ACCTTCGCGA AGCGGCTAAA TTTGCCCAAG GACACCACGT TTCCCCAGGG GTGAAAGCCT TTGTGGTTCC GGGGTCAGAA CGGGTTAAGG TTCAAGCAGA AGCCGAAGGA CTCGATAAAA TCTTTGTTGA GGCTGGGTTT GAATGGCGCG AGGCCGGTTG TTCCATGTGT TTAGCCATGA ACCCGGATAA GCTACAAGGG GATCAAATTA GTGCCTCGTC TTCTAACCGC AATTTTAAGG GTCGTCAGGG GTCTTCAACC GGTCGGACTT TGTTAATGAG TCCGGCGATG GTGGTAGCCG CCGCGATCAA TGGCCAAGTC TCGGATGTAC GCGAATTAGT CTCCTAA
|
Protein sequence | MSKGTLFDKV WDAHTVQILP SGQTQLFIGL HLIHEVTSPQ AFAMLRERGL KVLYPDRTVA TVDHIVPTEN QARPFADYLA EEMMQALEKN AQDNGIRFCH IGSGDQGIVH VIAPEQGLTQ PGMTIACGDS HTSTHGAFGA IAFGIGTSQV RDVLASQTLA LAKLKVRKIE VNGTLAPGVY AKDVILHIIR KLGVKGGVGY AYEYAGTTFE AMSMEERMTV CNMAIEGGAR CGYINPDQVT FDYLKGRDFA PTGENWDKAV EWWQSIRSDA DAEYDDVIVF DAKDIEPTVT WGITPGQGIG VSEVIPIPDS LPESDRAIAK EAYEYMQLSP GAPIKGTKVD VCFIGSCTNG RISDLREAAK FAQGHHVSPG VKAFVVPGSE RVKVQAEAEG LDKIFVEAGF EWREAGCSMC LAMNPDKLQG DQISASSSNR NFKGRQGSST GRTLLMSPAM VVAAAINGQV SDVRELVS
|
| |
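The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a record could be checked: it strips the spacing, computes the GC fraction, and translates the coding sequence. It assumes the codon assignments of translation table 11, which match the standard code (the table differs in its permitted start codons, which this sketch does not model); the helper names are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record above.
prefix = "ATGAGTAAAG GAACGCTTTT CGATAAGGTT"
print(translate(prefix))  # MSKGTLFDKV
```

The translated prefix matches the first block of the listed protein sequence. Passing the full Gene sequence field to `translate` and `gc_fraction` would allow the Protein sequence and GC content rows to be compared in the same way.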