Gene Information |
Locus tag | P9211_02821 |
Symbol | leuC |
ID | 5730642 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 266061 |
End bp | 267476 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 641284627 |
Product | isopropylmalate isomerase large subunit |
Protein accession | YP_001550167 |
Protein GI | 159902823 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0065] 3-isopropylmalate dehydratase large subunit |
TIGRFAM ID | [TIGR00170] 3-isopropylmalate dehydratase, large subunit |
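The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the reported 1416 bp) and that the last codon is a stop codon that adds no amino acid. The variable names are illustrative and do not come from the database.

```python
# Values copied from the record above; variable names are illustrative only.
start_bp = 266061
end_bp = 267476

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in codons of three bases. The final codon is assumed to be
# a stop codon, which contributes no amino acid to the protein.
codons, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codons - 1

print(gene_length_bp, remainder, protein_length_aa)
```

Under these assumptions the computed values agree with the Gene Length (1416 bp) and Protein Length (471 aa) fields, and the zero remainder shows the gene length is a whole number of codons.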
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAGCTCAA AAACGCTCTA CGACAAAGTC TGGGATCTTC ATCGGATAGC AGATCTCCCT GGAGGAGCTA CCCAGTTATT TGTTGGGCTT CATTTGATTC ATGAAGTAAC CAGTCCTCAA GCATTTGCAG CTTTAGAAGA AAAAGGACTC TCCGTAAATT GTCCTGATAG AACAATTGCA ACCGTCGACC ACATAGTACC CACTACAAAT CACCAAAGAC CTTTCTCAGA TCCTCTAGCT GAGGAAATGC TTCATACACT TGAAAAAAAT TGCTCAAACC ATCACATCAA ACTATTTGGA ATAGGTTCAG GGAATCAAGG CATCGTCCAT GTGATGGCAC CAGAATCAGG TCTAACACAA CCTGGCATGA CGATTGCTTG TGGAGACTCT CATACCTCTA CCCATGGAGC TTTTGGTGCA ATCGCATTTG GGATTGGCAC CAGCCAAGTT AGAGATGTAC TCGCTACCCA AAGCCTGGCA ATGAAAAAGT TAAAGGTTCG TAGAATTTGG GTAGATGGTC AACTCACCAA TGGTGTTTTT GCAAAAGATC TTATCCTTCA TGTAATTCGC CATCTTGGCG TCAAAGGAGG CGTTGGATAT GCTTATGAGT TTGCTGGGCC CGCAATAAAA AAACTCTCAA TGGAAGAGAG AATGACTATA TGCAATATGG CTATAGAAGG TGGAGCAAGA TGCGGATATG TCAATCCTGA TCAAACAACC TTTGATTACT TAGAAGGGAA GCCCTATATA CCAACTGGCC AGGAATGGGA GTCTGCTCTC CAATGGTGGA AGGAATTAGC CTCTGATCAG AATGCGATTT TTGACGACGA AGTGAAATTT GATGCATGCA AAATCTCCCC AACAGTCACT TGGGGTATTA CACCCGGCCA AGCGATTGGG ATTGACGAAT TAATACCAAA GGTCGACTCA CTAGAGACAA GTGACCAACA AACAGCAAGA GAAGCTTATC TTTATATGAA TCTGCATCCA GGAAGCTCTA TTGAAGGTCT TGGAATAGAC GTTTGTTTTA TAGGAAGTTG TACTAATGGC CGCCTAAGTG ATCTTCAAGC AGCAGCAAAG ATTGTTAAAA ATAGACATGT GGCTAAGGGT ATTAAAGCCT TTGTAGTTCC TGGCTCTGAG AAAGTAGCTA AAGCTGCTGA AGCAGAAGGT TTAGATGTTT TATTTCAAAA TGCAGGGTTT GAATGGAGAA AGCCAGGTTG CTCTATGTGT CTTGCAATGA ACCCAGATCG ATTAGAAGGC AATCAAATAA GCGCTAGTTC TAGCAATAGG AACTTTAAAG GAAGACAAGG ATCAGCCAGA GGAAGAACAT TGTTAATGAG TCCAGCAATG GTTGCCGCTG CTGCTATTTC TGGATCTGTA ACAGATGTAA GGAACCTAAT TAACCAAGGA CCATAG
|
Protein sequence | MSSKTLYDKV WDLHRIADLP GGATQLFVGL HLIHEVTSPQ AFAALEEKGL SVNCPDRTIA TVDHIVPTTN HQRPFSDPLA EEMLHTLEKN CSNHHIKLFG IGSGNQGIVH VMAPESGLTQ PGMTIACGDS HTSTHGAFGA IAFGIGTSQV RDVLATQSLA MKKLKVRRIW VDGQLTNGVF AKDLILHVIR HLGVKGGVGY AYEFAGPAIK KLSMEERMTI CNMAIEGGAR CGYVNPDQTT FDYLEGKPYI PTGQEWESAL QWWKELASDQ NAIFDDEVKF DACKISPTVT WGITPGQAIG IDELIPKVDS LETSDQQTAR EAYLYMNLHP GSSIEGLGID VCFIGSCTNG RLSDLQAAAK IVKNRHVAKG IKAFVVPGSE KVAKAAEAEG LDVLFQNAGF EWRKPGCSMC LAMNPDRLEG NQISASSSNR NFKGRQGSAR GRTLLMSPAM VAAAAISGSV TDVRNLINQG P
|
| |