Gene Information |
Locus tag | NATL1_03341 |
Symbol | leuC |
ID | 4779767 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 307783 |
End bp | 309192 |
Gene Length | 1410 bp |
Protein Length | 469 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640083600 |
Product | isopropylmalate isomerase large subunit |
Protein accession | YP_001014163 |
Protein GI | 124025047 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0065] 3-isopropylmalate dehydratase large subunit |
TIGRFAM ID | [TIGR00170] 3-isopropylmalate dehydratase, large subunit |
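
The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 307783
end_bp = 309192
protein_length_aa = 469

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the final codon is a stop.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(gene_length_bp, remainder, expected_protein_length)
```

Under these assumptions the computed gene length is 1410 bp and the expected protein length is 469 aa, which agrees with the Gene Length and Protein Length fields.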
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.6876 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAGTTCTA GAACCCTCTA CGACAAAGTT TGGAACTTCC ATCAGGTAAA AGAATTACCT GGAGGATCAA CTCAACTTTT TATTGGTCTT CATTTAATTC ACGAAGTTAC AAGTCCTCAG GCTTTCTCTG CACTTAATGA AAAAAAACTT GGAGTAAAGT TTCCCAATCT TACTGTCGCA ACAGTTGACC ATATAGTTCC GACTTCAAAC CAGCAACGTC CTTTTAGTGA TCCTCTTGCC GAGGAAATGT TGTCTACTTT AGAAAAAAAC TGCAAAACTC ATGGGATAAA ATTTCACGGA ATTGGAAGTA ATTCGCAAGG AGTAGTACAT GTTATGGCTC CAGAATTAGG ATTAACCCAG CCTGGAATGA CAGTAGCCTG CGGAGATTCG CATACCTCAA CCCATGGAGC ATTTGGAGCA ATTGCCTTTG GAATTGGAAC TAGTCAAGTG CGAGATGTTT TAGCCAGTCA AAGCTTGGCA ATGAATAAAT TAAAAGTAAG AAGAATATGG GTAGAGGGTG AATTGCAAAA GGGAGTCTAT GCAAAAGACC TAATTCTTCA TATCATTCGT CATCTTGGAG TTAAAGGAGG TGTTGGATTT GCATATGAAT TTGCTGGGCC TGCAATAGAA AAACTCTCAA TGGAGGGACG AATGACCATA TGCAATATGG CTATTGAAGG TGGTGCAAGA TGCGGTTATA TCAATCCAGA TGAAACCACT TTCAAATATT TAAAAGGGAA AGAACATGCG CCCAAAGGTC AAGAATGGGA TAAGGCGATT TCTTGGTGGA AAAGTTTAGC TAGTGATTCA AAAGCAACAT TCGATGATGA GATTCAATTA GATGGATCAT CAATCGAACC CACTGTTACT TGGGGTATAA CTCCTGGGCA AGGAATTTCA ATCAAAGAAA CAATTCCAAA CCCTGAGTTT CTTCCCAAAA ATGAGCAACA AATCGCTAAA GACGCATGCA AATACATGAA TCTAAAACCA GACGAACCCA TAGAGGGACA ATCAATTGAT GTTTGCTTTA TAGGGAGCTG TACTAATGGA AGATTAAGTG ATCTCCAAGA GGCATCTAAA ATTGTTAAGG GTAATACAGT AGCTGATGGG ATTAGAGCAT TTGTCGTTCC TGGTTCTCAA AAGGTCGCTA AGGAGGCAAA AGAAAAAGGA TTAGATAAAA TATTTCTTAA AGCAGGTTTT GAGTGGCGAG AACCAGGTTG TTCAATGTGT CTAGCCATGA ACCCAGACAA ACTAGAAGGC AGACAAATAA GCGCTAGTTC GAGTAATAGA AATTTCAAAG GAAGACAAGG CTCCGCAAAA GGGAGAACCT TGCTAATGAG CCCCGCTATG GTTGCTGCTG CTGCAATAAA TGGGAGGGTT ACAGATGTAA GAAAGTTCCT GCAAGAGTAA
Protein sequence | MSSRTLYDKV WNFHQVKELP GGSTQLFIGL HLIHEVTSPQ AFSALNEKKL GVKFPNLTVA TVDHIVPTSN QQRPFSDPLA EEMLSTLEKN CKTHGIKFHG IGSNSQGVVH VMAPELGLTQ PGMTVACGDS HTSTHGAFGA IAFGIGTSQV RDVLASQSLA MNKLKVRRIW VEGELQKGVY AKDLILHIIR HLGVKGGVGF AYEFAGPAIE KLSMEGRMTI CNMAIEGGAR CGYINPDETT FKYLKGKEHA PKGQEWDKAI SWWKSLASDS KATFDDEIQL DGSSIEPTVT WGITPGQGIS IKETIPNPEF LPKNEQQIAK DACKYMNLKP DEPIEGQSID VCFIGSCTNG RLSDLQEASK IVKGNTVADG IRAFVVPGSQ KVAKEAKEKG LDKIFLKAGF EWREPGCSMC LAMNPDKLEG RQISASSSNR NFKGRQGSAK GRTLLMSPAM VAAAAINGRV TDVRKFLQE
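
The gene sequence begins with TTG while the protein begins with M. This is consistent with translation table 11, where TTG is one of several alternative start codons that is read as methionine at the initiation position. The following is a minimal sketch of such a translation, not the database's own pipeline. The helper names `translate_table11` and `gc_fraction` are hypothetical, and the input is assumed to be the coding strand in 5' to 3' orientation, as the Gene sequence field appears to be.

```python
from itertools import product

# Amino acid assignments for codons in TCAG order. Table 11 uses the same
# assignments as the standard code; it differs in its permitted start codons.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate_table11(seq: str) -> str:
    """Translate a coding-strand CDS, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiating codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the Gene sequence field above.
print(translate_table11("TTGAGTTCTA GAACCCTCTA CGACAAAGTT"))
```

Applied to the first 30 bases of the gene sequence, this yields MSSRTLYDKV, the first ten residues of the protein sequence. Passing the full Gene sequence field to `gc_fraction` gives a value that can be compared against the GC content field.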