Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_08511 |
Symbol | ilvA |
ID | 5731462 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 744473 |
End bp | 746017 |
Gene Length | 1545 bp |
Protein Length | 514 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 641285215 |
Product | threonine dehydratase |
Protein accession | YP_001550736 |
Protein GI | 159903392 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1171] Threonine dehydratase |
TIGRFAM ID | [TIGR01124] threonine ammonia-lyase, biosynthetic, long form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.306662 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00240564 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAGGACT ATTTGCAAAA AATCCTTAGA GCTCGTGTCT ATGACATCGC AACAAAGACA CCTTTAGAGA AGGCTAAGAA TTTAAGTAAT CGGTTGAAAA ATAAGGTTTG GTTGAAAAGA GAGGATCTAC AACCAGTGTT TTCCTTCAAG TTGAGGGGGG CATATAACAG AATGGCATCT TTAACAGACG AGGAATTAAA AATTGGTGTT GTTGCATCGA GTGCTGGTAA TCATGCGCAG GGAGTTGCTC TAAGTGCGTC CTATCTAAAA TGTAGGGCCG TAATAGTAAT GCCTATAACA ACACCAGCAA TGAAAATCAC CTCTGTTCGA AGATTAAAAG CAGAGGTAAT TCTTTATGGT GAAACATATG ATGAGGCTTA CAAAGAAGCT CAGAGAATAA GTGAAAAAAA TGGGTTAGTG TTTATACATC CATTTGATGA TCCTGAAGTT ATTGCTGGCC AAGGAACTAT TGGTCAAGAA ATACTCTCGC AAATAGAAAA TCCGCCTGAT GCAATTTATA TAGCAGTTGG TGGGGGTGGA CTAATTGCAG GCGTAGCCTC ATTCGTAAAA AGTCTATGGC CAAGCACAGA GATAATTGGG GTAGAACCAA CTGATGCGTC TGCAATGACA GATTCATTAA AGGCAGGCAA AAGAATTGAA TTGAAAGATG TCGGCCTATT TGCTGATGGA GTAGCGGTTA AAAAAGTAGG TGAGAAAACT TTTGAATTAG CCAAGAGATA TGTCGACAGA ATGATTACTG TTAATACCGA CGAAATATGT GCAGCAATTA AAGATGTATT TGAAGACACA CGGTCAATAC TGGAACCAGC AGGTGCTTTG TCTATAGCGG GTCTAAAGTC TGACGTTTCT AACAGAAACT TAGTTAATAA AAATTTAGTC GCAATTGCAT GTGGTGCAAA TATGAATTTT GACAGACTAA GGTTTGTTGC TGAAAGGGCT GAGTTAGGAG AGAAAAGAGA GGCTATGTTT GCAGTAGAGA TACCAGAGAA TGCAGGGAGC CTTAAGAATT TATGCAAAAT ACTAGGCAAC AGAAGTCTTA CAGAATTTAG TTATCGAATG TCTGAAGGAA ATTCAGCCCA GATCTTTATG GGTTTAGAAG TAGAAGGCAA TAAAGACAAA GGTGACCTTA TAAAAAGTAT TAATTCGAAG AGTTTTAAAT GTCTTGACCT AAGTGATGAC GAGCTCTCAA AAGTTCATTT ACGGCATATG GTCGGCGGAA GATTACCAAT ATCAGCCAAT GCCCTCTCGA ATAAAAACTA TAAGGAACTT CTTTATAGGT TCGAATTCCC TGAAAGGCCT GGAGCCTTAA TGCGTTTTGT AAACACTATG AGACCTAATT GGAATATCAG TATTTTCCAC TATCGCAATC ATGGTGCTGA TATTGGGAGA ATAGTTATCG GAGTACTTGT AGAAGAAAAG GATCTTGAAG CTTGGGAGAG ATTCCTCAAA GAAATAGGGT ACAAAAGCTG GGAAGAAACC AAAAATCCGG CTTATCAGCT TTTCTTAGGT GCACAGAATG GCTAA
|
Protein sequence | MEDYLQKILR ARVYDIATKT PLEKAKNLSN RLKNKVWLKR EDLQPVFSFK LRGAYNRMAS LTDEELKIGV VASSAGNHAQ GVALSASYLK CRAVIVMPIT TPAMKITSVR RLKAEVILYG ETYDEAYKEA QRISEKNGLV FIHPFDDPEV IAGQGTIGQE ILSQIENPPD AIYIAVGGGG LIAGVASFVK SLWPSTEIIG VEPTDASAMT DSLKAGKRIE LKDVGLFADG VAVKKVGEKT FELAKRYVDR MITVNTDEIC AAIKDVFEDT RSILEPAGAL SIAGLKSDVS NRNLVNKNLV AIACGANMNF DRLRFVAERA ELGEKREAMF AVEIPENAGS LKNLCKILGN RSLTEFSYRM SEGNSAQIFM GLEVEGNKDK GDLIKSINSK SFKCLDLSDD ELSKVHLRHM VGGRLPISAN ALSNKNYKEL LYRFEFPERP GALMRFVNTM RPNWNISIFH YRNHGADIGR IVIGVLVEEK DLEAWERFLK EIGYKSWEET KNPAYQLFLG AQNG
|
| |