Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_02041 |
Symbol | |
ID | 5731799 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 193203 |
End bp | 194333 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 641284548 |
Product | aminotransferase class-I |
Protein accession | YP_001550089 |
Protein GI | 159902745 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase |
TIGRFAM ID | [TIGR01141] histidinol-phosphate aminotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAATCAT TCAAGACATC CCAATTTGTA AAACTCCCTC AAGCTCGACA AGAGGTAGAA AAAATGACTC CTTACTCTGC TCCATTAGAA GGTCGTCGAG AGCTTCTAAG GCTGGACTTT AATGAAAATA CTATTGGTCC TAGCCCTAAA GTTATACAGG CCATTCAGAA TATCCCTGCA GATCAAATCT CTATTTATCC GGAATACAAT GGGTTAAAAG AGGCAATTGC TAATCATTTA AATTCTTCCG AGATTGCAAA TCCAATTAAA TCTAACCAAA TAGGATTATT CAATGGAGTA GATGCTGCGC TCCATGCGAT ATTCCATGCA TATGGAAATC GAAAAGATGC CTTTTTAACA ACCACCCCAA CCTTTGGTTA CTACCACCCT TGTGCCTGCA TGCAAGGAAT GGAGATCATT GAAATACCTT ATGAGCAAAA TAGCTTTGAA TTCCCATTCA ATAGAATTTA TAAAGCATTA ATTGAAAAAA ACCCAAAACT ACTAATTATT TGTAACCCCA ATAATCCTAC CGGTACAAAT CTTTCAGCAG AGAGAATCAT TCAATTAGCA AAGGCCTCTC CTGAGACGTT AATAGTTATT GACGAACTTT ATGAGGCTTT CTTGGGAGAT AGTGTAATTC CCATCGTTAA CTATGAAAAA ACACCAAATA TTGTTGTCCT GAGATCACTT TCTAAAACAT ATGGATTGGC AGGATTGCGA ATCGGCTTTG CTATTGGGCA TATGGCGGTA GTTAATCGAA TTCAACGAGT AACTGGTCCG TATGATATCA ATAGCTTTGC CGTAACAGCC GCCTTCGCAG CACTTAAGGA CCAAGCTTAC ATAGATGAAT ATATAAGAGA AGTTTTGAGA GCACGAGAGT GGATTAAAAC AAAGTTAAAG GAACATGACG TCAGGCACGT TATACAAAGT GGTAATTATT TCCTACTCTG GCCAAAATCT AATGTCTCAA TAGTAGAGCA ATCTCTTAAG AAGCACGGAG TCTTAGTTAG AAATATGAAC AATAAGCCAC TACTAGAAGG TGCTTTAAGA GTTAGTATTG GTGTATCTAC ACAGATGGAA CAGTTCTGGG AAGCGTTTAA GAAAAGTGAT GAGGTTAAGG CATTAGCTTA A
|
Protein sequence | MESFKTSQFV KLPQARQEVE KMTPYSAPLE GRRELLRLDF NENTIGPSPK VIQAIQNIPA DQISIYPEYN GLKEAIANHL NSSEIANPIK SNQIGLFNGV DAALHAIFHA YGNRKDAFLT TTPTFGYYHP CACMQGMEII EIPYEQNSFE FPFNRIYKAL IEKNPKLLII CNPNNPTGTN LSAERIIQLA KASPETLIVI DELYEAFLGD SVIPIVNYEK TPNIVVLRSL SKTYGLAGLR IGFAIGHMAV VNRIQRVTGP YDINSFAVTA AFAALKDQAY IDEYIREVLR AREWIKTKLK EHDVRHVIQS GNYFLLWPKS NVSIVEQSLK KHGVLVRNMN NKPLLEGALR VSIGVSTQME QFWEAFKKSD EVKALA
|
| |