Gene P9211_02041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_02041 
Symbol 
ID5731799 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp193203 
End bp194333 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content38% 
IMG OID641284548 
Productaminotransferase class-I 
Protein accessionYP_001550089 
Protein GI159902745 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01141] histidinol-phosphate aminotransferase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAATCAT TCAAGACATC CCAATTTGTA AAACTCCCTC AAGCTCGACA AGAGGTAGAA 
AAAATGACTC CTTACTCTGC TCCATTAGAA GGTCGTCGAG AGCTTCTAAG GCTGGACTTT
AATGAAAATA CTATTGGTCC TAGCCCTAAA GTTATACAGG CCATTCAGAA TATCCCTGCA
GATCAAATCT CTATTTATCC GGAATACAAT GGGTTAAAAG AGGCAATTGC TAATCATTTA
AATTCTTCCG AGATTGCAAA TCCAATTAAA TCTAACCAAA TAGGATTATT CAATGGAGTA
GATGCTGCGC TCCATGCGAT ATTCCATGCA TATGGAAATC GAAAAGATGC CTTTTTAACA
ACCACCCCAA CCTTTGGTTA CTACCACCCT TGTGCCTGCA TGCAAGGAAT GGAGATCATT
GAAATACCTT ATGAGCAAAA TAGCTTTGAA TTCCCATTCA ATAGAATTTA TAAAGCATTA
ATTGAAAAAA ACCCAAAACT ACTAATTATT TGTAACCCCA ATAATCCTAC CGGTACAAAT
CTTTCAGCAG AGAGAATCAT TCAATTAGCA AAGGCCTCTC CTGAGACGTT AATAGTTATT
GACGAACTTT ATGAGGCTTT CTTGGGAGAT AGTGTAATTC CCATCGTTAA CTATGAAAAA
ACACCAAATA TTGTTGTCCT GAGATCACTT TCTAAAACAT ATGGATTGGC AGGATTGCGA
ATCGGCTTTG CTATTGGGCA TATGGCGGTA GTTAATCGAA TTCAACGAGT AACTGGTCCG
TATGATATCA ATAGCTTTGC CGTAACAGCC GCCTTCGCAG CACTTAAGGA CCAAGCTTAC
ATAGATGAAT ATATAAGAGA AGTTTTGAGA GCACGAGAGT GGATTAAAAC AAAGTTAAAG
GAACATGACG TCAGGCACGT TATACAAAGT GGTAATTATT TCCTACTCTG GCCAAAATCT
AATGTCTCAA TAGTAGAGCA ATCTCTTAAG AAGCACGGAG TCTTAGTTAG AAATATGAAC
AATAAGCCAC TACTAGAAGG TGCTTTAAGA GTTAGTATTG GTGTATCTAC ACAGATGGAA
CAGTTCTGGG AAGCGTTTAA GAAAAGTGAT GAGGTTAAGG CATTAGCTTA A
 
Protein sequence
MESFKTSQFV KLPQARQEVE KMTPYSAPLE GRRELLRLDF NENTIGPSPK VIQAIQNIPA 
DQISIYPEYN GLKEAIANHL NSSEIANPIK SNQIGLFNGV DAALHAIFHA YGNRKDAFLT
TTPTFGYYHP CACMQGMEII EIPYEQNSFE FPFNRIYKAL IEKNPKLLII CNPNNPTGTN
LSAERIIQLA KASPETLIVI DELYEAFLGD SVIPIVNYEK TPNIVVLRSL SKTYGLAGLR
IGFAIGHMAV VNRIQRVTGP YDINSFAVTA AFAALKDQAY IDEYIREVLR AREWIKTKLK
EHDVRHVIQS GNYFLLWPKS NVSIVEQSLK KHGVLVRNMN NKPLLEGALR VSIGVSTQME
QFWEAFKKSD EVKALA