Gene P9301_02181 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_02181 
Symbol 
ID4911904 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp202752 
End bp203876 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content31% 
IMG OID640159784 
Productaminotransferases class-I 
Protein accessionYP_001090442 
Protein GI126695556 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAAAT CCGAAACTAA TGATTACTCA ACATTAGCCA TGCAAATATC AGACTTAAAG 
CATGGAGGAA ATGTATATGC AAATGCAAAA AAATTAAATT TATTACCCTC TGAAATCATT
GACGCAAGTG CCTCGTTAGT ACCCTTTGAT CCACCTCAAA TACTAATAGA TTCATTAAAT
GCGGGAATTA AGAATCTTGG ATTTAGATAT TACCCAGAGA GAAACTTGAG TGATCTGAAA
GAAATAATCG GTAAATTTCA TGGGATAAAT CCAGATAATA TATTGCCTGG AAATGGAGCT
TCTGAGCTAA TAACCTGGGC AGGTTATGAA GCATCCAAAT TTGGAATAAG TTGTATTCCT
TCTCCATCAT TTGTTGATTA TGAAAGATCT TTAAATTGTT GGAATAGCAA TTTTGTACAT
TGCGAATTAC CAAAAAACTG GAATGATATT TTTCCTCAAT CATTCCCGCT TCATCCAAAA
GGTGATGTTA TTTGGATAAC AAATCCACAT AACCCTACCG GTCAATTATG GGAAAAGAAT
TCATTGGAGG AACTTGTAAA AAAATATAAA TTAGTTATCT GTGATGAAGC TTTCTTATCG
ATAACACCTA ATGGAGACAA AGAATCTTTA ATACCATTAA CCCAAAGATT TGATAATTTA
TTAGTCTTGA GAAGCTTGAC TAAAATCTTC AATATTCCTG GTCTTAGATT AGGTTACGTT
ATTGGTTCAT CGAAAAAACT TAAGCAATGG GAAATAAAAA GAGATCCTTG GCCTTTAAAT
TCATTTGCTA TTAAAGCCGG AATTGATCTA CTAAGTAATA AGAAATTCTA TGAACAATGG
ACAAAACAGA TTCACAGCTG GATAAATATT GAAAAAAAGA GAGTATTTGA AAAATTATCA
AAAATAGAGA GTCTTAAAGT TCATAACTCT TCAACCAACT TTTTTTTAGT AGAAAGTAAA
ACATCCTTGT CGCCAAATAT CAAATACTTA GAAAATAAGG GAATATTGCT TAGAGAATGC
ACTTCATTTA GATTTCTTGA CGAAAAGTGG GCAAGAATAA GTTTGCAGAA CAGCAAAAAT
AACACTCTTT TATGTGAAGA AATTCAGAAT TCCTTCAAAA AATAA
 
Protein sequence
MNKSETNDYS TLAMQISDLK HGGNVYANAK KLNLLPSEII DASASLVPFD PPQILIDSLN 
AGIKNLGFRY YPERNLSDLK EIIGKFHGIN PDNILPGNGA SELITWAGYE ASKFGISCIP
SPSFVDYERS LNCWNSNFVH CELPKNWNDI FPQSFPLHPK GDVIWITNPH NPTGQLWEKN
SLEELVKKYK LVICDEAFLS ITPNGDKESL IPLTQRFDNL LVLRSLTKIF NIPGLRLGYV
IGSSKKLKQW EIKRDPWPLN SFAIKAGIDL LSNKKFYEQW TKQIHSWINI EKKRVFEKLS
KIESLKVHNS STNFFLVESK TSLSPNIKYL ENKGILLREC TSFRFLDEKW ARISLQNSKN
NTLLCEEIQN SFKK