Gene A9601_02041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_02041 
Symbol 
ID4716888 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp186926 
End bp188035 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content27% 
IMG OID640077903 
Productaminotransferases class-I 
Protein accessionYP_001008599 
Protein GI123967741 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01141] histidinol-phosphate aminotransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0980416 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGATT TTGATCCATT AAATAATCTT TTCCCAAAAC CAAGAGAAGA AATAATAAAT 
ATGCAGTCTT ACTCTGCACC TTTAGAAAAT AGAAGAAATT TACTCCGCTT AGACTTTAAT
GAAAATACTT TAGGTCCAAG TCCTAAGGTT CTAGAGGCAT TAAAAGCGAT AAAATTAGAT
GAGATTTCAA TTTATCCAGA ATATAATTTT TTAAAAAAAT ATTTATGTGA TAAATATCTT
GATTCAAGAA AATTTGGTAA TGATGAAATC GGAATTTTCA ATGGAGCAGA TGCAGCAATA
AATGCAATTT TCAATTGCTT TGGAGAAAAA GATCAAATAT TTCTAACCAC AAATCCAACT
TTTGGTTACT ATTCTCCTTG TGCAGAAATC CGAGGAATGA AAAAAATAAG TTGTTCTTAC
ATTGGAGAAA ATTTTCTATT CCCCATCGAA GAATTTAGGG AAAAAATAAT AAAGCATAAT
CCAAAGTTAA TATTTATTTG CAATCCAAAT AATCCAACAG GAACTGTTCT AAGCTCTCAT
GAAATAATTA ATTTAGCCAA TATCAATAAA GATTCATTAA TAGTTGTTGA TGAACTATAT
GAAAAATTTA ATGGAGATAG TCTTCTTAAA TCGATAGATT TTGAAAAAAA TAAAAATATA
CTAATAATAC AATCTCTTTC AAAAACTGCA GGTCTAGCTG GTTTAAGAAT AGGTTTTACT
TTTGGCAATA AAAGTTTAAT TCAGTACATT AATAAAGTTA CAGGACCATA TGATGTAAAC
AGCTTTGCTA TAACAGCTGC ATTAGCAGCA CTTAAAGACA AATCATATAT TGATAATTAT
GTTTTAGAAG TAAAAAAGGC GAGGGAATGG ATTTTAAATA AATTTAAATC AACAAAAATC
AGAACTCACT TTAGTGGAGG TAATTATTTC TTAATTTGGC CAAAAAAAGA TCCTAAAATC
TTAATACAAC AGATGAGAGC AAAAGGTATT CTTATTAGAA GTATGGAAAA CAAAAAAGAT
ATCAGTAATT CTATAAGGGT TAGTATTGGA ACTAAAGAAC AAATGATTTT TTTCTGGGAC
AATTACAAGA TATTAGATTT AAAAAATTAA
 
Protein sequence
MNDFDPLNNL FPKPREEIIN MQSYSAPLEN RRNLLRLDFN ENTLGPSPKV LEALKAIKLD 
EISIYPEYNF LKKYLCDKYL DSRKFGNDEI GIFNGADAAI NAIFNCFGEK DQIFLTTNPT
FGYYSPCAEI RGMKKISCSY IGENFLFPIE EFREKIIKHN PKLIFICNPN NPTGTVLSSH
EIINLANINK DSLIVVDELY EKFNGDSLLK SIDFEKNKNI LIIQSLSKTA GLAGLRIGFT
FGNKSLIQYI NKVTGPYDVN SFAITAALAA LKDKSYIDNY VLEVKKAREW ILNKFKSTKI
RTHFSGGNYF LIWPKKDPKI LIQQMRAKGI LIRSMENKKD ISNSIRVSIG TKEQMIFFWD
NYKILDLKN