Gene P9303_14321 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_14321 
SymbolaspC 
ID4778705 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1226544 
End bp1227722 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content52% 
IMG OID640086941 
Productaminotransferases class-I 
Protein accessionYP_001017443 
Protein GI124023136 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCACTCC CGCCTCATCT TTCCGACCGA GTCGTTGCCC TTCAGCCCTC ACTCACACTG 
GCAATCAGTG CTCGAGCAAA GGCTCTTCAG CAAGAAGGCC GCGACATTTG CAGCATGAGT
GCTGGTGAGC CGGATTTCAA TACCCCTGAA TTCATCATTG ATGCCACGGT GAAGGCACTC
CGTGATGGCA TCACCCGTTA TGGCCCTGCC GCTGGAGACC CTGAACTGCG TGAGGCAATA
GCCACCAAGC TCAGCAAAGA AAACACTGTG CCAACCAATG CAGAGCAAGT GTTGGTGACC
AATGGAGGCA AGCAAGCAAT CTTTAACTTG TTTCAGGTGA TCCTCAATCC AGGCGATGAG
GTTTTAATCC CTGCTCCTTA TTGGCTGAGT TATCCAGAAA TGGCCCGCTT AGCCGGTGCA
AAGGTGACAA CACTTCCCTC CACTCCAGAA AACGGTTTCT GTCTAGATCT CAACAACCTA
GAAGCTTCCA TCGGCTCAAA AACCCGTCTG TTAATACTTA ATTCCCCGGG CAACCCAACC
GGTCGTGTGA TGGCACGCAA GGAGCTGGAA GCTTTGGCTG ATCTGCTAAG AAATTATCCC
CAGATCCTTG TCATGAGTGA TGAGATCTAC GAGTTCATTC TTGAAGACGG GCAACAGCAT
CACAGCTTCT CTGCTATAGC ACCAGATCTT TCAGACAGAA CCTTCATCGT TAACGGCTTT
GCCAAGGGCT GGGCAATGAC TGGTTGGCGG TTGGGTTATC TAGCCGGCCC CGCTCATGCA
GTGAAAGCGG CCACTGCCCT CCAAAGCCAG AGCACGAGCA ATGTCTGCAG TTTCGCTCAG
CGTGGAGCCT TGGCCGCGCT GCAAGGCTCA AGGGAGTGTG TGAAGAAGAT GGTTAATAGC
TACAACACCC GACGCGAACT CCTCGCCTCT GGCTTGCTTG GCCTTGAAGG GATCAGCCTG
ATCTCTCCAA AAGGTGCGTT TTATGCCTTC CCAAAACTAC CTGAAGGAAG CCTCGACTCA
GTAAGTTTCT GTCAGCAAGC TCTTGAAAAC TATGGGCTTG CCATGGTTCC AGGTGCCGCA
TTCGGAGACG ACAGTTGCAT ACGCCTCACT TGTGCTGTGT CACATAAGAC GATTTGCGAT
GGACTAGAAC GTCTCCGCAA AGCTCTAAAA CAGAGCTAA
 
Protein sequence
MPLPPHLSDR VVALQPSLTL AISARAKALQ QEGRDICSMS AGEPDFNTPE FIIDATVKAL 
RDGITRYGPA AGDPELREAI ATKLSKENTV PTNAEQVLVT NGGKQAIFNL FQVILNPGDE
VLIPAPYWLS YPEMARLAGA KVTTLPSTPE NGFCLDLNNL EASIGSKTRL LILNSPGNPT
GRVMARKELE ALADLLRNYP QILVMSDEIY EFILEDGQQH HSFSAIAPDL SDRTFIVNGF
AKGWAMTGWR LGYLAGPAHA VKAATALQSQ STSNVCSFAQ RGALAALQGS RECVKKMVNS
YNTRRELLAS GLLGLEGISL ISPKGAFYAF PKLPEGSLDS VSFCQQALEN YGLAMVPGAA
FGDDSCIRLT CAVSHKTICD GLERLRKALK QS