Gene Information |
Locus tag | P9303_04021 |
Symbol | aspA |
ID | 4776812 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 403795 |
End bp | 404766 |
Gene Length | 972 bp |
Protein Length | 323 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640085905 |
Product | aspartoacylase |
Protein accession | YP_001016419 |
Protein GI | 124022112 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2988] Succinylglutamate desuccinylase |
TIGRFAM ID | |
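The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive (the record does not state the convention) and that the gene length includes the terminating stop codon; the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 403795
end_bp = 404766

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 972 323
```

Under these assumptions the result agrees with the listed Gene Length (972 bp) and Protein Length (323 aa).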
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGTTGTTC CCTTTGCAGC CATTTTCGGT CAGGCTGAGG ATTGCGATCG AATCTTGATG TCTGGCCTTC AGGTGCTTCT GGTGGCGGGT ACTCATGGCA ATGAAATCAA CGCTCCCTGG CTTCTCGATC AATGGTCTCA GACCCCTGAG CTGATCAACA CCCATGGTGT TGCTGTGGTG CCGGTGATCG GCAATCCAGA GGCTTTTGCG TTGGGCAGGC GTTATTTGGA TTGCGATCTC AACCGCAGCT TTCGGCTCGA CTTGCTCAGA TCCCCCAGCA TCTTGGATCG AGAGGTTGTT CGTGCCAAGC AGCTGCTTAG TTTTTTTGGT CCAGAGGGAT CAACCCCTTG CCAGATCGTT ATTGATTTGC ACAGCACCAC TTCAGCAATG GGAAGCACTC TTGTGGTTTA TGGCCGACGG CCGGCTGACC TGGCTTTAGC CGCATTGATT CAGGCTCGCT TGGGTTTGCC TATCTATTTG CATGATGGGG ATGATGACCA GCAGGGGTTT TTGGTGGAGC GCTGGCCCTG TGGCTTAGTG ATTGAAATCG GTCCTGTTCC CCAGGGCCTC TTGAAGGCCT GCATCATTGA ACAGACACGA CTTGCTGTTC AGGCCTGTCT CGAGGCTCTG AGCAGCATTT CATCTGGATC GCCGACCTAC CCAGATCAGT TTGTGGTGCA CTCTCATCTG GGCAGTCTGG ACCTGCCCCG TGATGGCTTG GGCCAGCCTG CTGCATGTGT GCATCCATAT CTTCAGGGCC GTGATTGGCA GCCTCTGCAG ATGGGTGCCC CCTTGTTTCT CCGGCCAGAT GGAGAGGTGT TCAGATTTGA AGGACGGGAT TCTCCTATCC CCGTTTTTAT CAACGAGGCG GCCTACGTTG AAAAGCACAT CGCCATGAGC CTGACCTGTC GAGAGGTCTG CCCCCTGCCT GAGCAATGGC AAGGGGCCCT GCAGCAGTTA GTCGACTGTT AA
Protein sequence | MVVPFAAIFG QAEDCDRILM SGLQVLLVAG THGNEINAPW LLDQWSQTPE LINTHGVAVV PVIGNPEAFA LGRRYLDCDL NRSFRLDLLR SPSILDREVV RAKQLLSFFG PEGSTPCQIV IDLHSTTSAM GSTLVVYGRR PADLALAALI QARLGLPIYL HDGDDDQQGF LVERWPCGLV IEIGPVPQGL LKACIIEQTR LAVQACLEAL SSISSGSPTY PDQFVVHSHL GSLDLPRDGL GQPAACVHPY LQGRDWQPLQ MGAPLFLRPD GEVFRFEGRD SPIPVFINEA AYVEKHIAMS LTCREVCPLP EQWQGALQQL VDC
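The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the standard amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in which start codons are permitted, not in how codons are read). The names `CODON_TABLE` and `translate` are hypothetical helpers written for this example, not part of any database tool.

```python
# Standard amino-acid assignments, with codons ordered by base T, C, A, G.
# Assumption: translation table 11 uses these same assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Drop the spaces that separate the 10-base blocks in the record.
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
first_30 = "ATGGTTGTTC CCTTTGCAGC CATTTTCGGT"
print(translate(first_30))  # MVVPFAAIFG
```

The output matches the first ten residues of the listed protein sequence. Passing the last twelve bases (`GTCGACTGTT AA`) gives `VDC`, the final three residues, with the trailing `TAA` read as the stop codon.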