Gene Information |
Locus tag | NATL1_01951 |
Symbol | |
ID | 4780838 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 181234 |
End bp | 182202 |
Gene Length | 969 bp |
Protein Length | 322 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640083459 |
Product | O-acetylserine (thiol)-lyase A |
Protein accession | YP_001014024 |
Protein GI | 124024908 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0031] Cysteine synthase |
TIGRFAM ID | [TIGR01136] cysteine synthases; [TIGR01139] cysteine synthase A |
|
|
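The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length; neither convention is stated in the record itself. The function names are hypothetical and exist only for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 181234
END_BP = 182202


def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count, assuming the CDS ends with exactly one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 969 322
```

Under these assumptions the result agrees with the listed Gene Length (969 bp) and Protein Length (322 aa).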
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.958038 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.621094 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCATTG CTAATGACAT AACTTATTTA GTTGGCCAAA CACCTTTGGT CAAACTGAAT CGATTACCTA ATGAATTTAA TTGTAGGGCT GAAATTATAG CCAAATTAGA AAGTTTCAAC CCCACAGCAT CCGTAAAAGA CCGCATAGCT GGAGCAATGG TGAAATCAGC AGAAAAAGAG GGCACTATCA AACCTGGACA TACTGTTCTT GTTGAACCAA CTAGCGGAAA CACAGGAATT GCATTAGCGA TGGTTGCGGC AGCAAAAGGT TATCGGCTCA TACTTACAAT GCCTGACACA ATGAGTACTG AACGACGCTC AATGTTAAGA GCATTTGGAG CAGAGCTTCA ACTAACTCCT GGCCAAGACG GTATCCAAGG CGCTATTCAG CTAGCAAAGG AATTGGTTGC TTCTGTACCT AATGCATACT TGCTTCAGCA ATTCGATAAT TTATCCAATC CTGAAATTCA TGAAAAAACA ACCGCTGAAG AAATATGGGA AGATTGCGAA GGCAAACTTG ATGCATTAAT TGCAGGAGTA GGAACAGGAG GAACAATCAC AGGTTGTGCC AGATTCTTAA AACAAAAAAA TCCAAAAATC AAGGTTTTTG CAGTAGAACC TTCATCAAGC CCTGTTCTGT CAGGAGGGAA TCCTGGATCT CATGCGATAC AAGGCATTGG GGCTGGTTTC ATTCCTAATG TATTAGATAT GAATCAGATT GATGAAGTCA TAAGAATCAA TGATAATGAA GCAATGGACA TAGGCAGAAG ACTTGCCAAA GAAGAGGGAT TGCTAAGTGG TGTCAGCAGT GGTGCTGCAG TAGCCGCTGC TTTAAAAGTA GGAAATCAAC CTGAGTTTGC GAATAAACGC TTGATTGTTA TTCTGCCTAG TTTTGGTGAA AGATATCTAT CAACAACAAT GTTTACTTCT ATCCCCGCAA AGCCAGTAAA AGGGAATGAG TTTCTCTAA
|
Protein sequence | MPIANDITYL VGQTPLVKLN RLPNEFNCRA EIIAKLESFN PTASVKDRIA GAMVKSAEKE GTIKPGHTVL VEPTSGNTGI ALAMVAAAKG YRLILTMPDT MSTERRSMLR AFGAELQLTP GQDGIQGAIQ LAKELVASVP NAYLLQQFDN LSNPEIHEKT TAEEIWEDCE GKLDALIAGV GTGGTITGCA RFLKQKNPKI KVFAVEPSSS PVLSGGNPGS HAIQGIGAGF IPNVLDMNQI DEVIRINDNE AMDIGRRLAK EEGLLSGVSS GAAVAAALKV GNQPEFANKR LIVILPSFGE RYLSTTMFTS IPAKPVKGNE FL
|
| |
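The relationship between the gene sequence and the protein sequence above can be reproduced with a short translation sketch. It assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TAA), and it uses the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, not part of any database tool, and ambiguous bases such as N are not handled.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G,
# with the first codon position varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # KeyError on non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence in this record.
head = "ATGCCCATTG CTAATGACAT AACTTATTTA"
print(translate(head))  # MPIANDITYL
```

The printed residues match the first block of the protein sequence in the record. Passing the full gene sequence to `translate` and `gc_fraction` is one way to check the listed protein sequence and GC content.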