Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_04581 |
Symbol | |
ID | 4779692 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 419750 |
End bp | 420736 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640083735 |
Product | O-acetylserine (thiol)-lyase A |
Protein accession | YP_001014287 |
Protein GI | 124025171 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0031] Cysteine synthase |
TIGRFAM ID | [TIGR01136] cysteine synthases [TIGR01139] cysteine synthase A |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.795061 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCAGAA TCTATGAAGA CAATAGTTTC GCTATTGGTC ATACCCCATT AGTCAAGCTT AATTCAATTA CAAAAAACGC AAAAGCAACA GTATTGGCAA AGATAGAAGG CCGTAACCCC GCCTACAGCG TTAAGTGCAG AATAGGAGCG AACATGATCT GGGATGCTGA GAAGAAAGGT TTACTCAATA AAGATAAAGT CATTATTGAA CCAACTTCTG GAAATACTGG AATAGCCCTT GCTTACACAG CTGCTGCTCG AGGTTACAAG TTAATCCTCA CGATGCCTGA GTCCATGTCA ATTGAACGTC GGAGGATGAT GGCTGTTCTT GGTGCTGAAT TAATACTAAC TGAAGCAGCA AAAGGAATGC CTGGTGCCAT TGCAAAAGCG AAGGAGATAG CTGATGGCGA CCCTCAAAAA TACTTCATGC CAGGCCAATT TGATAATCCA GCTAATCCAG AAATACATTT CAAAACAACA GGTCCTGAAA TTTGGGATGA TACCGATGGC CAAATTGATG TTTTAGTCTC TGGCGTCGGG ACAGGTGGAA CAATAACAGG TGTCTCTCGT TTTATAAAAC AAGAAAAAAA TCATTCATTA TTATCTGTTG CCGTTGAGCC CACTCATAGC CCAGTAATAA CACAAACTCT CAATGGAGAA GAGGTAAAGC CCGGCCCTCA CAAGATTCAA GGGATCGGAG CTGGGTTTAT CCCTAAAAAT CTTGACTTAT CAGTAGTCGA TCAAGTTGAG CAAGTCTCTA ATGATGAGTC TATTGCAATG GCATTACGTT TAGCACAAGA AGAAGGTCTA CTAGTTGGAA TCTCATGTGG TGCCGCTGCA GCTGTGGCTT TAAGACTTGC CGAAAAAGAA GAGTTCGCAG GGAAGACAAT CGTTGTGGTT CTTCCTGACC TTGCAGAAAG ATATATCTCA TCCGTAATGT TTGAGAATGT TCCTACAGGC GTTATTAAAG AACCATCATT AGCTTGA
|
Protein sequence | MSRIYEDNSF AIGHTPLVKL NSITKNAKAT VLAKIEGRNP AYSVKCRIGA NMIWDAEKKG LLNKDKVIIE PTSGNTGIAL AYTAAARGYK LILTMPESMS IERRRMMAVL GAELILTEAA KGMPGAIAKA KEIADGDPQK YFMPGQFDNP ANPEIHFKTT GPEIWDDTDG QIDVLVSGVG TGGTITGVSR FIKQEKNHSL LSVAVEPTHS PVITQTLNGE EVKPGPHKIQ GIGAGFIPKN LDLSVVDQVE QVSNDESIAM ALRLAQEEGL LVGISCGAAA AVALRLAEKE EFAGKTIVVV LPDLAERYIS SVMFENVPTG VIKEPSLA
|
| |