Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_17841 |
Symbol | |
ID | 4718518 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 1517744 |
End bp | 1519264 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 640079514 |
Product | anthranilate synthase component I/chorismate-binding protein |
Protein accession | YP_001010174 |
Protein GI | 123969316 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.633527 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCAGCT CACAAAAAGA TAGTTTTTTA AAGGCTTACA AAGAAGGTAA AAACTTTATA CCTATAGTTG AAACTTGGCC AGCAGATTTA GAGACTCCAT TATCGACTTG GTTAAAATTA TCTTCAAAAG ATTCCCATGG TGTTTTTCTT GAATCTGTTG AGGGTGGCGA GAATTTGGGT AGGTGGAGTA TTGTTGCTAC TCAACCTCTT TGGGAAGCCG TTTGTTATGG AGAAGAAATA ATTAAAACTT GGAATAATGG CAAAACTGAA ACACATAAAG GTGATCCTTT TGATATTTTG AGAAGTTGGA CAAACGAATA CAAGTCAACC ACGCTTGATG AATTACCCTC AATTGGACAG TTATATGGCT CTTGGGGTTA TGAATTAATA AATCGAATAG AACCAAGCGT TCCAATAAAT GAAAAATTAG AAAACAATAT CCCTTATGGT TCCTGGATGT TTTTTGATCA GATAGTTGTT TTTGATCAAA TAAAAAGATG TATTACTGCA GTGGTTTATG CAGATACAAC TTCTACAAAA GAGTGCGAAA TTGAACTGTT GTACCTAAAC TCAATTTCTA GAATTAAGAA AACTAGAAAT TTAATGAGAG TTCCTCTAAA AGAAAATGAG TTTTTAGATT GGAATGAAAA TGAGAATTTG AATTTAGATC TAGAAAGTAA TTGGGAGAAA AAAGATTTTG AGGATGCAGT TCTCTCTGCA AAAGAATATA TAAGAAAGGG AGATATCTTC CAAATAGTTA TTAGTCAGAG ATTCCAAACT CAAGTCAATA ATGATCCCTT TAATTTATAT AGAAGTCTGA GAATGGTTAA TCCATCTCCA TACATGTCAT TTTTTGATTT TGGCTCATGG TATCTGATAG GTTCAAGTCC TGAAGTCATG GTTAAAGCAG AAAAAAATAA AAATAGTCAG ATCGTTGCAA GCTTAAGACC AATAGCTGGC ACTAGACCTA GAGGTATTGA TAATCAGCAA GACTTGGAAT TAGAAAAGGA ATTATTAAAA GATCCAAAAG AGATAGCTGA GCATGTAATG CTAATTGATC TTGGGAGAAA TGATCTTGGA AGAGTTTGTG AAATTGGTAC TGTCAAGGTC AAGGATTTAA TGGTTATTGA GAAATATTCA CATGTTATGC ATATAGTCAG TCAAGTTGAG GGAATCTTAA AAAATAATGC TGATGTATGG GATTTGCTCA AAGCATCCTT TCCCGCTGGG ACAGTAACTG GCGCTCCAAA AATAAGAGCT ATGCAATTGA TTAAGCACTT TGAAAAAGAT GCTAGAGGAC CTTATGCAGG TGTATACGGA TCTATTGATA TTAATGGCGC ATTAAATACA GCAATTACAA TAAGAACTAT GATAGTAAAA CCCTCAATAG ATGGGAAATA TGATGTTTCA GTGCAAGCAG GAGCTGGAAT AGTTGCTGAT TCTTTTCCTG AAAATGAATA TCAAGAGACG ATAAATAAAG CAAAGGGAAT ACTAAAAGCA CTAGCCTGTT TGGATAAATA A
|
Protein sequence | MISSQKDSFL KAYKEGKNFI PIVETWPADL ETPLSTWLKL SSKDSHGVFL ESVEGGENLG RWSIVATQPL WEAVCYGEEI IKTWNNGKTE THKGDPFDIL RSWTNEYKST TLDELPSIGQ LYGSWGYELI NRIEPSVPIN EKLENNIPYG SWMFFDQIVV FDQIKRCITA VVYADTTSTK ECEIELLYLN SISRIKKTRN LMRVPLKENE FLDWNENENL NLDLESNWEK KDFEDAVLSA KEYIRKGDIF QIVISQRFQT QVNNDPFNLY RSLRMVNPSP YMSFFDFGSW YLIGSSPEVM VKAEKNKNSQ IVASLRPIAG TRPRGIDNQQ DLELEKELLK DPKEIAEHVM LIDLGRNDLG RVCEIGTVKV KDLMVIEKYS HVMHIVSQVE GILKNNADVW DLLKASFPAG TVTGAPKIRA MQLIKHFEKD ARGPYAGVYG SIDINGALNT AITIRTMIVK PSIDGKYDVS VQAGAGIVAD SFPENEYQET INKAKGILKA LACLDK
|
| |