Gene Information |
Locus tag | P9301_17681 |
Symbol | |
ID | 4911892 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | - |
Start bp | 1491404 |
End bp | 1492924 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 640161369 |
Product | anthranilate synthase component I/chorismate-binding protein |
Protein accession | YP_001091992 |
Protein GI | 126697106 |
COG category | [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | |
|
|
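The length fields in the table above can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions on the replicon and that the unspliced CDS ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1491404
end_bp = 1492924

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: an unspliced CDS has one codon per residue plus one stop codon.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(gene_length_bp)           # 1521, matches "Gene Length"
print(remainder)                # 0, the CDS is a whole number of codons
print(expected_protein_length)  # 506, matches "Protein Length"
```

Under these assumptions the reported gene length (1521 bp) and protein length (506 aa) are consistent with the listed coordinates.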
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCAGCT CACAGAAAAT TAGTTTTTTA AAGGCTTATA AAGAAGGTAA AAATTTTATA CCTATAGTAG AAACTTGGCC AGCTGATTTG GAGACTCCAT TATCAACTTG GTTAAAATTA TCTTCAAAAG ATTCCCATGG TGTTTTTCTT GAATCTGTTG AAGGTGGGGA GAATTTGGGC AGGTGGAGTA TTGTTGCTAC TAAACCTCTT TGGGAGGCAG TTTGTTATGG AGAAGAAATA GTTAAAACTT GGAATAATGG CAAAACTGAA ACACATAAAG GTGATCCTTT TGATATTTTA AAAAGTTGGA CAATGGAATA TAAGTCAACC ATGCTTGAAG AATTACCATC AATTGGACAG TTATATGGCT CTTGGGGTTA TGAATTAATA AATCGAATAG AGCCAAGCGT TCCAATAAAT GAAATGCCAG AAAACAATAT CCCTCAAGGT TCCTGGATGT TTTTTGATCA ATTAGTTGTT TTTGATCAAA TAAAAAGATG TATTACTGCG GTGGTTTATG CAGATACAAC TTCTTCTCAA GAGTCTTCAA TTGAAGAGTT GTATCTAAAC TCAATATCTA AAATTCAGGA AACTAGAAAT TTAATGAGAG TTCCTCTAAA AGAAAATGAG TTTTTAGATT GGAATGAAAA TGATAATTTA AATTTAGATT TAGAAAGTAA TTGGGAGAAA AAAGATTTTG AGGATGCAGT TCTCTCTGCA AAAGAATACA TAAGAAAGGG AGATATCTTC CAAATAGTTA TAAGTCAAAG ATTTCAAACT CAAGTCAATA ATGATCCCTT TAACTTATAT AGAAGTCTGA GGATGGTTAA TCCATCTCCT TACATGTCAT TTTTTGATTT CGGCTCATGG TATCTGATAG GTTCAAGTCC TGAAGTAATG GTTAAAGCAG AAAAAAATAA AAATAGTCAG ATTGTTGCAA GCTTAAGACC AATAGCTGGC ACGAGACCAA GAGGTATTGA TAATCAACAA GACTTGGAAT TAGAAAAGGA ATTATTAAAG GATCCCAAAG AGATAGCTGA GCATGTAATG TTGATTGATC TTGGAAGAAA TGATCTTGGA AGAGTTTGTG AAATTGGTAC TGTCAAGGTA AAGGATTTAA TGATTATTGA GAAATATTCA CATGTTATGC ATATAGTCAG TCAAGTTGAG GGAATCTTAA AAAATAATGC TGATGTATGG GATTTGCTCA AAGCATCCTT TCCTGCTGGG ACTGTGACTG GCGCCCCAAA AATAAGAGCT ATGCAATTGA TTAAGCATTT TGAAAAAGAT GCTAGAGGAC CTTATGCTGG TGTATACGGA TCTATTGATA TTAATGGTGC ATTAAATACA GCAATTACGA TAAGAACCAT GATAGTTAAA CCCTCAAGAG ATGGGAAATA TGATGTGTCA GTGCAAGCAG GAGCTGGAAT AGTTGCTGAT TCTTTTCCTG AAAATGAATA TCAAGAGACG ATAAATAAAG CAAAGGGAAT ACTTAAAGCA TTAGCCTGTT TGGATAAATA A
|
Protein sequence | MISSQKISFL KAYKEGKNFI PIVETWPADL ETPLSTWLKL SSKDSHGVFL ESVEGGENLG RWSIVATKPL WEAVCYGEEI VKTWNNGKTE THKGDPFDIL KSWTMEYKST MLEELPSIGQ LYGSWGYELI NRIEPSVPIN EMPENNIPQG SWMFFDQLVV FDQIKRCITA VVYADTTSSQ ESSIEELYLN SISKIQETRN LMRVPLKENE FLDWNENDNL NLDLESNWEK KDFEDAVLSA KEYIRKGDIF QIVISQRFQT QVNNDPFNLY RSLRMVNPSP YMSFFDFGSW YLIGSSPEVM VKAEKNKNSQ IVASLRPIAG TRPRGIDNQQ DLELEKELLK DPKEIAEHVM LIDLGRNDLG RVCEIGTVKV KDLMIIEKYS HVMHIVSQVE GILKNNADVW DLLKASFPAG TVTGAPKIRA MQLIKHFEKD ARGPYAGVYG SIDINGALNT AITIRTMIVK PSRDGKYDVS VQAGAGIVAD SFPENEYQET INKAKGILKA LACLDK
|
| |
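The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a block-formatted gene sequence could be cleaned, translated, and checked for GC content. The helper names (`clean_sequence`, `translate`, `gc_percent`) are hypothetical. The codon table is the standard one, whose amino acid assignments translation table 11 shares (the two differ in permitted start codons). Only the first three blocks of the gene sequence are used here for brevity.

```python
# Standard amino acid assignments, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(blocks: str) -> str:
    """Remove whitespace between blocks and upper-case the sequence."""
    return "".join(blocks.split()).upper()


def translate(cds: str) -> str:
    """Translate a coding sequence 5'->3', stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence, as printed in the record.
prefix = clean_sequence("ATGATCAGCT CACAGAAAAT TAGTTTTTTA")
print(translate(prefix))  # MISSQKISFL, the first block of the protein sequence
```

Applying `translate` and `gc_percent` to the full cleaned gene sequence would allow a comparison against the listed protein sequence and the reported GC content. The gene sequence is given on the coding strand (it begins with ATG and ends with TAA), so no reverse complement is needed despite the minus-strand annotation.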