Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_16971 |
Symbol | |
ID | 5730045 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 1523422 |
End bp | 1524942 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 641286079 |
Product | anthranilate synthase component I/chorismate-binding protein |
Protein accession | YP_001551582 |
Protein GI | 159904238 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.623423 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATATAT CAGATCAGGA TTCATTTTTA GAAGCAGCTT CTAGAGGATT AACTTTTATT CCACTGGTTC ACAGTTGGCC TGCAGACCTT GAGACACCAT TATCAACATG GTTGAAGGTT GGTGAAGGCC ATCCTCCAGG GGTTCTTCTT GAATCTGTAG AGGGAGGCGA AACTCTTGGG AGATGGAGTG TAGTTGCTAC CGATCCTCTT TGGATAGCAA CTGCTAGAGG GAATAGTCTC AAAAGGGAGT GGCGTGACGG ACAATGCGAC GAAATACAAG GCAATCCTTT TGAAGTTATT AGGGAATGGC TTCTTCCATA TCGTACTGAG CCTATTGACG GCTTACCTTG CATAGGTCAA TTGTATGGAA TATGGGGTTA TGAATTGATT CAATGGGTAG AGCCAAAAGT TTCAGTGTTT GCAAGGACTA AAAGTGATCC TCCTGATGGA GTGTGGATGT TTATGGATAG AGTTTTAATT TTTGATCAGG TTAAAAGAGT GATTAATGCA GTCTCGTATG GAGATTTGAC ATGCAATGAT CAACCACTTC AGGCATATGA AAAGGCTGCA CAAAGAAGCA AAGATTTGCA AGTTCTTTTG CAATCCCCAC TTCCTTCGCT GAAACCTCTT CAATGGCAAT CAACAACTGA GACTCCAAAC TCAGTAAAGA GTAATACCAC TCAAGTCAAA TTTAAGAATG CAGTTAAATC AGCTAAGGAA TACATAAAAA AAGGAGATAT TTTTCAAATT GTCCTTAGTC AGAAATTAAG AACTCAGGTT CCTAATAAAC CTTTTGAGAT TTATCGAAGT TTGCGCATGG TGAATCCTTC GCCGTTTATG GCTTTTTTTG ATTTTGGTGA TTGGCAACTT ATTGGATCTA GTCCTGAAGT GATGGTTCAA GCAAAGCCCA GTGAAAAAGG TATCTATGCA AGCTTGAGAC CTATTGCAGG TACAAGACCT AGAGGTATCA ATGAAATGGA AGATAAAACA TTAGAACGCG AATTATTATC TGATCCAAAA GAAATAGCAG AGCATGTAAT GCTAGTGGAT TTAGGACGTA ATGATTTGGG CAGGGTTTGT CGATCTGGGA CTGTTGAAGT TAAAGAGTTG ATGGTGATTG AAAAGTATTC TCATGTGATG CATATTGTTA GTGAAGTAGA AGGAATGCTT AGAGAAGATA AGGATGTATG GGATCTACTA ATGGCAGCTT TCCCTGCTGG CACGGTTTCT GGAGCACCTA AGATAAGAGC AATGCAGTTA ATCAATGAAT TAGAAACTCA GCCTCGAGGA CCATATTCAG GGGTATATGG ATCAATGGAT TTAAATGGAG CATTAAATAC AGCAATTACC ATTAGAACTA TGGTTGTATC CTCTCATTCA AACAATATTT CAAATGTGCA AGTTCAAGCA GGTGCAGGTG TAGTTGCTGA CTCAATCCCT GCAAATGAAT TCCAAGAAAC TATGAATAAA GCTAAAGGCT TGCTCACTGC ACTAGGATGT CTTGAGCGGT CTGATTCATG A
|
Protein sequence | MHISDQDSFL EAASRGLTFI PLVHSWPADL ETPLSTWLKV GEGHPPGVLL ESVEGGETLG RWSVVATDPL WIATARGNSL KREWRDGQCD EIQGNPFEVI REWLLPYRTE PIDGLPCIGQ LYGIWGYELI QWVEPKVSVF ARTKSDPPDG VWMFMDRVLI FDQVKRVINA VSYGDLTCND QPLQAYEKAA QRSKDLQVLL QSPLPSLKPL QWQSTTETPN SVKSNTTQVK FKNAVKSAKE YIKKGDIFQI VLSQKLRTQV PNKPFEIYRS LRMVNPSPFM AFFDFGDWQL IGSSPEVMVQ AKPSEKGIYA SLRPIAGTRP RGINEMEDKT LERELLSDPK EIAEHVMLVD LGRNDLGRVC RSGTVEVKEL MVIEKYSHVM HIVSEVEGML REDKDVWDLL MAAFPAGTVS GAPKIRAMQL INELETQPRG PYSGVYGSMD LNGALNTAIT IRTMVVSSHS NNISNVQVQA GAGVVADSIP ANEFQETMNK AKGLLTALGC LERSDS
|
| |