Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_22721 |
Symbol | |
ID | 4778652 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 2006956 |
End bp | 2008476 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640087790 |
Product | anthranilate synthase component I/chorismate-binding protein |
Protein accession | YP_001018272 |
Protein GI | 124023965 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.267297 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCAGCT CTGATCGTGA CCATTTCTTT GAGATGGCTG CTAGTGGTGC CAATTTCATT CCTTTGGCCC ACAGTTGGCC AGCGGATCTG GAGACCCCTC TCACAACTTG GTTAAAAGTT GGGGCAGACC ATCCCCCTGG GGTTTTACTT GAATCGGTCG AAGGGGGTGA AACTCTTGGG CGCTGGAGTG TGGTTGCCTG CAATCCACTT TGGACTGCCA CATGCCGAGG GAAACACCTC ACACGTCGTT GGCGAGAAGG ACGAACAGAT GAAGCCATCG GCAACCCTTT TGAAGGCCTC AGGCAATGGC TAGCTCCTTA TCGCACCGCA ACCCTTCCAG GCCTACCCCC CCTTGGTCAG CTCTATGGAA TGTGGGGTTT TGAACTGATC AAGTGGGTTG AACCCACAGT GCCCGTTCAC TTAAGGGACA ACAACGATCC GCCTGATGGC ATCTGGATGC TGATGGACAG CATCTTGATC ATTGATCAAG TCAAACGCCT CATCACTGCC GTTGCATACG CAGACCTGAG TGGCGAGCAA ACGGCTAACG AAGCTTGGGA CAAGGCACAA GCACGCATTC AAGACCTAGA AAAGTGCATG GCGGAACCAC TTGCACCGAT TCAGCCACTG AAATGGCAAC CAAAAGGTCA ATCTCCACCT TCCACCATCA GTAACTACAG CCAAAAAGGC TTTGAGGAGG CAGTTCAAAC GGCCAAGCAA CACATCGCCG CAGGGGATGT GTTCCAGCTT GTGATCAGTC AAAGGCTGGA GACCAGAGTT CCTCAACAGC CACTTGAGCT CTACCGAAGT CTGCGGATGG TGAATCCTTC TCCATATATG GCTTTCTTTG ACTTCGGCGA CTGGCAGCTG ATTGGCTCAA GCCCGGAGGT CATGGTCAAG GCGGAGCCAG TCGTCGATGG CATTAAGGCC AGCCTTCGGC CTATTGCCGG CACGCGTCCG CGTGGCGGCA ACGAACTTGA GGACCGCAAT CTTGAAGCAG AGTTGATGGC AGATCCCAAG GAACGTGCCG AGCATGTGAT GTTGGTTGAT CTTGGCCGCA ATGACCTTGG ACGCGTTTGC AGGCCGGGCA GTGTGACGGT GAAAGAGCTG ATGGTGATCG AGAAATATTC CCACGTCATG CACATCGTCA GTGCAGTGGA AGGTGTGCTT GCCAAAGGCA AGGATGTTTG GGATCTACTC ATGGCCTCAT TCCCAGCAGG CACGGTCAGT GGCGCCCCAA AAATCAGAGC CATGCAGCTC ATTCATGACC TCGAACCCGA CTCACGAGGA CCTTATTCAG GTGTTTATGG GTCCATCGAT CTCAATGGTG CCCTGAATAC AGCTATAACC ATTCGCACAA TGATTGTGCG GCCCCATCCT GAAGGCGGCT GGCAAGTCAA GGTTCAAGCA GGCGCTGGTG TGGTGGCCGA TTCCATCCCC ACCAAGGAAT ACGAAGAGAC CCTCAACAAG GCAAGGGGAA TGCTCACAGC CCTGGCCTGC CTCGAGTCCC ACAAGTCATG A
|
Protein sequence | MLSSDRDHFF EMAASGANFI PLAHSWPADL ETPLTTWLKV GADHPPGVLL ESVEGGETLG RWSVVACNPL WTATCRGKHL TRRWREGRTD EAIGNPFEGL RQWLAPYRTA TLPGLPPLGQ LYGMWGFELI KWVEPTVPVH LRDNNDPPDG IWMLMDSILI IDQVKRLITA VAYADLSGEQ TANEAWDKAQ ARIQDLEKCM AEPLAPIQPL KWQPKGQSPP STISNYSQKG FEEAVQTAKQ HIAAGDVFQL VISQRLETRV PQQPLELYRS LRMVNPSPYM AFFDFGDWQL IGSSPEVMVK AEPVVDGIKA SLRPIAGTRP RGGNELEDRN LEAELMADPK ERAEHVMLVD LGRNDLGRVC RPGSVTVKEL MVIEKYSHVM HIVSAVEGVL AKGKDVWDLL MASFPAGTVS GAPKIRAMQL IHDLEPDSRG PYSGVYGSID LNGALNTAIT IRTMIVRPHP EGGWQVKVQA GAGVVADSIP TKEYEETLNK ARGMLTALAC LESHKS
|
| |