Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_29681 |
Symbol | |
ID | 4778490 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 2621771 |
End bp | 2623186 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640088492 |
Product | putative p-aminobenzoate synthetase |
Protein accession | YP_001018963 |
Protein GI | 124024656 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGACAGC TGCCGCATCC GCGATGCAGC TTTGAGCGAG GCTGGCAGAT CAGATCTCTG CAGCAAGCCG GCACCATGAC TTCTCTGGAG CGGCGCCTTT GCCGCTGGCA AGAACCAGCC ATGTTGGCAA AGCAACTTAC CAACACTTGG GGAGAAGCTG GGCTTATCTG GCTGGATGGA GACGGCAGCG ATCTGGGTCG TTGGGCGACT CTGGCAGTCG ATCCCATTAA TCAGATTTGT TGCCGAGGCA TCCCTGGCGA AAAAGGCGCC AGTAACCCTT TTGCAGCACT GCGTGATCTA GAACCAGGCC ATTGGACAGG CTGGCTCAGC TACGAAGCAG CAGCCTGGAT AGAACCCAAA AATCCCTGGA AAGCAGATTC CATGGCGACT CTGTGGATGG CGCGTCACGA TCCCATCCTG CGATTCGACC TTCAGAAACG CCAACTCTGG ATCGAAGGGT GTCATCCCAA ACGACTGCAA GAACTGGCTA ACTGGCTAGA AGCCAACCCA TCTGAAGATG CACCTAAAAG CAGCAAAGAT GAGCCGTTAC TTGCAACAAC AGACCTCAAA ATTCCAGTCA ATGCATGGGA ATGGCTCACC ACTAGAGCTG ACTATGCACG TGATGTGCAA CAGATCAGAC ATTGGATTGC CAGCGGCGAT ATTTTCCAGG CCAATCTCAG CGCTTGCTGT ACCACCACGA TCCCTTCAGG AAGCTTCGCC GTAGACCTTT TCCTAAAACT GCGTCACCAC AACCCAGCTC CCTTTGCCGG CCTAGTGATA GCTGCAGGCC TTGCAAAAGG TGAAGCCGTG ATCTCCGCCT CTCCAGAGCG TTTCCTAAAA GCACTCCCCA CAGGAGAAGT TGAAACACGA CCAATCAAAG GAACCCGACC ACGCCACCCC AACCAAAGCC AAGACGCTGA TCTGGCAGCT GATCTTGTTT GCAGCAGCAA AGACCGAGCT GAAAACGTGA TGATCGTCGA TCTATTGCGC AACGATCTAG GGCGAGTCTG TCAACCCGGC TCCATCACAG TGCCTCAATT GGTAGGCCTT GAGAGCTATC CCCATGTGCA TCACCTCACC TCAGTAGTGC AAGGACGGCT TCGATCAGAC CAATCTTGGG TAGACCTACT ACAAGCCTGT TGGCCGGGAG GCTCCATCAG CGGTGCCCCC AAGCTGCGCG CCTGTCAGCG GCTGAACGAA CTGGAACCAA CAGCACGCGG GCCCTATTGC GGCTCGTTGT TGCATCTCAA CTGGGATGGG CAGCTCGACA GCAGCATTCT GATTCGCTCG ATGTTGCTCG AGGGCAACAC CTTACGAGCC CATGCCGGCT GCGGCATCGT TACCGGTTCC GACCCCTATT GCGAAGCTGA TGAGCTGAAC TGGAAACTGC TGCCACTACT GGAGGCACTG CAATGA
|
Protein sequence | MRQLPHPRCS FERGWQIRSL QQAGTMTSLE RRLCRWQEPA MLAKQLTNTW GEAGLIWLDG DGSDLGRWAT LAVDPINQIC CRGIPGEKGA SNPFAALRDL EPGHWTGWLS YEAAAWIEPK NPWKADSMAT LWMARHDPIL RFDLQKRQLW IEGCHPKRLQ ELANWLEANP SEDAPKSSKD EPLLATTDLK IPVNAWEWLT TRADYARDVQ QIRHWIASGD IFQANLSACC TTTIPSGSFA VDLFLKLRHH NPAPFAGLVI AAGLAKGEAV ISASPERFLK ALPTGEVETR PIKGTRPRHP NQSQDADLAA DLVCSSKDRA ENVMIVDLLR NDLGRVCQPG SITVPQLVGL ESYPHVHHLT SVVQGRLRSD QSWVDLLQAC WPGGSISGAP KLRACQRLNE LEPTARGPYC GSLLHLNWDG QLDSSILIRS MLLEGNTLRA HAGCGIVTGS DPYCEADELN WKLLPLLEAL Q
|
| |