Gene Information |
Locus tag | NATL1_21661 |
Symbol | |
ID | 4779728 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 1823252 |
End bp | 1824592 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640085464 |
Product | putative p-aminobenzoate synthetase |
Protein accession | YP_001015986 |
Protein GI | 124026871 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | |
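The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched in a few lines of Python. This is a minimal sketch that assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon while the protein length does not. The function names are hypothetical and are not part of any IMG or NCBI tool.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start/end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length_bp(1823252, 1824592)
print(length)                     # 1341
print(protein_length_aa(length))  # 446
```

Both results match the Gene Length (1341 bp) and Protein Length (446 aa) fields of the record.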
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.145928 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGAATAATA TTAAACGACT ACTATGTCAG TGGGAATCAC CTGAGCAAGT AGCCTATAAA TTGATTCAAG AATGGGGAGA GGCGGGATTT ATATGGCTTG ATGGCGATGG GAGTGATTTA GGTCGATGGG TCACCTTAGG AATTAATCCT TTAGAACAGT TTTGCTCTAG AGAATTAGAA AATTCAAAAA AGTACTCTAA TCCTTTTAAA ATTTTACGCG AATTACCCCA AGGGCATTGG ACTGGTTGGT TGAGTTATGA AGCTGCATCT TGGACAGAAC CACAAAACCC ATGGCAAAAA AGTTCCATGG CAACTTTATG GATAGCCTCT CATGACCCAA TATTAAAATT TGATCTTCAG AAGAGAGAGC TATGGCTAGA AGGCAAAGAT CCAAAACGCA TCTCTATTAT GGAAAATTTT CTAAAAAGTA CTTTTCATCA AAAGCTTTCT AAACCAGACA GTGAAGCAGC AAAACAAGCA GAATGCAAAC ATAGTATTCC CCTAGAATCT TGGGACTGGC GGTTAACAAG TCAAGAATAT TCTGAAAGAG TTGATGAGAT CAAAGAATGG ATTGCGAACG GTGATATTTT TCAGGCTAAT CTAACTACCT CATGCCAAGC CCCATTGCCA GAATCAATGC GTCCAATAGA CGTATATTCA AAACTTAAAA AATACTCCCC TGCTCCATTT GCTGGAGTAA TTATCGGCGA TCAGATGGCA AAAGGAGAAG CTATTATATC AACTTCACCA GAAAGATTTT TAAAAGCGTT ACCCAGTGGT GAGGTGGAAA CGAGACCAAT CAAAGGTACT AGACCAAGAG ATAGAGATCC AGAAAAAGAT GCTGATTGGG CAGCAGAGCT TATTTGTAGT CCAAAAGATC ATGCTGAGAA TGTAATGATT GTAGATCTAC TAAGAAATGA TCTTGGAAGA GTATGCCAAC CTGGTTCAAT CAATGTTCCT CATCTACTAG TTTTAGAAAG CTATTCACAA GTTCATCATC TCACTTCAGT AGTTAAAGGT AGACTCAATA CAAATAAGAC TTGGGTTGAC TTACTTGAAG CCTGCTGGCC AGGAGGTTCA GTAACAGGAG CACCGAAGCT TAGAGCATGT AAAAGATTGT ATGAACTTGA ACCAACAGCG AGAGGTCCAT ATTGTGGATC AATATTAAAT ATAAATTGGG ATGGAGTACT TGATAGTAAT ATTCTCATTC GATCTTTAAT GATTAAAGAA TCTTCTATCA GTGCTCATGC TGGATGTGGA ATTGTTGCAG ATTCAGATAG TCAAAAGGAA GCTGAAGAAA TGAATTGGAA ATTGATGCCA CTTTTAAACG CATTAACATG A
Protein sequence | MNNIKRLLCQ WESPEQVAYK LIQEWGEAGF IWLDGDGSDL GRWVTLGINP LEQFCSRELE NSKKYSNPFK ILRELPQGHW TGWLSYEAAS WTEPQNPWQK SSMATLWIAS HDPILKFDLQ KRELWLEGKD PKRISIMENF LKSTFHQKLS KPDSEAAKQA ECKHSIPLES WDWRLTSQEY SERVDEIKEW IANGDIFQAN LTTSCQAPLP ESMRPIDVYS KLKKYSPAPF AGVIIGDQMA KGEAIISTSP ERFLKALPSG EVETRPIKGT RPRDRDPEKD ADWAAELICS PKDHAENVMI VDLLRNDLGR VCQPGSINVP HLLVLESYSQ VHHLTSVVKG RLNTNKTWVD LLEACWPGGS VTGAPKLRAC KRLYELEPTA RGPYCGSILN INWDGVLDSN ILIRSLMIKE SSISAHAGCG IVADSDSQKE AEEMNWKLMP LLNALT
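Both sequences above are displayed in space-separated blocks of ten characters, so the spacing has to be removed before any downstream use. The following Python sketch shows one way to do that and to compute a GC fraction comparable to the GC content field. The helper names are hypothetical, and the stop codon set is the one used by translation table 11 (TAA, TAG, TGA).

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def strip_blocks(seq: str) -> str:
    """Remove the display whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First two display blocks of the gene sequence, used as a small example.
fragment = strip_blocks("GTGAATAATA TTAAACGACT")
print(len(fragment))          # 20
print(gc_fraction(fragment))  # 0.25
```

Applied to the full gene sequence, `strip_blocks` should yield a string whose length equals the Gene Length field, and `gc_fraction` can be compared against the reported GC content. The gene sequence begins with GTG while the protein begins with M, which is consistent with GTG serving as an alternative start codon under translation table 11.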