Gene Information |
Locus tag | NATL1_20231 |
Symbol | |
ID | 4779669 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 1667649 |
End bp | 1669169 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640085316 |
Product | anthranilate synthase component I/chorismate-binding protein |
Protein accession | YP_001015843 |
Protein GI | 124026728 |
COG category | [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | |
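The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length; the function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(1667649, 1669169)
print(length)                     # 1521
print(protein_length_aa(length))  # 506
```

Both results agree with the Gene Length and Protein Length fields of this record.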
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTAATT TAGATAAGGA AGAATTTATT TTGTCAGCGT CCAGCGGGGC TAATTATATT CCGTTGGCAA AAAGTTGGCC GGCAGATTTA GAAACTCCTC TTACTACCTG GCTTAAAGTT GGTAATGATG CTCCTTCAGG AGTATTGCTT GAATCAGTAG AGGGTGGAGA AACTATCGGT AGGTGGAGTG TGGTTGCATC AGATCCCCTT TGGAAAGTAG TAGTAAGGGG CGATGAATTA ACTAGATGCT GGAGGGATGG AAAACAAGAA AAGTTTCATG GAAATCCAGT GGAAATCCTC AGGAAAATGC TTGAGCCGTA TAAATCTGTT TCTTTGCCTG GCTTGCCACA ACTGGGACAA CTTTTTGGCA TGTGGGGATA TGAACTAATT CAATGGATAG AGCCCTCAGT GCCTACTTAT GAATTATCAG ATCAAGACTT ACCTGATGGT ATTTGGATGT TTATGGACAA AGTTCTGATT TTTGATCAAG TCAAACGCCT AATAACAGCT GTTGCATATG GAAATTTAAG TGATGGAGTT TCTTCTCAAA AAGCTTATGA AATTGCCTGT GAACAAATCA ATGAACTGCA AGATTTAATG TCTTCTCCTT TAAAGCCAAT AAAGTCTTTA AAGTGGAATC AAAAATCGAA TAGATCTCCT GATATGGCTG CTAATACCTC AAAAAGTGAA TTTGAACATA GTGTTGAAGC GGCAAAAGAA TTTATTAAAC AAGGCGATGT TTTTCAGTTA GTTCTTAGTC AAAAATTGGA GTCGACTGTT ACGCAAAAAC CCTTTGAACT ATATCGAAGC CTAAGGATGG TAAATCCCTC TCCATTTATG GCGTTTTTTG ACTTTGGTGA CTGGCAACTT ATTGGTTCTA GCCCGGAGGT AATGGTTAAG GCCCAAAAAA CAGAAAAGGG TATTCAAACA AGTTTGAGAC CAATTGCAGG TACACGTCCT AGAGGTAAAA ATGATTTGGA AGATGCAGCC TTAGAGAAAG ATCTTTTAAA AGATCCCAAA GAACGAGCAG AACATGTGAT GTTGGTAGAT TTGGGTCGAA ATGATTTAGG TCGTGTTTGT ACCCCAGGTA GTGTTGTTGT GAAAGAATTA ATGGTTATTG AAAAATATTC GCATGTAATG CATATCGTCA GTGAGGTTGA AGGCACTTTA AAAAAAGAAC AGGATGTTTG GGACTTATTA ATTGCTTCTT TCCCAGCTGG GACTGTAAGT GGAGCCCCAA AAATAAGAGC AATGCAACTA ATTAATCAAT TAGAAAAACA ACGTAGAGGG CCTTATTCAG GCGTTTATGG GTCTATAGAT TTAAATGGCG CATTAAATAC AGCTATTACT ATTAGGACAA TGATTGTACG TAAAAAAAAT AAAAATGGTT TTACTGTTGA AGTGCAAGCA GGGGCAGGGG TTGTTGCAGA TTCCATTTCT TCTAATGAGT ATCAAGAAAC TTTAAATAAA GCCAAAGGGA TGTTTACTGC TTTAGCTTGC TTAGACCCCC AAGATTTATG A
Protein sequence | MLNLDKEEFI LSASSGANYI PLAKSWPADL ETPLTTWLKV GNDAPSGVLL ESVEGGETIG RWSVVASDPL WKVVVRGDEL TRCWRDGKQE KFHGNPVEIL RKMLEPYKSV SLPGLPQLGQ LFGMWGYELI QWIEPSVPTY ELSDQDLPDG IWMFMDKVLI FDQVKRLITA VAYGNLSDGV SSQKAYEIAC EQINELQDLM SSPLKPIKSL KWNQKSNRSP DMAANTSKSE FEHSVEAAKE FIKQGDVFQL VLSQKLESTV TQKPFELYRS LRMVNPSPFM AFFDFGDWQL IGSSPEVMVK AQKTEKGIQT SLRPIAGTRP RGKNDLEDAA LEKDLLKDPK ERAEHVMLVD LGRNDLGRVC TPGSVVVKEL MVIEKYSHVM HIVSEVEGTL KKEQDVWDLL IASFPAGTVS GAPKIRAMQL INQLEKQRRG PYSGVYGSID LNGALNTAIT IRTMIVRKKN KNGFTVEVQA GAGVVADSIS SNEYQETLNK AKGMFTALAC LDPQDL
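The gene and protein sequences are displayed in space-separated blocks of 10 characters. As a minimal sketch of how the two fields relate, the code below strips the spacing, translates the DNA codon by codon, and computes the GC fraction. It assumes the gene sequence is already given in coding orientation (5' to 3', starting at the ATG), and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code, differing only in the permitted start codons. All function and variable names are illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments,
# shared by translation table 11); "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing used in the record and upper-case the result."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence from this record.
fragment = clean_sequence("ATGCTTAATT TAGATAAGGA AGAATTTATT")
print(translate(fragment))  # MLNLDKEEFI
```

The translated fragment matches the first block of the protein sequence. Passing the full gene sequence through `clean_sequence` and `gc_fraction` would give a value to compare against the GC content field.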