Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_06921 |
Symbol | trpD |
ID | 5731806 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 604843 |
End bp | 605877 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 641285055 |
Product | anthranilate phosphoribosyltransferase |
Protein accession | YP_001550577 |
Protein GI | 159903233 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0547] Anthranilate phosphoribosyltransferase |
TIGRFAM ID | [TIGR01245] anthranilate phosphoribosyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0742709 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.398882 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGACT TCTCTTGGCC AAAAATTCTT GACAAGCTTT TGAATGGTAA TGAACTTACC TCTGAAGAGA CAAATGCTTT GATGAATGCT TGGCTCAATC AAGAACTTGC TCCTGTTCAA ACCGGAGCTT TTCTAGCCGC TTTTAGATCT AAGAATGTTA GTGGATTGGA GTTGGCAGCT ATGGCTAAGG TCTTAAGAGA TGCCTGTGTT TTCCCTTTTC CTGTTCCTGA TCTTTATTTA GTGGATACAT GTGGTACTGG GGGGGATGGA GCGGATACAT TTAATATTTC AACTGCAGTA GCATTTCTGT CTGCATCTTT GGGTGTGAAA ATAGCCAAGC ATGGCAATCG AAGTGCTAGC GGAAAAGTTG GATCGGCTGA TGTTCTTGAA GGAGTAGGAA TCAGATTAGA CACTCCCATC GAAAATGTAG TGACAGCCCT AGATAAAACA GGCATTTCTT TTTTATTTGC ACCTATCTGG CACTCTTCAT TAGTGAACCT TGCCCCTTTA AGAAAGACTT TGGGAGTAAG AACAGTATTT AATCTTTTAG GACCATTGGT AAACCCTTTT AGGCCGAGTG CTCAAGTCCT AGGTGTTGCA ACATCAGAAT TACTAGATCC AATCGCTCAA GCTTTAAAAT ATTTAGGCTT AAAAAGAGCA GTTGTAGTTC ATGGTGCTGG AGGCTTAGAT GAAGCCTCAC TTGAAGGCAG TAACCAAGTA CGATTTCTAA AAGATGGCGA GATTTCCTCA TCTGAAATTG ATATAACTGA TTTAGGTCTT ACCCCGGCCT CCAATAATCA ATTAAAAGGT GGAGATCTTT CTAAAAATGA GGCTATCTTT ATGTCAGTTC TAAAAGGAAA TGCCACTAAG CCTCAGATGG AGGTGGTTGC TCTTAACACT GCTTTGGTCT TATGGGCTTC TGGCCTAGAA GAAGATTTGA GTCAAGGAGT CGAGATGGCA TTAAACTCTC TAAAAAGTGG AAATGGCTTG AAAAAATTAC TGGAATTAAA AGAGTTTTTA GGACCAAAGA ATTAG
|
Protein sequence | MSDFSWPKIL DKLLNGNELT SEETNALMNA WLNQELAPVQ TGAFLAAFRS KNVSGLELAA MAKVLRDACV FPFPVPDLYL VDTCGTGGDG ADTFNISTAV AFLSASLGVK IAKHGNRSAS GKVGSADVLE GVGIRLDTPI ENVVTALDKT GISFLFAPIW HSSLVNLAPL RKTLGVRTVF NLLGPLVNPF RPSAQVLGVA TSELLDPIAQ ALKYLGLKRA VVVHGAGGLD EASLEGSNQV RFLKDGEISS SEIDITDLGL TPASNNQLKG GDLSKNEAIF MSVLKGNATK PQMEVVALNT ALVLWASGLE EDLSQGVEMA LNSLKSGNGL KKLLELKEFL GPKN
|
| |