Gene P9211_06921 details

Gene Information

Locus tag: P9211_06921
Symbol: trpD
ID: 5731806
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9211
Kingdom: Bacteria
Replicon accession: NC_009976
Strand:
Start bp: 604843
End bp: 605877
Gene Length: 1035 bp
Protein Length: 344 aa
Translation table: 11
GC content: 40%
IMG OID: 641285055
Product: anthranilate phosphoribosyltransferase
Protein accession: YP_001550577
Protein GI: 159903233
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0547] Anthranilate phosphoribosyltransferase
TIGRFAM ID: [TIGR01245] anthranilate phosphoribosyltransferase
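
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(604843, 605877)
print(length, protein_length(length))  # prints: 1035 344
```

Under these assumptions the computed values agree with the Gene Length (1035 bp) and Protein Length (344 aa) fields.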


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0742709
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.398882
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGACT TCTCTTGGCC AAAAATTCTT GACAAGCTTT TGAATGGTAA TGAACTTACC 
TCTGAAGAGA CAAATGCTTT GATGAATGCT TGGCTCAATC AAGAACTTGC TCCTGTTCAA
ACCGGAGCTT TTCTAGCCGC TTTTAGATCT AAGAATGTTA GTGGATTGGA GTTGGCAGCT
ATGGCTAAGG TCTTAAGAGA TGCCTGTGTT TTCCCTTTTC CTGTTCCTGA TCTTTATTTA
GTGGATACAT GTGGTACTGG GGGGGATGGA GCGGATACAT TTAATATTTC AACTGCAGTA
GCATTTCTGT CTGCATCTTT GGGTGTGAAA ATAGCCAAGC ATGGCAATCG AAGTGCTAGC
GGAAAAGTTG GATCGGCTGA TGTTCTTGAA GGAGTAGGAA TCAGATTAGA CACTCCCATC
GAAAATGTAG TGACAGCCCT AGATAAAACA GGCATTTCTT TTTTATTTGC ACCTATCTGG
CACTCTTCAT TAGTGAACCT TGCCCCTTTA AGAAAGACTT TGGGAGTAAG AACAGTATTT
AATCTTTTAG GACCATTGGT AAACCCTTTT AGGCCGAGTG CTCAAGTCCT AGGTGTTGCA
ACATCAGAAT TACTAGATCC AATCGCTCAA GCTTTAAAAT ATTTAGGCTT AAAAAGAGCA
GTTGTAGTTC ATGGTGCTGG AGGCTTAGAT GAAGCCTCAC TTGAAGGCAG TAACCAAGTA
CGATTTCTAA AAGATGGCGA GATTTCCTCA TCTGAAATTG ATATAACTGA TTTAGGTCTT
ACCCCGGCCT CCAATAATCA ATTAAAAGGT GGAGATCTTT CTAAAAATGA GGCTATCTTT
ATGTCAGTTC TAAAAGGAAA TGCCACTAAG CCTCAGATGG AGGTGGTTGC TCTTAACACT
GCTTTGGTCT TATGGGCTTC TGGCCTAGAA GAAGATTTGA GTCAAGGAGT CGAGATGGCA
TTAAACTCTC TAAAAAGTGG AAATGGCTTG AAAAAATTAC TGGAATTAAA AGAGTTTTTA
GGACCAAAGA ATTAG
 
Protein sequence
MSDFSWPKIL DKLLNGNELT SEETNALMNA WLNQELAPVQ TGAFLAAFRS KNVSGLELAA 
MAKVLRDACV FPFPVPDLYL VDTCGTGGDG ADTFNISTAV AFLSASLGVK IAKHGNRSAS
GKVGSADVLE GVGIRLDTPI ENVVTALDKT GISFLFAPIW HSSLVNLAPL RKTLGVRTVF
NLLGPLVNPF RPSAQVLGVA TSELLDPIAQ ALKYLGLKRA VVVHGAGGLD EASLEGSNQV
RFLKDGEISS SEIDITDLGL TPASNNQLKG GDLSKNEAIF MSVLKGNATK PQMEVVALNT
ALVLWASGLE EDLSQGVEMA LNSLKSGNGL KKLLELKEFL GPKN