Gene P9303_15761 details


Gene Information

Locus tag: P9303_15761
Symbol: trpD
ID: 4775956
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 1379931
End bp: 1381043
Gene Length: 1113 bp
Protein Length: 370 aa
Translation table: 11
GC content: 57%
IMG OID: 640087085
Product: anthranilate phosphoribosyltransferase
Protein accession: YP_001017585
Protein GI: 124023278
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0547] Anthranilate phosphoribosyltransferase
TIGRFAM ID: [TIGR01245] anthranilate phosphoribosyltransferase
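
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. Neither convention is stated in the record, and the helper names `span_length` and `expected_protein_length` are hypothetical.

```python
# Values copied from the record above.
start_bp = 1379931
end_bp = 1381043
gene_length_bp = 1113
protein_length_aa = 370


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end - start + 1


def expected_protein_length(gene_len: int) -> int:
    """Codon count minus one, assuming the gene length includes the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len // 3 - 1


coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)  # prints: True True
```

Under these assumptions, the span 1379931 to 1381043 covers 1113 bp, and 1113 bp corresponds to 371 codons, that is 370 amino acids plus one stop codon.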


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTTGTCCC CATCACCCAG CCCGGAGTGC TTCGCACCCG GGTTTTTTTC TGCTCCGATG 
CCTACTTCAG CTATTGCCTC TCCCTCTTGG TCGCAAATCC TCGAGATGTT GCTCGAGGGA
CAAAACCTGC CCGAGGTGGA AGCAACTGCC TTGATGGAGG CTTGGTTGGC GGAACAACTA
ACCCCTGTGC AGACAGGTGC GTTTCTAGCA GCTTTGCGAG CCAAGGGGGT AACCGGAAGT
GAGTTGTCAG GCATGGCCCA GGTGTTGCGA GGTGCTTGTC CCTTGCCTTG CCCATTGCCA
GGCATCCCTA TGGTCGACAC CTGTGGAACA GGTGGCGACG GTGCAGATAC CTTCAACATC
TCAACCGCAG TGGCGTTTAC TGCTGCCGCC TGCGGGGCGA ATGTGGCTAA GCATGGCAAT
CGCAGTGCAA GTGGCAAAGT CGGTTCAGCA GATGTTCTCG AGGGCCTGGG TCTGCAGCTC
AAGGCTCCTC TTGTCTCTGT GGTGGAGGCC CTGGCTGAGG TACGCGTCAC ATTTTTGTTT
GCCCCGGCCT GGCACCCCGC TTTGGTCAAC TTGGCCCCGT TGCGGCGCAG CCTTGGAGTG
CGCACCGTGT TCAATCTTCT AGGTCCACTG GTGAATCCTT TACAACCGAA TGCCCAAGTT
CTCGGGGTAG CTAAGGCTGA GCTGCTCAAT CCAATGGCGG AAGCATTGCA ACGGCTTGGC
TTGCAGCGGG CCGTTGTTGT CCATGGCGCC GGTGGCCTTG ATGAAGCGTC GTTGGAGGGA
GTCAATGCAA TGCGTTTGCT TGAGGATGGT CATGTGCGAC AAGCATCGAT CGATTCGGCA
GAACTCGGGC TTACTAGAGC TCCTTTGCAG GCTCTCCAGG GGGGTGATTT GGCAACAAAT
CAAGCGATTC TTTCCGCTGT ACTTCAGGGA GGCGGCACCG CCCCTCAAAG GGATGTGGTG
GCATTGAACA CAGCCCTAGT GCTCTGGGCT GCTGGCCTAC AAGATGATTT ACGAGCAGGT
GTTTCTGCTG CAAAGACTTG CCTGCAGGAG GGCCTCCCCT GGCAGCGGCT AGAAGGGCTC
CGCATGGCAC TTGATCATCA AATTGGAGAA TGA
 
Protein sequence
MLSPSPSPEC FAPGFFSAPM PTSAIASPSW SQILEMLLEG QNLPEVEATA LMEAWLAEQL 
TPVQTGAFLA ALRAKGVTGS ELSGMAQVLR GACPLPCPLP GIPMVDTCGT GGDGADTFNI
STAVAFTAAA CGANVAKHGN RSASGKVGSA DVLEGLGLQL KAPLVSVVEA LAEVRVTFLF
APAWHPALVN LAPLRRSLGV RTVFNLLGPL VNPLQPNAQV LGVAKAELLN PMAEALQRLG
LQRAVVVHGA GGLDEASLEG VNAMRLLEDG HVRQASIDSA ELGLTRAPLQ ALQGGDLATN
QAILSAVLQG GGTAPQRDVV ALNTALVLWA AGLQDDLRAG VSAAKTCLQE GLPWQRLEGL
RMALDHQIGE
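
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It is not the database's own pipeline. It assumes the record's translation table 11 assigns amino acids as in the standard genetic code, and it treats ATG, GTG and TTG as start codons that are read as M in the first position; only that subset of start codons is handled here. The function name `translate` and the constant names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: only these start codons are handled in this sketch.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiating codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("GTGTTGTCCC CATCACCCAG CCCGGAGTGC"))  # prints: MLSPSPSPEC
```

The output matches the first ten residues of the protein sequence, which shows why the protein begins with M even though the gene begins with GTG rather than ATG.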