Gene P9303_01071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_01071 
Symbol 
ID4776535 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp106451 
End bp107953 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content37% 
IMG OID640085606 
Productimidazoleglycerol-phosphate synthase 
Protein accessionYP_001016127 
Protein GI124021820 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0107] Imidazoleglycerol-phosphate synthase
[COG0118] Glutamine amidotransferase 
TIGRFAM ID[TIGR00735] imidazoleglycerol phosphate synthase, cyclase subunit
[TIGR01855] imidazole glycerol phosphate synthase, glutamine amidotransferase subunit 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.388112 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAAGGC AATCAAGCAA AAATACAGTA AAATATATGG CACTATCTAA AAGAATTATT 
GCACGATTAG ATGTGAAGGG AAATAAGATC ATTAAAGGTG TCCGTTTTGA AGGCCTACGA
GTAATTGGCG ATCCGTTAGA TATGGCAAGA AAGTATGCAA AGGCTGGTAT TGATGAACTG
CTTTATATTG ATGCTGTTGC AAGCTTATAT GGCAGAAATA GTCTTATCGA AGTATTGAAA
AGCACCTCAT CAGAAGTCTT CATACCAATT ACTGCAGGAG GAGGCATTAG AAATCTAGAA
GATGCAAAGT TACTTTTATC TAATGGGGCG GACAAAATTG CGATTAACAC TGCGGCGCTG
GAGTGCCCCC AATTGATATC AGAACTTTCT AACACCTTAG GAAGTCAATG TGTAGTTGTA
TCTATTCAGG CTAGAGCTTC ATCAAGCTCA TCATGGGACG CAATGAAAGA ATCTGGAAGG
GAAAAGTCAG GTATTGATGT CATAAAATGG ATTGAAAATG TACAACTTTT AGGAGCAGGA
GAGATATTAT TGACTTCAGT AGATCAAGAT GGGACATGTA TAAAGCCAGA CGAAATATTA
ATTCAGAAAG TACTTCCTAT CCTGAAACTA CCTTTAATCG TAAGCGGAGG TTTTTCTGAT
CCTAATCAAG TCTATCAAGC ATTAACTAAT TCCAGGATAT CAGCAGTAGC GATTGGAGCC
GCTTTGCATT ATCAAAAGAC TAATGTATCG GATATTAAGA GTATGCTGTC ATTTAATAAT
ATAGCAATTC GTAAAACAAA AGAGCCTGCC ATCGAAAATA ATCATTACTT AAATAAAAAT
AATAAAATAG TTGGGATTAT AGATTATGGC ATGGGAAATC ATCAAAGTCT AATAAATGCA
TTACATGAAT TAGGATATAG TTCAATTGTT AGTCATGTAG CAGAAGAACT AAACAACTGT
GAATTAATTG CATTGCCAGG TGTAGGTGCA TTTGCCAAAG GGATGGATAA TTTAAGAAGA
ATGGGTTTAG ACCAATATCT TATTAATGCA TGCAACGAGG GCCGTCCATT AATAGGTATA
TGTCTTGGAA TGCAATTGCT GTTTGAGTCA AGTGAAGAGT ACGGCATCAA TAAAGGATTG
GGATTGCTAA ACGGATGTGT GTCCAAAATG CAAGAACATT TCAGTAGTAA AGCCTTCTAT
CCGCTTCCAC ATATGGGTTG GAACTTAATA GAGTCTACTA AATTAGCCAA AGATTCTGGT
CTGAGTGAAA ACTCATTTTA TCAATATTTT GTTCACAGCT ATTCAGTGAA ACCGAAGTCT
GATGAGATTA TTCTACATAC TTGTGAATAT GCAGAGAACT CAATAGTCGC ATCTGTTCTT
TATGAGAATA TATGTGGTCT TCAGTTTCAC CCAGAAAGAA GTGGAGAAGT GGGGATTAAT
CTTCTAGATA AGATCCTGGA AAAATTGACT AGTGGCAAAG ATGATCAAAC CCAGAGCTTG
TAA
 
Protein sequence
MGRQSSKNTV KYMALSKRII ARLDVKGNKI IKGVRFEGLR VIGDPLDMAR KYAKAGIDEL 
LYIDAVASLY GRNSLIEVLK STSSEVFIPI TAGGGIRNLE DAKLLLSNGA DKIAINTAAL
ECPQLISELS NTLGSQCVVV SIQARASSSS SWDAMKESGR EKSGIDVIKW IENVQLLGAG
EILLTSVDQD GTCIKPDEIL IQKVLPILKL PLIVSGGFSD PNQVYQALTN SRISAVAIGA
ALHYQKTNVS DIKSMLSFNN IAIRKTKEPA IENNHYLNKN NKIVGIIDYG MGNHQSLINA
LHELGYSSIV SHVAEELNNC ELIALPGVGA FAKGMDNLRR MGLDQYLINA CNEGRPLIGI
CLGMQLLFES SEEYGINKGL GLLNGCVSKM QEHFSSKAFY PLPHMGWNLI ESTKLAKDSG
LSENSFYQYF VHSYSVKPKS DEIILHTCEY AENSIVASVL YENICGLQFH PERSGEVGIN
LLDKILEKLT SGKDDQTQSL