Gene P9211_14921 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_14921 
Symbol 
ID5730434 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp1333639 
End bp1335078 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content35% 
IMG OID641285870 
Productimidazoleglycerol-phosphate synthase 
Protein accessionYP_001551377 
Protein GI159904033 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0107] Imidazoleglycerol-phosphate synthase
[COG0118] Glutamine amidotransferase 
TIGRFAM ID[TIGR00735] imidazoleglycerol phosphate synthase, cyclase subunit
[TIGR01855] imidazole glycerol phosphate synthase, glutamine amidotransferase subunit 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0377543 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCATTAT GCAAAAGAGT GATCGCTAGA CTTGATGTTA AAGGAACAAG GTTAATTAAA 
GGCATCAGAT TCGAAGGCTT GAGAGTAATG GGGGATTCTC GTGAAGCAGC TATTAATTAT
TTCAATTCAG GTGTAGACGA GATTTTGTAT ATAGATTCAG TTGCAAGCTT ATATGGTAGA
AACAGCCTGA CAGAAATTCT AAAAAGTACG GCTAAAAGTA TTTTTATTCC AATTACTGCA
GGAGGAGGTG TTCGTAATAT AGAAGATGCA GCCAGACTAT TAGCCGCTGG TGCAGATAAG
ATCGCAGTAA ATACTGCATG CATTCAAAAC CCAAAACTGA TTAATCAATT AGCCAAGGAG
TTTGGCTCTC AGTGTATTGT GGTTTCTATA CAAGCAAGAG CAAAGCCTTC TTCCCAAGAA
TGGGAATGTA TGACAGAAGC TGGAAGAGAA AGAAGTGATG TCTCTGTAAT TGATTGGATA
CAAAAAGTAC AAGAGCTAGG TGCTGGTGAA ATATTACTTA CATCAGTTGA CCAAGATGGA
ACTTGCAAAG GGCCAGATAA GAAACTTATT AATGCTGCTG CCGATGTTGC AGAAGTACCT
TTAATAGTTG GAGGAGGAAT ATCAACTGTT GAAGAGATAG AAGATTCTTT TAGAAAAACA
ATTATTACCG GTGTAAGTTT AGCTGCTTCA TTACACCATA GGAAAATAGA GGTAAAAGAA
ATAAAAAGTC GTTTATTGTC TTCTGATTTT AATATTAGGA TTCCTGCTTC AATTAAAAGA
GAAGAATTAA AAGAAAAGCT TAGTGATATC AATGTAGGAA TTATCGATTA TGGTATGGGT
AATCAACAAA GTCTGATAAA TGCATTCACC GAAATAGGTC TTAGCACTTG TTTGACTTCT
TCCGTTGAAA AATTAACTAA AACAGATTTA CTAGCACTTC CAGGGGTTGG CTCATTCCCA
AAAGGTATGG AGATGCTAAA AGAACTAGAT CTTATTGATT TCTTAAAAGA TAGAGCAGAG
AAAGATCATC CACTAATTGG TATTTGCTTG GGTATGCAAA TGTTATTTGA AAGTGGTAAT
GAATTCAAAC ATACTAAAGG TCTTGGCTTA ATTGAAGGTG AGGTTGAAAT GCTATCAGTT
AGTCTGAAAC CAGATACTAT AAATGTATTA CCTCATGTTG GATGGAATAA GATTTATAAA
CATAATGAGG CTGAAAATAG CTGTGAAAAG TCATTCAATC AATACTTCGT TCATAGCTAT
GCTGCTATAG ATGTACCAAA AGAGTATATA ACTTATGAAT GTAATTATGC TGGAAATGAC
TTTATAGCTG CCGTAAACAA AAATTGCATA AGTGGTTTTC AGTTTCACCC CGAACGTAGT
GGAAGAGCTG GACTTAATCT TTTGGCAGAT GAAGTTTTAA GGTTGGTAAG AAGTCAATGA
 
Protein sequence
MSLCKRVIAR LDVKGTRLIK GIRFEGLRVM GDSREAAINY FNSGVDEILY IDSVASLYGR 
NSLTEILKST AKSIFIPITA GGGVRNIEDA ARLLAAGADK IAVNTACIQN PKLINQLAKE
FGSQCIVVSI QARAKPSSQE WECMTEAGRE RSDVSVIDWI QKVQELGAGE ILLTSVDQDG
TCKGPDKKLI NAAADVAEVP LIVGGGISTV EEIEDSFRKT IITGVSLAAS LHHRKIEVKE
IKSRLLSSDF NIRIPASIKR EELKEKLSDI NVGIIDYGMG NQQSLINAFT EIGLSTCLTS
SVEKLTKTDL LALPGVGSFP KGMEMLKELD LIDFLKDRAE KDHPLIGICL GMQMLFESGN
EFKHTKGLGL IEGEVEMLSV SLKPDTINVL PHVGWNKIYK HNEAENSCEK SFNQYFVHSY
AAIDVPKEYI TYECNYAGND FIAAVNKNCI SGFQFHPERS GRAGLNLLAD EVLRLVRSQ