Gene P9515_11521 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9515_11521 
SymbolguaB 
ID4719045 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9515 
KingdomBacteria 
Replicon accessionNC_008817 
Strand
Start bp1002210 
End bp1003511 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content36% 
IMG OID640080833 
Productinosine 5-monophosphate dehydrogenase 
Protein accessionYP_001011466 
Protein GI123966385 
COG category[C] Energy production and conversion 
COG ID[COG1304] L-lactate dehydrogenase (FMN-dependent) and related alpha-hydroxy acid dehydrogenases 
TIGRFAM ID[TIGR01304] IMP dehydrogenase family protein 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAAAA TAATGCTAAT AATTAATTTT CAAATACTCA TATTTCTACA AAATTATAAA 
AAAACTCTTT TGTTTTTAAG AAATTATTGG CTTATTACAC CTTTAGTTGT TAACTTAATC
AGAATAATTA AAAAAACCGT GAATATTGAA ATTGGCTTAA ACAAAAAAGT TAGAAGGGCT
TACGGCATTG ATGAAATTGC ATTAGTGCCA GGAACAAGAA CCCTAGATTA CGAATTAACA
AATCCTTCTT GGTCAATTGG AAATATTGAA AGAGATATTC CAATTATCGC CAGTGCAATG
GATAGTGTTG TTGATGTGAA TACAGCTGTA GATCTCTCTA AATTAGGAGC TATTGGTGTT
CTCAACATGG AAGGTATACA AACTAGGTAT GAAAATCCAA AGGAAATACT AAGTCAAATC
TCCTCCGTAG GAAAAAATGA ATTCGTACCT TTGATGCAAG AAATATATAA AGAACCCATC
AAGCAAGAAC TTATTTTGCA AAGAATTAAC GAAATTAAAG AAAAAGATGG AATAGCAGCA
TTAAGTGGAA CACCGCAAGC AGCTATCAAA TTTAAAGAAA CACTTGTGAA GTCGAAAATA
GATTTATTCT TTCTTCAAGG TACTGTAGTT TCAACCGAAC ATTTAGGTAT GGAGGGGAAT
GAGACATTAA ATATCAAAAG CTTATGTCAA TCTTTAAAAG TACCAGTTGT TGCAGGTAAT
TGTGTAACTT ATGAAGTTGC AGAACTTCTT ATGAAATCAG GTGTTGCAGG TCTTATGGTG
GGAATCGGCC CAGGAGCAGC TTGCACTTCG AGAGGAGTAT TGGGAATAGG AATCCCCCAA
GCAACAGCAA TCTCTGATTG TAGTTCAGCA AGAGATGATT ATTTTCAAGA AACTGGTCGT
TATGTCCCCA TAATTGCTGA TGGAGGAATT GTTACTGGTG GTGACATTTG CAAATGCATC
GCCTGTGGTG CTGACGCAGT TATGATTGGT TCTCCAATAG CTAAATCAAC AAGTGCTCCG
GGCAATGGAT TTCATTGGGG TATGGCCACA CCAAGTCCTA TATTACCTAG AGGTACAAGA
ATTGAAGTCG GCTCTACAGG TTCCTTAGAG AGAATATTAA AAGGACCCGC AATACTTGAT
GATGGGACAC ACAATTTACT TGGAGCTATT AGGACATCAA TGAGTACTCT TGGAGCTAAA
AATATCAAAG AGATGCAAAA TGTTGATATT GTAATTGCGC CATCTCTTTT AACAGAGGGA
AAAGTATATC AAAAAGCTCA ACAGCTTGGA ATGGGTAAAT AA
 
Protein sequence
MNKIMLIINF QILIFLQNYK KTLLFLRNYW LITPLVVNLI RIIKKTVNIE IGLNKKVRRA 
YGIDEIALVP GTRTLDYELT NPSWSIGNIE RDIPIIASAM DSVVDVNTAV DLSKLGAIGV
LNMEGIQTRY ENPKEILSQI SSVGKNEFVP LMQEIYKEPI KQELILQRIN EIKEKDGIAA
LSGTPQAAIK FKETLVKSKI DLFFLQGTVV STEHLGMEGN ETLNIKSLCQ SLKVPVVAGN
CVTYEVAELL MKSGVAGLMV GIGPGAACTS RGVLGIGIPQ ATAISDCSSA RDDYFQETGR
YVPIIADGGI VTGGDICKCI ACGADAVMIG SPIAKSTSAP GNGFHWGMAT PSPILPRGTR
IEVGSTGSLE RILKGPAILD DGTHNLLGAI RTSMSTLGAK NIKEMQNVDI VIAPSLLTEG
KVYQKAQQLG MGK