Gene NATL1_15241 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_15241 
SymbolguaB 
ID4780696 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1241379 
End bp1242542 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content41% 
IMG OID640084806 
Productinosine 5-monophosphate dehydrogenase 
Protein accessionYP_001015346 
Protein GI124026230 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0516] IMP dehydrogenase/GMP reductase 
TIGRFAM ID[TIGR01304] IMP dehydrogenase family protein 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAATATTC AACTAGGACG CACCAAAATT GTTCGAAGAG CCTACGGGAT TGATGAGATT 
GCATTAGTTC CTGGAAGAAG AACAGTTGAC CCTGGCATCA CAAAAACCAA TTGGGAAATC
GGTGGAATTG AAAGGGATAT TCCCATAATC GCAAGCGCGA TGGATGGTGT TGTAGATGTA
AACATGGCCG TAGCCTTGTC AAAATTAGGA GCCCTAGGTG TTTTAAATCT AGAAGGAGTA
CAAACTAGAT ATGAAGATCC CAAAGAAGTT CTAACGAAAA TCCAATCGAT CGGGAAAGAG
GAATTTGTTC CCCTAATGCA AGAAATCTAT AATAAGCCGA TCAAAGAAAA GTTAATTTTA
AAAAGAATTC AAGAAATCAA AGATAGTGGG GGTATTGCTG CTGTAAGCGG GACACCATTA
GCAGCAATTA AATATAAAAA TCTAGTCAAA GACTCAGGTG CAGACTTATT TTTCCTTCAG
GCGACTGTAG TTTCAACAGA ACATCTGGGT AAAGAGGGTA GTCAAAATCT TGATCTTTAT
GATCTTTGCG AAAACATTGG CATACCGGTT GCTGTGGGTA ATTGCGTTAC TTATGAAGTG
TCTTTAAAGC TTATGAAAGC AGGCGCTGCC GCAGTGATGG TTGGTATTGG ACCTGGAGCC
GCATGCACCT CCAGAGGAGT ATTAGGGGTT GGAATTCCTC AAGCAACTGC TATTTCTGAT
TGCGCTGCTG CAAGGGATGA TTTTCAAAAA GAAAGTGGGA AATATGTCCC AATTATTGCT
GATGGAGGAA TTATCACTGG TGGGGATATT TGCAAATGCA TAGCATGTGG TGCTGATTCA
GTGATGATTG GTTCCCCTAT CGCAAGATCT CAGGAGGCTC CAGGTAAAGG TTTCCATTGG
GGTATGGCAA CACCAAGTCC TGTTCTACCA AGAGGTACAA GGATTCAAGT TGGAACAACC
GGTAGTTTAA AAAGTATTCT TTGTGGACCA GCAATTCTTG ATGATGGAAC CCACAATTTA
TTAGGAGCAA TAAAAACCTC AATGGGAACT CTAGGCGCAA CTAATATCAA GGAAATGCAA
AATGTTGAGG TAGTTATCGC ACCTTCTTTG TTAACAGAAG GAAAGGTTTA CCAAAAAGCG
CAACAGCTTG GAATGGGAAA GTAA
 
Protein sequence
MNIQLGRTKI VRRAYGIDEI ALVPGRRTVD PGITKTNWEI GGIERDIPII ASAMDGVVDV 
NMAVALSKLG ALGVLNLEGV QTRYEDPKEV LTKIQSIGKE EFVPLMQEIY NKPIKEKLIL
KRIQEIKDSG GIAAVSGTPL AAIKYKNLVK DSGADLFFLQ ATVVSTEHLG KEGSQNLDLY
DLCENIGIPV AVGNCVTYEV SLKLMKAGAA AVMVGIGPGA ACTSRGVLGV GIPQATAISD
CAAARDDFQK ESGKYVPIIA DGGIITGGDI CKCIACGADS VMIGSPIARS QEAPGKGFHW
GMATPSPVLP RGTRIQVGTT GSLKSILCGP AILDDGTHNL LGAIKTSMGT LGATNIKEMQ
NVEVVIAPSL LTEGKVYQKA QQLGMGK