Gene NATL1_20401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_20401 
Symbolcmk 
ID4779871 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1683005 
End bp1684555 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content33% 
IMG OID640085334 
Productbifunctional pantoate ligase/cytidylate kinase 
Protein accessionYP_001015860 
Protein GI124026745 
COG category[F] Nucleotide transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0283] Cytidylate kinase
[COG0414] Panthothenate synthetase 
TIGRFAM ID[TIGR00017] cytidylate kinase
[TIGR00018] pantoate--beta-alanine ligase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.523642 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTACGAA AAATCTTCCA AACTAATGCT GAATTAAAAG ATTGGCTTAG TGAACAAAAT 
TCAGCAATAA TTTTTATCCC TACTATGGGA GGACTTCACC CTGGGCACCA ATATTTAATT
CAGAAAGCCA AAGAAAGAAA AACAAACACA AACCAAATTA TTCTTGTAAG TATTTTTGTA
AATCCATTGC AATTTAGCAA GGGTGAAGAC TTCAAAAAAT ATCCCAGAAA TATAAACAGA
GATGCTGAAT TAGCTTTTAG TGCAGGAGCA GACGCGATTT GGGCTCCAGA TTATGATGAA
GTTTTTCCCG GAGGCGCAGA TTCACATTTT AAAATTGAAG TTCCTAAAAC ATTGCATAAT
CAATTATGTG GTGCTGAAAG GAAAGGACAT TTTGATGGAG TCGCAACCGT TATTATTCGA
TTGATAAAAA TTATCAAACC AAAGAAACTT GTTCTGGGAG AGAAAGATTG GCAACAATTA
ATTATAATTA GAAAACTCTT TCAAGAACTA TCAATACCTG TAAAAATTGA ATCCTATTCC
ACACAAAGAG ACCAAAGTGG ATTTGCTTAT AGTTCAAGAA ATTCTTATCT TAGTGATTCT
GAAAGAGTAA ATGCTCAATC ATTACCTAAT GCAATCAAAG AAGCAAAAAC AGAATTTGAT
AAAGGGAAAG TAATAAATCT CACAAAAATA GCTTCGATAT TTAAAGAAAA TAATTTAAAA
ATTGAATATC TCAAAATTGT AGATCCATTT TCATTAAAAG AAACAGAAAA TATCAATAGA
CTATGCCTTT TGGCAGTAGC AGTAAAATGT GGGTCTACGA GGCTAATTGA TCACACTTTT
CTTATGCACA GAAAACCAAT TATTGCGATT GATGGTCCTG CTGGTGCAGG GAAAAGCACA
GTAACTAAAG CATTTGCCAA GAAACTTGGT TTTATTTATT TAGATACTGG CGCAATGTAT
CGAGCAGTGA CTTGGTTAAT CATAAGCAAT TCTATTGATC CAAATGATCA AGTGGAAATC
AAAAATATCT TGAAAGATTC AAAATTAGAA TTTAAAAGCT CAAGCTTTGT TGAGCAAAAA
ATCTTCATAA ATAATATTGA CGTAACAGAG AAGATACGAT CCCCTAAAGT GACTTCAATG
GTATCCGAAA TTGCCAAACA ACAATTTGTA AGAGAATTAT TGACGCGAAA ACAACAAGTA
ATTGGAAATA ATGGGGGCTT AGTTGCAGAA GGAAGAGACA TAGGCACAGC TGTATTTCCA
GATGCAGACC TAAAAATTTT TCTTACTGCC TCTCCAACAG AAAGAGCAAA AAGAAGAGCC
CTTGACTTAC ACAAAAGAGG TTATGAATTC TCCAGCATTG AAGATCTTGA AAAAGAAATT
AAAGAAAGAG ATAAAAAAGA TAGTGAACGA AAAATAGCCC CTTTAAAAAA AGCTCAAGAT
GCCATAGAGC TTGTAACAGA TGGTATGAAT ATTGAAGATG TATTAAAAGA GCTAATTGAT
ATTTTCAGAT CAAAGATTCC AGAGGAAGTC TGGCCAACGC CTAATTCGTG A
 
Protein sequence
MVRKIFQTNA ELKDWLSEQN SAIIFIPTMG GLHPGHQYLI QKAKERKTNT NQIILVSIFV 
NPLQFSKGED FKKYPRNINR DAELAFSAGA DAIWAPDYDE VFPGGADSHF KIEVPKTLHN
QLCGAERKGH FDGVATVIIR LIKIIKPKKL VLGEKDWQQL IIIRKLFQEL SIPVKIESYS
TQRDQSGFAY SSRNSYLSDS ERVNAQSLPN AIKEAKTEFD KGKVINLTKI ASIFKENNLK
IEYLKIVDPF SLKETENINR LCLLAVAVKC GSTRLIDHTF LMHRKPIIAI DGPAGAGKST
VTKAFAKKLG FIYLDTGAMY RAVTWLIISN SIDPNDQVEI KNILKDSKLE FKSSSFVEQK
IFINNIDVTE KIRSPKVTSM VSEIAKQQFV RELLTRKQQV IGNNGGLVAE GRDIGTAVFP
DADLKIFLTA SPTERAKRRA LDLHKRGYEF SSIEDLEKEI KERDKKDSER KIAPLKKAQD
AIELVTDGMN IEDVLKELID IFRSKIPEEV WPTPNS