Gene NATL1_21161 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_21161 
SymbollysC 
ID4780251 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1770703 
End bp1772469 
Gene Length1767 bp 
Protein Length588 aa 
Translation table11 
GC content38% 
IMG OID640085413 
Productaspartate kinase 
Protein accessionYP_001015936 
Protein GI124026821 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0527] Aspartokinases 
TIGRFAM ID[TIGR00656] aspartate kinase, monofunctional class
[TIGR00657] aspartate kinase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.926226 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCATTGC TGGTTCAAAA ATTTGGCGGC ACCTCTCTAG GAAGTATTGA GCGCATAAAA 
GCTGTCGCGC AAAGAATCAA ATCAAGTAAA GAAAAAGGTG CTGATCTAGT AGTTGTTGTG
TCGGCCATGG GACATCAAAC TGATGAGTTA ACACGGCTAG CGTCAGAAAT AACTGTTGAT
CCTCCTCATA GAGAAATGGA TATGCTCCTC TCAACTGGGG AGCAAGTTTC AATATCATTA
TTAACAATGG CCCTGAACGA ATTGGGCACA CCAGCAATCT CTTTGACTGG AACTCAAGCT
GGAATTATCA CAGAATCAGC TCATGGAAGA GCCAGAATCC TCGAGATAAG GACAGAACGA
ATAAAAAATC TCTTAGACCA AGGTCAAACC ATAGTTATTG CTGGATTCCA AGGAACAACT
CTTGGCATAG GAGGAATTGC TGAAATTACA ACTTTAGGTA GAGGAGGTTC AGATACTTCT
GCAGTAGCTC TAGCGGCATC GCTTGAAGCT GCTACATGTG AAATTTATAC CGATGTTCCT
GGCGTACTTA CAACTGACCC AAGAATTGTG AAAAATGCAA AATTAATGAA AAGTATTAGT
TGTGATGAAA TGTTAGAACT TGCCAGCCTT GGCGCAGCTG TTTTACATCC TCGAGCAGTT
GAAATAGCAA GAAATTTCGG CGTAACTCTC GTCGTTAAAT CCAGTTGGGA CAACCTTGAT
GGAACCACTC TAACTAGTAA TAAGAAGCCT GACTTTTCTC AAGGTGGAAT AGAACATCAA
AGTCCTGTCG ATGGATTAGA ACTTCTTGAG AATCAAGCAG TCGTAGCTTT ATCTAATATT
CCAGATCGTC CAGGAATTGC TGCGGAACTT TTTGAATCTT TATCAGAGGG TGGGGTGAAT
GTCGATCTCA TTATTCAAGC GACACATCAA ATTGACTCTA ACGACATCAC TTTTACTATT
GCTGAAAATG AATTACATAA TGCACTAACT CAATGTAAAA AACTCGTTAA TACTATTGGA
GGTGATATCT CTTTTCAAAA AGATCTGACT AAACTAAGTA TTTATGGAGC TGGGATAATG
GGAAGGCCTG GAATAGCGTC ATCGCTATTC CAAATTCTAT CTGACTCTGG TATTAATATA
AGACTAATCG CAACTAGTGA AGTCAAAGTC AGTTGTGTTA TTGATGCAGA ATTAGGGAAA
AAAGCACTAC GTAATGTAAG CGAAGTTTTC AAGCTCACTG ATAAACAAAT TACCGTGAAT
CCTACGATTG AAAATAATAA CGAGCCAGAA GTAAGGGGAA TAGCTTTAGA TAAAGATCAA
ATACAAATTA GCGTGAAGAA TGTGCCAGAT AAACCAGGGA CTGCCTCATC AATATGTTCC
ACTTTAGCTG AGAAAAATAT CAGCTTAGAT ACTATAGTTC AATCTGAAAG AAAGCATAAA
GATAAAACCA AAGATATCAG CTTCACTTTA AAGAAAAATG ATAGAAGCGA TGCTAAATAT
GCATTAAAAG AATTGATTGA AAATTGGAAA GGAGCAAAAC TCGAAGAAGG AGAGTCAATA
GTACGAATTA GCGCAGTAGG TTCTGGAATG CCTTTTACAA AAGGAACAGC CGGTAAAATT
TTTAGAGCAC TAGCAAATCA AAAAATCAAC ATAGAAATGA TCGCCACAAG TGAAATAAGA
ACAACTTGTA TTATCTCAGA AAAATATGGT GAAAAAGCAT TAAATGAAAT TCATTCTTGC
TTTAAATTAG GAAAAAATAA AAGCTAA
 
Protein sequence
MALLVQKFGG TSLGSIERIK AVAQRIKSSK EKGADLVVVV SAMGHQTDEL TRLASEITVD 
PPHREMDMLL STGEQVSISL LTMALNELGT PAISLTGTQA GIITESAHGR ARILEIRTER
IKNLLDQGQT IVIAGFQGTT LGIGGIAEIT TLGRGGSDTS AVALAASLEA ATCEIYTDVP
GVLTTDPRIV KNAKLMKSIS CDEMLELASL GAAVLHPRAV EIARNFGVTL VVKSSWDNLD
GTTLTSNKKP DFSQGGIEHQ SPVDGLELLE NQAVVALSNI PDRPGIAAEL FESLSEGGVN
VDLIIQATHQ IDSNDITFTI AENELHNALT QCKKLVNTIG GDISFQKDLT KLSIYGAGIM
GRPGIASSLF QILSDSGINI RLIATSEVKV SCVIDAELGK KALRNVSEVF KLTDKQITVN
PTIENNNEPE VRGIALDKDQ IQISVKNVPD KPGTASSICS TLAEKNISLD TIVQSERKHK
DKTKDISFTL KKNDRSDAKY ALKELIENWK GAKLEEGESI VRISAVGSGM PFTKGTAGKI
FRALANQKIN IEMIATSEIR TTCIISEKYG EKALNEIHSC FKLGKNKS