Gene NATL1_16401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_16401 
Symbol 
ID4780812 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1337575 
End bp1339914 
Gene Length2340 bp 
Protein Length779 aa 
Translation table11 
GC content26% 
IMG OID640084923 
Producthypothetical protein 
Protein accessionYP_001015462 
Protein GI124026346 
COG category[H] Coenzyme transport and metabolism
[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2226] Methylase involved in ubiquinone/menaquinone biosynthesis
[COG3063] Tfp pilus assembly protein PilF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.659248 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGTGGTT TCGGGGAAAA GAAAAAAGAG AATAAAGGCT CTTCTAAAAA ACTCCAAAAG 
CTTTCAGAAA AGGATCTGAA AGCAAAATCA ATCAATAATC ATATTAAAGG GAATTTAGAT
GAAGCAGAGA AAGGTTATAT AGCTTTTTTA AGAAATGGTT ATTCTGATGC AGATATAATT
TCAAATTATG CACTCATATG TGAAGGAAAA GGAGAAAATG AGAAAGCAAT AAGATTATAC
AAAAAATGTG CTAAAAGTTT TCCAAATCAT ATTTACTCAA AATTAAATTT ATCTTTTTTA
TATTATAAAT TAAATCAATT AGAAATAGCA GAAAAAATAA TTGAAGAAGC AATTCAATTA
AAGCCAAGCA TGCCAAATGG GCATTGTATC AGAGGATTGA TTTTAAAAGG TTTAGATAAA
TATGATGAGT CAAGACTATC ATTTGAGAAA GCAATCGAAC TAGATAAAAA TTACTTTGAT
GCTTATATCA ATCTAGGATT ATTGAACAAA GATTCTAATA AATATAATGA AGCAGAAGAA
TGTTATTTAA AAGCATTAGA AATAAATAAT AAATCTGCTA TCGCTCATTT GAATCTAGGC
GCATGCTACA AAGAGAAACA AGATCTAGAT AAAGCCATTT TGCATACTAA GATGGCAATA
GAGATAGATA ATAAATTAGA AAACTGTTAT TTAAATCTAG CTACAATATA TAACCAAATT
GGAGATTATA AAAAATCACT CTCACTTACA AAAAAAGAAC TTTTATTGCA TAAGCATAGT
GAGTTAAGTT ATCAACTTAT AAGTGAATTA ATCAAAAAAG GAGAAGTATT AATCACATCG
GAGAAAGATA ATAGAGAATT ACTTAAAAAT TTATTAAATA GAAAAGATAT ATCTCATCGA
GAACTATTTG GGAATATAAA CAGCCTTATC TCGAAAGAAA TATTAGAAGA ATTATCTATT
TTAGAATCGA AGTTGTATGA AAACAATAAA TTTAATATTT TAATTAAGGA CAAGGAATTA
GTAAAAGCAC TTTCTTTACT AATATTTTGC TCTCCATTAT GGGAAAAGGT TTTAGGAAAT
ATACGTAAGA ATATTCTATT AAACTATTCA GATAAAGATA AAATAAGTAA TAGTATTTTC
AACTTTATAA TAGGGCTTGG ATCACAATGT TTCTTAAATG AGTATGTCTA TTACATATCT
ACAGAAGAAA AGGATAAACT AAAAGAACTT AAAAAGATAA TCAATAATAA TAAAAATCAA
GACTATACAT TAGCAATAAT TTCTTGCTAC CAATCTCTAT CTTCAATAAA TGATGAAATA
ATTAATTTAA ATACTTATAT ACCAAATAAA AAAGAATTAA ATAATCTTCT TAATTTACAA
TTTAAAGAGT TAAATGCTGA AAAAATGATT TCAAAAGGAA TTAAAAAGAT AGGAAATATA
AAAGATTCAA CATCTAAAGA AGTTAAAAAC CAGTATGAAT TAAACCCATA TCCTAGATGG
AGATACAATT CATATACTAA AGAAAACAAA CTGAACTTTT TTTCAGTTAT TAATTCAGAA
ATTTCACCGA ATACAATTAA ACCTAATTCA GCTCAATTAA CAAATAAAAA AATAAATATT
CTTATAGCAG GTTGTGGAAC GGGTATTCAA ATAATTGAAG CATCTCGGTA TAGTAATTGT
GAAATAACAG CCATCGATCT AAGCAATTCA AGTATCTCAT ATGCAAAGAG AAAGGTCGAT
GAATATGGAT TGAAAAATAT CAATTTTATA GAAATGGATT TACTTGAATT GACATCGCTA
AATAAAAGAT TTGATTTAAT AGAATGTTCA GGTGTTCTTC ACCATATGAA TGAGCCTATT
AAGGGATTAT CAAATCTATT TGAAGTATTA GAACCAGAAG GCTTTTTAAA GTTAGGTTTG
TACAGCAAGT ATGCAAGAGA AGAAATCCTA AAAGCAAGAA AACTAATCAA AGAAAAAGAT
ATTAAACCAA ACATTGATGG AATAAGAAAC TTCCGAAATG ATCTTCTGAA TGGAGAAATT
AAAGAGGTAA ATGAGATAAG TAATTGGTCA GATTTCTACT CAACTTCAAT GTGTAGAGAT
CTTTGCTTTC ATATCCATGA AAACTGTTAC ACGCTAATCG AAATTAAAAA CATGTTAAAA
GTATCTAATC TGGAATTCCT AGGTTTTACT CTTTCAAAAG AAATTAGAGA TAAGTATCAG
ATAGATAATA AAGATAAAGA CTCTTTAAAA AATTTAGAAT TATGGGATAA ATTTGAAAAA
TTAAATCCTA AATCTTTTAG AGAAATGTAT CAATTCTGGT CTAGGAAATC AACTAAATAG
 
Protein sequence
MSGFGEKKKE NKGSSKKLQK LSEKDLKAKS INNHIKGNLD EAEKGYIAFL RNGYSDADII 
SNYALICEGK GENEKAIRLY KKCAKSFPNH IYSKLNLSFL YYKLNQLEIA EKIIEEAIQL
KPSMPNGHCI RGLILKGLDK YDESRLSFEK AIELDKNYFD AYINLGLLNK DSNKYNEAEE
CYLKALEINN KSAIAHLNLG ACYKEKQDLD KAILHTKMAI EIDNKLENCY LNLATIYNQI
GDYKKSLSLT KKELLLHKHS ELSYQLISEL IKKGEVLITS EKDNRELLKN LLNRKDISHR
ELFGNINSLI SKEILEELSI LESKLYENNK FNILIKDKEL VKALSLLIFC SPLWEKVLGN
IRKNILLNYS DKDKISNSIF NFIIGLGSQC FLNEYVYYIS TEEKDKLKEL KKIINNNKNQ
DYTLAIISCY QSLSSINDEI INLNTYIPNK KELNNLLNLQ FKELNAEKMI SKGIKKIGNI
KDSTSKEVKN QYELNPYPRW RYNSYTKENK LNFFSVINSE ISPNTIKPNS AQLTNKKINI
LIAGCGTGIQ IIEASRYSNC EITAIDLSNS SISYAKRKVD EYGLKNINFI EMDLLELTSL
NKRFDLIECS GVLHHMNEPI KGLSNLFEVL EPEGFLKLGL YSKYAREEIL KARKLIKEKD
IKPNIDGIRN FRNDLLNGEI KEVNEISNWS DFYSTSMCRD LCFHIHENCY TLIEIKNMLK
VSNLEFLGFT LSKEIRDKYQ IDNKDKDSLK NLELWDKFEK LNPKSFREMY QFWSRKSTK