Gene NATL1_08351 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_08351 
Symbol 
ID4780599 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp767008 
End bp768603 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content30% 
IMG OID640084110 
Producthypothetical protein 
Protein accessionYP_001014658 
Protein GI124025542 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0702953 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.19264 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAGCGC CTTCACTGTT AGGCGAGTCA TTGGCATTAC AGCTAACATC TCAAGATGAT 
AATTTAGAAA TAATCCTAGA CAAAAAAGAT ATAAATGGGT TACCTAAACT TATTCTATTT
TGCCTTGAAG AAGTAGAACT CTCAAACTCA ATCAAACTAG AAATTCATAA GTTAAAAGAA
AGATGGGAGC AATCTCCTGT CCTAATTGTA ATACCAAAAA GTATTAAATT ATCTTCTGAT
GATTTGATGA CTTTTGGTAG TGAAGGAGTT ATTCAAGATC CTACTGTTGA ACTTTTAAGA
GATACAATCA ATATTTTGAT TGGCGGCGGC AGAGTATTTA AAATTAATAA TGAAACAAAT
TACAATGCTG ACTCAATTCA TAATTCATAT GGCCTAGGAC ATTGGCTATT AACAAGTGGT
TTATCACAAA TAAATAAAGA TCTATACACT ATAGATCAAA TAATAGCCAA GAAATCAACA
AATACATTTT ATCTTTTCAT ATTAATAGGT AGAAGAAGAG AACTATTAAC AGCGAAGAGA
TTAATTATCT GGTTATGGGG GCCGCTAGAG GTATTAATAG AGTCTCCAAT TAAAAGTAAT
AACAATAAAA ATATAATCAA CAAATACAAT ACCGACATAA CAATAAAAAA TACCTCAACT
AATGAATTAT GGAATGTTAT TTATAAACGG GTAAAGGAGA GACTTCAAGA TGACCTTACA
AATTCTACGG GCGAGCTAGC AGCTCTTTAT TCATTAAATA AAAGCAAACG ATATAATCTT
TTAAAAACTC TTCTGAAAGA ATTTTCAACT ATTATAATCA AACTTGATTC TAAAGATAAT
AGAGAAAAAG GATTAGAGGA GATTTTACAA TCAATTACTC CTGAATTACG AGCTAATACA
TTGCGCAATT TCATAGATTC ATATGATCGT TTAAAAAAGA ATGGTGTTGA CGTTTTTATT
TCAGACTTTC TAGTACATAA TGCAGATCTC GGAATACTTG ATGATGAACT ACCATCAATA
GCATTAATAA TAGATCCAAT ATTAAATAAT AAGCCGCTTC TTATGGATGG AGACTATTTA
TCAATAGAAG ACCCTCGATC TATTATTCAA TTAGAAACAT TTATTCTAAA TTGGATATTC
AGGTCAGCTG AAATAGTTAG TGAAGAGATT ATATCTTCAT GTTCTGAATG GCCAGAATTA
CGTAAATACT TTCTAAATAA AGAATTAGTT TCAACAAGGG AACTTGAACG CAAACGAAAT
CATATCAATA CAAATAATCA ACTTCAAAAT CTATTTAAAA AGCCGGTTAG ATTATATGAA
AGTAAAAGAT TATATTATAC AGTCAAAAAC AATAACATTG AAAAAATTAT CACTCTTGAA
CCTAGAGATG ATGAATTAAA GAAACTAGAC TGGCCCCAAA GGCAAATAGC ATTTATAATA
GAATTAAGAG ATGCCTTGGC ACCACAAGTA CAGGCAATAA TTCAATACTT AGGTGATTTA
ATAGTTCTAA TCCTCACTAA AGTCGTGGGA AGATCTATAG GATTAATTGG TAGAGGTATC
GCTCAAGGTA TGGGAAGAAA CTTATCCAAA GGATAA
 
Protein sequence
MIAPSLLGES LALQLTSQDD NLEIILDKKD INGLPKLILF CLEEVELSNS IKLEIHKLKE 
RWEQSPVLIV IPKSIKLSSD DLMTFGSEGV IQDPTVELLR DTINILIGGG RVFKINNETN
YNADSIHNSY GLGHWLLTSG LSQINKDLYT IDQIIAKKST NTFYLFILIG RRRELLTAKR
LIIWLWGPLE VLIESPIKSN NNKNIINKYN TDITIKNTST NELWNVIYKR VKERLQDDLT
NSTGELAALY SLNKSKRYNL LKTLLKEFST IIIKLDSKDN REKGLEEILQ SITPELRANT
LRNFIDSYDR LKKNGVDVFI SDFLVHNADL GILDDELPSI ALIIDPILNN KPLLMDGDYL
SIEDPRSIIQ LETFILNWIF RSAEIVSEEI ISSCSEWPEL RKYFLNKELV STRELERKRN
HINTNNQLQN LFKKPVRLYE SKRLYYTVKN NNIEKIITLE PRDDELKKLD WPQRQIAFII
ELRDALAPQV QAIIQYLGDL IVLILTKVVG RSIGLIGRGI AQGMGRNLSK G