Gene NATL1_00831 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_00831 
Symbol 
ID4779188 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp83844 
End bp85352 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content37% 
IMG OID640083346 
Producthypothetical protein 
Protein accessionYP_001013912 
Protein GI124024796 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.752054 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTTGGT CTCTTGCAAA TATTGAACAA CTGAGTGTGA GTGGAACAAA CGCTTTAGGT 
GTAAGTTCTC ATTTGGAAAT AAATCAAGGC GATAGTAATG ACAAATATCA GCTTTATTAC
ACAGGTGATG GAGGAGTCAC TGTCAATAGT ATGTCTACTG ATCTTGAGCT TACTAAGCAG
GGAGAAATTA AATTAATTCA AGATCTCACA ATCATTACTA CTAATGAAGG TATACGTAGG
GCTTATTATG TAGAAGTTGA TCCGAATACC GGCAATCATG AAATTCTTAC TGCATTGCTA
TCAGAGGATG GATTAACTCT TTCACAAGCC ACTAGAACAG GAATATTAGA CAACGGAGAC
AACGCTTGGG GAGTTCCTGA CTCAGTAGTT CTTCCAGATG GAAGAATCAG AATATACTGG
GTTAGTACAG ATAATACAGA TTCTGAATGG ACCGTTCCAC AAGATGATGA AATTATAAAA
AATGATGGTT TTAAGACCCC AAATGGAGAA TGGGTCTCGG CAAATGAATA TTTTGAAAAC
GACAGAAAAA TACCTGATAG TGCAACTAAA ACACTTGCTA ATGAAGTGAT ACTTAGTGCA
ACTTCTGATA CATCAAAGGG TACAGAATTT ACTGTAGATG AGGGCTATAG GACTGAAGGA
GGGTATGTCG ACTTTGAGGT TTTAAAAGCT AAAGAAAATG ATTGGTTAGC GATAATGTCT
TCTTCGCCAG TAACGATCCC TGATGAACCT CAGGGCATTT ATATAGGAAT TTCAAGTGAT
GGCCTTAGTT GGGAAATAGA TGATAATAAT TTAGCTCCTC TTGAAAGGAG TTATCTCGAT
CCTACAGGAC TCTTATTATC TAATACACCC AATAAATATC AGATTGTCAT GAGCTCATCG
CTATCAATAC TTGGCGATAG AGAATATACA TTAGTAACAG CTGAGCTTAC ATCAGCTACT
ACCACATATT TAGGCAATAG CTTTGATTAT AATTTCTTTA ATATTGGTAA TGGAGTATAT
GGAATAAGAC CAGATTCTAC TGGAACTATC GATTCATTGA CTGGAATTTC AAATATTCAA
TTCGATGATA AAAAACTAAA CATCACATCT GATATTAAAG CGACATTTGA TCAGGTCACA
GGTTTAAATA CAGACTCAGG TGAGATGTTC CGTCTCTATA ACGCTGCTTT CGCACGCTTC
CCTGATGCTG ATGGTTTGAA GTATTGGATT GAGCAATTTA GTTCTGGGAA AAATACAAGA
CGAGTTGTTG CTCAATCTTT TTTAGGTTCT GCAGAGTTCA CTGAGAAATA TGGAAGCAAT
GTAAGTGATG AGACATACGT GAATAACCTC TATAAAAATG TCCTTGGAAG AGACGCTGAT
GCAGAGGGGC TTAACTACTG GGTAGGCAAT CTCAGTAATG GAATTGAAAC TCGATACGAA
GCGCTCCTAG GGTTTTCAGA GTCAGAAGAG AACAAAGCGC TCTTTACAGA AATGACAGGT
TTTGGATAA
 
Protein sequence
MTWSLANIEQ LSVSGTNALG VSSHLEINQG DSNDKYQLYY TGDGGVTVNS MSTDLELTKQ 
GEIKLIQDLT IITTNEGIRR AYYVEVDPNT GNHEILTALL SEDGLTLSQA TRTGILDNGD
NAWGVPDSVV LPDGRIRIYW VSTDNTDSEW TVPQDDEIIK NDGFKTPNGE WVSANEYFEN
DRKIPDSATK TLANEVILSA TSDTSKGTEF TVDEGYRTEG GYVDFEVLKA KENDWLAIMS
SSPVTIPDEP QGIYIGISSD GLSWEIDDNN LAPLERSYLD PTGLLLSNTP NKYQIVMSSS
LSILGDREYT LVTAELTSAT TTYLGNSFDY NFFNIGNGVY GIRPDSTGTI DSLTGISNIQ
FDDKKLNITS DIKATFDQVT GLNTDSGEMF RLYNAAFARF PDADGLKYWI EQFSSGKNTR
RVVAQSFLGS AEFTEKYGSN VSDETYVNNL YKNVLGRDAD AEGLNYWVGN LSNGIETRYE
ALLGFSESEE NKALFTEMTG FG