Gene NATL1_16211 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_16211 
Symbol 
ID4779855 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1324436 
End bp1325683 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content43% 
IMG OID640084904 
Productporin-like protein 
Protein accessionYP_001015443 
Protein GI124026327 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.191112 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCTTT TTCAGCGTTT GCTGGTAGCT CCTGCTGCTT TGGGCCTAAT GGCACCATTA 
GCTGCAAATG CCGACGTTAC TGCGGTATCT AACGATTCAG ATCTAAGTTC TGAAGTTATT
CAAGCTCGTG TTGACGGCGT TGAAGCTCAA CTCGGCGAAA TAATGGCTGG TCAGTTCTCT
TCATCTACAA AGATGAGCGG AAAGGCTGCT TTCATAACTG GTTATGTTGA CGATGATGCT
GAGTCAGATA CTGACAGCAT CACTATGGAA TATATGTATC AGTTGAACAT GAACACCAGT
TTTACTGGTG AAGACATGCT CTACACAAGA CTTAAAGCTG GTAACGTTTC TGATCATTTT
GTAGATAAAG GTCAAGGTAC TTATCTTTCA GCTGGTAAAA ACACATCAGA TTCTCTGACA
GTTGACAAGA TTTGGTATCA ATTCCCAGTT GGCGATGACG TAACTGTTTG GGTTGGTCCA
AAAATTGAGA ACTACTACAT GCTTGCAAGT TCTCCTTCTA TTTATAAGCC TGTAACTAAG
CAATTCTCTC TTGGTGGAAA CGGAACTGTT TATGGTTCCT CTACTAAAGC TGGTTTTGGC
GCCGCTTATG TACAACCAAC GGAAGATCCT TCAGCCGGTA GATTTGCAGT AAGTGTTGCT
TACACAAACC AAAGTGGTGC AAAGTCTGGC AAGGATCAAG GTCTATTTGG TGAAGATGGT
AAGTCAGCTT TGCTTACTAA GCTTGAGTAC GGAACACCTC AGTGGCAAGT ATCTGGTGCT
GTTGCTTTAA AGGAAAATGG CTGGTCTGAT AGCTATTTCA CAACAGCCGC AGGTAAGGCA
AGGTCAGCTG CTGGTTCTGA AACTGCAGTT GGTTTAAGAG CATATTGGAG ACCTGATACA
ACAGGAGCTA TTCCTGAAGT ACAGCTTGGT TATGACGTTT CTACTATTGA CGATGCACCA
ACTGGTTTTG CTGATGAAGC ATCAGGTTGG ATGGTTGGAC TTGGTTGGAA AGATCTACTA
ATCGACGGAA ACAGAGCTGG TGTTGCTTTC GGTTCAAGAG TTAGCGCTAC TTCTATTGTT
GGTGGTACAT CTGATCCTTC TGAGGATAAC AGTGTTTGGG AAGCTTACTA TTCCTTCAAA
ATTAATGATG GCGTAACAGT TACTCCTGCA ATCTTTGGTG GTTCAGATGT TGAGTCTGAA
GGCAAAGATG TTAGTGGAGC AGTTGTTCTA ACTGAATTTA GATTCTAA
 
Protein sequence
MKLFQRLLVA PAALGLMAPL AANADVTAVS NDSDLSSEVI QARVDGVEAQ LGEIMAGQFS 
SSTKMSGKAA FITGYVDDDA ESDTDSITME YMYQLNMNTS FTGEDMLYTR LKAGNVSDHF
VDKGQGTYLS AGKNTSDSLT VDKIWYQFPV GDDVTVWVGP KIENYYMLAS SPSIYKPVTK
QFSLGGNGTV YGSSTKAGFG AAYVQPTEDP SAGRFAVSVA YTNQSGAKSG KDQGLFGEDG
KSALLTKLEY GTPQWQVSGA VALKENGWSD SYFTTAAGKA RSAAGSETAV GLRAYWRPDT
TGAIPEVQLG YDVSTIDDAP TGFADEASGW MVGLGWKDLL IDGNRAGVAF GSRVSATSIV
GGTSDPSEDN SVWEAYYSFK INDGVTVTPA IFGGSDVESE GKDVSGAVVL TEFRF