Gene NATL1_15741 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_15741 
Symbol 
ID4780933 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1281122 
End bp1282666 
Gene Length1545 bp 
Protein Length514 aa 
Translation table11 
GC content27% 
IMG OID640084856 
Producthypothetical protein 
Protein accessionYP_001015396 
Protein GI124026280 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5010] Flp pilus assembly protein TadD, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.354121 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAAAA AAAGAAATAA GGCTATTAAT CACACAAAAA GAAATCAAGT CAAAACATTT 
CCTATCTCAT TTTCTTTATA TAAAATCAAA GATGATCTAA TAATAAATAC AAATTATCTT
TCTGAAACTC CAGAAGAAGA AATAATTAAT CAAGCAATTA AGTTTCATTT GCAAGGTAAG
ATTTCAGAGG CAATAAAATA TTATAAGCAT TGTCTTATCA AAGGTTTTCA TGATGAAAAA
GTTTTTTGTA ATTATGGAAT TATATTGAAA AATTTGGGTA AAACTAAAGA AGCAGAATTA
CTACAACTTA AAGCAATAGA AATAAAACCA AACTATGCAG AAGCTTACTC TAATCTGGGA
GTTATATATA AAGATAAAGG TAAGTTAAAG GAAGCTGAAT TATCACTTAA GAAAGCTATA
GAAATAAGAC CTAATTTTGC AAATGCTCAT AATAATCTGG GTATTATATT CAATGATCTG
GGGAAATTTA AAGAAGCTGA GTTATCTTAT TTGAAAGCGA TAGAATTAAA ACCAGATTTT
GCAGAACCAT ATTATTCTTT ATCACTTCTC AACTGCTCAA ATGAAAATAA ATTTTGGCAG
AATAGACTTT TTTCTAAAAG TATACTAAAT AATAAATCAA AAAAAGAGAA GATTGATATT
TATTTCGCTA GGTCGAATGT TTTACATAAA GAGAAAAATT ATATAGAAAG TTCTCACTAT
CTAAATTTAG CTAATCGGAT CAAACTTAAT CTTAGACCAT CAAATATTAA TCTTCGTTTA
GAAAAATCTA AAAGATTGCT TTTAGAATCA GATAAAAGAG AAACCAATAA AAAAGAAGAT
ATAAAAACTA TTAAAACAAT TTTTATTGTT GGCATGCCTA GAAGCGGATC AACATTACTT
GAATCAATAA TTAGTATGAA TCCTAATGTC GATAGCTTAG GGGAAATTAA TTTTCTGGAG
GAGTCATTCT TAGAGGAAGA ACATAATGAA CAAGGATTGA ATCTTGCTGA ATTATATTTG
GAAAAGATTA ATGTTAACAG TACTAGATCA ACTATTAAAA CTAACAAGTA TTTATATAAC
TATCAGTATG CAGGTATTAT TGCAACTTGC ATCCCTAATA GTAAAATTAT TCACTGCTTT
AGAAATCCTT TAGATAATAT ATTATCTATA TATCGAGCAC ATTTTACTAT AGGTAACGAA
TACTCTTCTT CCTTAGCAGA TTGTGCAAAT ATATATTTAG ATCAAGAAGA AATAATGGAT
ATTTATAAAA AAAAATATAG ATCAAATATT TATGATCTCA ACTATGACTT ATTGGTTAGC
AATCCTCAGG AAGAAATTAT CTCATTAATT TCATGGTTGG GATGGGAATG GGATGATTCA
TATCTATCGC ACCATTTAAA TCCACGTTCA ATTTCAACAG CAAGTAGCGT TCAAGTCCGT
TCTCCTATTA ATTCGAAATC CATTGGAGGG TGGAAGAACT ACAGAGATAT GTTAAAACCT
GTTATTGAAA TATTAGAAAA GCATGAGAAA TATAAAAATT TATAA
 
Protein sequence
MNKKRNKAIN HTKRNQVKTF PISFSLYKIK DDLIINTNYL SETPEEEIIN QAIKFHLQGK 
ISEAIKYYKH CLIKGFHDEK VFCNYGIILK NLGKTKEAEL LQLKAIEIKP NYAEAYSNLG
VIYKDKGKLK EAELSLKKAI EIRPNFANAH NNLGIIFNDL GKFKEAELSY LKAIELKPDF
AEPYYSLSLL NCSNENKFWQ NRLFSKSILN NKSKKEKIDI YFARSNVLHK EKNYIESSHY
LNLANRIKLN LRPSNINLRL EKSKRLLLES DKRETNKKED IKTIKTIFIV GMPRSGSTLL
ESIISMNPNV DSLGEINFLE ESFLEEEHNE QGLNLAELYL EKINVNSTRS TIKTNKYLYN
YQYAGIIATC IPNSKIIHCF RNPLDNILSI YRAHFTIGNE YSSSLADCAN IYLDQEEIMD
IYKKKYRSNI YDLNYDLLVS NPQEEIISLI SWLGWEWDDS YLSHHLNPRS ISTASSVQVR
SPINSKSIGG WKNYRDMLKP VIEILEKHEK YKNL