Gene NATL1_15821 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_15821 
Symbol 
ID4779166 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1288627 
End bp1290075 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content29% 
IMG OID640084864 
Producthypothetical protein 
Protein accessionYP_001015404 
Protein GI124026288 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3063] Tfp pilus assembly protein PilF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.978409 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGGAAT TTCAGCATAA GAAGAATAAT GAAAAGAAAG AAAAGGAATT CAAGACATTC 
TCTGTCCCAT TTGATATAGT TCAAAACAAT ATAAATATTA CTTTTCCCAA TCATAAATTT
TCTACTTTAT CGGAAGAACA AATTATTAAT CAAGCAATAA AATTTCATTT AGATGGTAAC
CTTTCTCAAG CTTCTATATG TTATAAACAT TGTATTAATA AGGGTGTTAA AGATCCCAGG
GTTTTTTCTA ATTTTGCATG TATATTAAAA GGACTTCGTA AATTGAAAGA AGCAGAATTA
TACTTGAAAA AAGCAATAGA GCTTAAACCT GATTTTGCCG ATGCTTTTTC TAATTTGGGT
ATTGTCTCAA AGGGGCTTGG TAAATTAAAA GAAGCAGAAT TATATTTGAA AAAAGCAATA
GAGCTTAAAC CTGATTTTCC TTCTGCATAT TATTCTTTAT CAAATCTTAA ATACTCTAAA
GATGATCAGA AATGGCAGGA TAAATTGTTT TCTAAAAGTA TATTAAATAA TAAATCTAAA
AAAGCTAAAA TTGAGATTTT TTTTGCAAGA TCTAATATTC TTCATCAAAA TAAAAAACAT
AATGAAAGTT CTCTATACCT AGGACTTGCT AATCAAATTA AATTATCTAT TAAACCATCC
AATGCTGATG GTTTAATTTG TAAATCTAAA GAATTACTTG TTGAGTTTAA TAGACAAAAA
ACAAATACCA AACATCACCA GAGATCACAT CATAGTATTT TTATTGTAGG TATGCCAAGA
AGTGGATCTA CACTAGTTGA GTCTATTCTT AGTATGAACC CTAATGTTGA TGATTTGGGT
GAGATTAACA TTCTTGAGAA GTCTTTTTTG GAGAGTAAAA AGGTCGGTCA AAAATTAACT
CTTGCTGAAA TATATTGGAA GGGAATAGAT AACTATAAGA AGACATCAAA TATAACAACA
GATAAATGGT TAGATAACTA TCAATATGGA GGAATAGTTT TAAAGCAAAT ACCGAACTCC
ATATTTATTC ACTGTTTTAG AAACCCTTTA GATAATATAC TATCTATATA TCGTGCACAT
TTTTCTAAAC TTAATGAATA TGCTTCATCA CTGATTGATT GTACAAGAGT TTATATAAAT
CAAGATGAAT TGATGACAGA ATATAAAAAA CACTTTAGAT CAAAAATATA TGACTTGGAT
TATGATTTAT TAGTTAATGA TCCCAAAAAA GAAATCAAAT CATTGATCGC TTGGTTGGGA
TGGAAATGGG ATGATTCATA TCTATCTCCT CATCTAAATA CACGCTCGAT TTCAACAGCA
AGTAAGGTGC AAGTTCGGTC CCCAATTAAT TCAAAATCAT TGGGTGGATG GAAAAACTAC
AGAGATATGC TTAAACCCGC TATTGAAGTC CTTGCTAAAA ATGATCGCTA TCTAGACTTG
ATTTCTTAA
 
Protein sequence
MEEFQHKKNN EKKEKEFKTF SVPFDIVQNN INITFPNHKF STLSEEQIIN QAIKFHLDGN 
LSQASICYKH CINKGVKDPR VFSNFACILK GLRKLKEAEL YLKKAIELKP DFADAFSNLG
IVSKGLGKLK EAELYLKKAI ELKPDFPSAY YSLSNLKYSK DDQKWQDKLF SKSILNNKSK
KAKIEIFFAR SNILHQNKKH NESSLYLGLA NQIKLSIKPS NADGLICKSK ELLVEFNRQK
TNTKHHQRSH HSIFIVGMPR SGSTLVESIL SMNPNVDDLG EINILEKSFL ESKKVGQKLT
LAEIYWKGID NYKKTSNITT DKWLDNYQYG GIVLKQIPNS IFIHCFRNPL DNILSIYRAH
FSKLNEYASS LIDCTRVYIN QDELMTEYKK HFRSKIYDLD YDLLVNDPKK EIKSLIAWLG
WKWDDSYLSP HLNTRSISTA SKVQVRSPIN SKSLGGWKNY RDMLKPAIEV LAKNDRYLDL
IS