Gene NATL1_03761 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_03761 
Symbol 
ID4779594 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp347903 
End bp348919 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content39% 
IMG OID640083644 
ProductYcf48-like protein 
Protein accessionYP_001014205 
Protein GI124025089 
COG category[R] General function prediction only 
COG ID[COG4447] Uncharacterized protein related to plant photosystem II stability/assembly factor 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.156925 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACGTC TGTTTTCAAA TGTCATTAAT TTGACCCTTG TCTTAATAGT TGGAGTAGCT 
CTTAGTGGTT GTACTGTTAG CAATGCTTCC ATAGGCTCAT CAAGTCCTTG GTCACTAGTT
GACCTAGATA CTGAGGCAAA TCCATTAGAT GTTGACTTTG TAGATGACAA AAATGGGTTC
TTAGTAGGGA CAAACAGATT AATTCTTGAG ACAAATGATG GAGGAATAAC TTGGAAAGAA
AGAAATTTAG ACATACCAAG CGAAGGTAAT TTTCGTTTAA TAAGCGTTGA TTTCAAAGGC
CAAGAAGGTT GGATTGCAGG TCAACCAGGA TTGATTCTTC ATACCACTGA TGGAGGAAAA
AATTGGACTC GTCTTGATTT GGGAAATAAG TTACCTGGAG ATCCGTATTT GATAACAACA
ATTGATACTG ATACTGCTGA GTTAGCAACT ACTGCTGGGG CAATTTATAA AACTACCGAT
GCTGGTACCA ATTGGGAAGC AATCGTTGTT GATACTTCTG GCTCTGGAGG CATAAGAGAA
TTAAGACGTA CTAATAACGG TGGATATATA AGTGTTAGTA GCCTTGGTAA CTTCTTCTCA
GTCCTAAGAC CTGGAGAAGA GATATGGAGC CCTCATCAAA GAGCAAGTAG TAAGAGAGTT
CAAAGTGTTG GTGAACAGCC AAATGGTGAT TTATGGATGC TTTCTAGAGG AGCTGAAATC
AGATTTAATG CAGATCCTGA TGATATCGAT TCATGGTCAA AGCCCATAAT CCCAATCGTT
AATGGGTATA ACTATCAAGA TCTTGTTTGG GATCCATCTA AGTCGATTTG GGCCGCTGGA
GGAAATGGAA CTTTATTAGT GAGTAATGAT CAGGGAAAAA CTTGGGAAAA AGACCCTGTT
GGTGAATCTG TTCCAACAAA TTTCATAAGA ATTCTATTTC TGGACGATTT AAATAGTGAC
AGTCCTAAGG GATTTGTATT TGGAGAAAGG GGCAATTTAC TGCGGTGGCA GGGTTGA
 
Protein sequence
MKRLFSNVIN LTLVLIVGVA LSGCTVSNAS IGSSSPWSLV DLDTEANPLD VDFVDDKNGF 
LVGTNRLILE TNDGGITWKE RNLDIPSEGN FRLISVDFKG QEGWIAGQPG LILHTTDGGK
NWTRLDLGNK LPGDPYLITT IDTDTAELAT TAGAIYKTTD AGTNWEAIVV DTSGSGGIRE
LRRTNNGGYI SVSSLGNFFS VLRPGEEIWS PHQRASSKRV QSVGEQPNGD LWMLSRGAEI
RFNADPDDID SWSKPIIPIV NGYNYQDLVW DPSKSIWAAG GNGTLLVSND QGKTWEKDPV
GESVPTNFIR ILFLDDLNSD SPKGFVFGER GNLLRWQG