Gene NATL1_15721 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_15721 
Symbol 
ID4780138 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1278488 
End bp1279570 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content47% 
IMG OID640084854 
Producthypothetical protein 
Protein accessionYP_001015394 
Protein GI124026278 
COG category 
COG ID 
TIGRFAM ID[TIGR01151] photosystem II, DI subunit (also called Q(B)) 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.131507 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACCA TTCAGCAGCA GCGTTCTTCG TTGCTCAAAG GTTGGCCACA ATTCTGCGAG 
TGGGTTACTT CCACCAACAA CCGTATCTAT GTCGGTTGGT TCGGTGTATT GATGATCCCT
TGCCTTCTTG CGGCAACAAC TTGTTTCATC GTTGCATTTA TCGCTGCTCC TCCAGTTGAT
ATCGACGGTA TCCGTGAGCC AGTAGCTGGT TCATTCATGT ATGGAAACAA CATCATTTCT
GGTGCTGTTG TTCCTTCAAG TAACGCTATC GGCCTTCACT TCTACCCAAT CTGGGAAGCA
GCAACTCTTG ATGAGTGGCT ATATAACGGT GGCCCTTACC AGCTTGTAAT CTTCCACTTC
CTTATCGGTA TCTCTGCATA CATGGGACGT CAGTGGGAGC TTTCATACCG TTTAGGTATG
CGCCCATGGA TCTGTGTTGC TTACTCAGCT CCTGTATCAG CAGCTTTCGC TGTATTCCTT
GTTTACCCAT TCGGTCAGGG TTCATTCTCT GATGGTATGC CTCTAGGAAT TTCTGGAACA
TTCAACTTCA TGTTCGTTTT CCAGGCTGAG CACAACATCT TGATGCACCC ATTCCATATG
GCTGGTGTAG CAGGTATGTT CGGTGGTGCT TTGTTCTCTG CAATGCACGG TTCTTTGGTT
ACTTCATCAC TTATCCGTGA GACCACAGGA CTTGATTCAC AGAACTATGG TTACAAGTTT
GGACAAGAAG AAGAGACATA CAACATCGTT GCAGCTCATG GCTACTTCGG TCGTTTGATC
TTCCAATATG CAAGCTTCAA CAACAGCCGT AGTCTTCACT TCTTCTTGGC TTCATGGCCA
GTGATTTGTG TTTGGTTGAC ATCTATGGGC ATCTGCACCA TGGCGTTCAA CTTGAACGGT
TTCAACTTCA ACCAGTCTGT AGTTGATACT TCAGGCAAGG TTGTACCAAC CTGGGGTGAC
GTACTTAACC GTGCAAACCT TGGTATGGAA GTAATGCACG AGCGTAATGC TCACAACTTC
CCACTTGACC TAGCAGCTGC TGAGTCTACT TCTGTAGCTC TTGTTGCACC TGCAATCGGT
TAA
 
Protein sequence
MTTIQQQRSS LLKGWPQFCE WVTSTNNRIY VGWFGVLMIP CLLAATTCFI VAFIAAPPVD 
IDGIREPVAG SFMYGNNIIS GAVVPSSNAI GLHFYPIWEA ATLDEWLYNG GPYQLVIFHF
LIGISAYMGR QWELSYRLGM RPWICVAYSA PVSAAFAVFL VYPFGQGSFS DGMPLGISGT
FNFMFVFQAE HNILMHPFHM AGVAGMFGGA LFSAMHGSLV TSSLIRETTG LDSQNYGYKF
GQEEETYNIV AAHGYFGRLI FQYASFNNSR SLHFFLASWP VICVWLTSMG ICTMAFNLNG
FNFNQSVVDT SGKVVPTWGD VLNRANLGME VMHERNAHNF PLDLAAAEST SVALVAPAIG