Gene NATL1_19581 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_19581 
SymbolpsaB 
ID4779256 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1616109 
End bp1618337 
Gene Length2229 bp 
Protein Length742 aa 
Translation table11 
GC content41% 
IMG OID640085248 
Productphotosystem I P700 chlorophyll a apoprotein A2 
Protein accessionYP_001015778 
Protein GI124026663 
COG category 
COG ID 
TIGRFAM ID[TIGR01336] photosystem I core protein PsaB 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAACTA AATTTCCATC TTTTAGTCAG GGTCTTGCTC AAGACCCTAC AACCAGAAGA 
ATCTGGTACG GCATAGCTAC TGCCCATGAT TTCGAAAGTC ATGACGGCAT GACTGAGGAG
CAGTTATATC AAAAACTCTT CTCTACTCAT TTTGGTCACT TGGCCATAAT TGGCCTTTGG
GTGGCTGGCA ACCTTTTTCA CATTGCTTGG CAAGGAAACT TTGAGCAATG GGTTCTAGAT
CCACTCCACA CGCGTCCAAT TGCACATGCG ATTTGGGATC CACATTTTGG ACAAGGACTT
ACTGATGCAC TAACTCAAGC TGGAGCGACT TCTCCAGTCA ACATTGCTTA CTCAGGCCTA
TACCATTGGT GGTACACCAT CGGCATGAGA ACTAATGAAC AGCTTTTTCA AGGTGCGATC
TTTATAAACA TCCTTGTTTG TTGGTTATTG TTTGCAGGAT GGCTTCATCT TCAACCTAAG
TACAGACCGT CATTAGCATG GTTCAAAAAT GCTGAGTCTC AGTTAAATCA TCACTTGGCT
GTTCTTTTTG GATTCAGTAG CATCGCTTGG ACAGGTCACT TAATTCATGT AGCCATCCCT
GAGTCAAGGG GTATACACGT TGGCTGGGAA AATTGGTTAA CTGTTATGCC TCATCCAGAA
GGTTTAACTC CTTTCTTCTC AGGAAATTGG GGAGCTTATG CTCAAAACCC TGATTCGATT
GATGCAGTTT TCGGAACTTC TCAAGGAGCT GGAACAGCAA TATTTACTTT TCTGGGTGGT
CTTCATCCTC AAAGTGAATC ACTTTGGCTA ACAGATATTG CTCATCATCA TTTAGCTATA
GGAGTTGTAT TTATAATTGC TGGACATATG TATAGGACTA ACTTTGGCAT AGGTCATAGT
CTCAAGGAAA TTATTGAAGC TCATAATACA AGTCACCCAA AGGATCCACA TAGAGGTTAT
TTCGGGATAA AGCACAACGG TCTTTTCGAG ACAGTTAACA ATTCACTCCA TTTTCAGCTT
GGACTAGCAC TTGCTTCTCT AGGAGTTGCT TGCAGCTTAG TTGCTCAGCA TATGGGTGCA
TTACCTTCAT ATGCATTTAT CGCAAGGGAT TACACAACTC AGTCAGCTCT ATATACCCAT
CATCAATACA TAGCAATGTT CTTGATGGTT GGTGCGTTCT CTCACGGAGC AATTTTCTTT
GTCAGAGATT ATGATCCAGA GCTTAATAAA GACAATGTTC TAGCAAGAAT CCTTAGTACA
AAAGAAGCCT TAATTAGCCA CTTAAGTTGG GTAACAATGC TTCTAGGCTT CCATACTCTT
GGAATTTATG TTCACAACGA TGTAGTTGTA GCCTTTGGTA CACCTGAAAA GCAGATTCTT
ATAGAACCAG TATTTGCACA GTTCGCACAG GCTGCTAGTG GAAAAATGAT GTATGGATTT
AATGCGTTGC TAGCAAATGC ATCAAGTTCT GCATCTATAG CTGCTAATTC CATGCCAGGT
AATCACTACT GGATGGACAT GATCAATAGA CCAGATGCAC TAACCAATTT CTTACCTATT
GGACCTGCAG ATTTCTTAGT TCACCATGCG ATTGCACTAG GACTACATAC AACTGCTTTG
ATTCTTATAA AGGGTGCTCT TGATGCTAGA GGAACAAAAC TAATTCCAGA TAAAAAGGAT
TTAGGATTTG CTTTCCCATG TGATGGACCA GGTCGTGGTG GTACTTGTGA TAGTTCTTCT
TGGGACGCTA CTTATTTAGC AATGTTCTGG GCCTTAAATA CTATTGCTTG GATTACCTTC
TATTGGCATT GGAAACATCT AGCAATTTGG ATGGGTAATA CCGCTCAATT CAATGAATCT
GGTACCTATT TAATGGGTTG GTTTAGAGAT TATCTATGGC TAAATAGTTC TCAGCTAATC
AATGGATATA ACCCATTTGG TGTAAATGCT TTATCTCCAT GGGCTTGGAT GTTCTTATTT
GGTCATCTCA TTTGGGCGAC TGGATTTATG TTCCTCATCT CTTGGAGAGG ATATTGGCAA
GAACTTATCG AAACTCTTGT TTGGGCTCAT CAAAGAACAC CAATAGCTAA TTTAGTTGGT
TGGAGAGATA AGCCTGTTGC ATTATCCATT GTCCAAGCAC GATTAGTTGG TTTAACCCAT
TTCACAGTTG GAAACTTTGT AACCTTTGGT GCATTTGTTA TCGCATCTAC TTCAGGTAAG
TTCGGATAA
 
Protein sequence
MATKFPSFSQ GLAQDPTTRR IWYGIATAHD FESHDGMTEE QLYQKLFSTH FGHLAIIGLW 
VAGNLFHIAW QGNFEQWVLD PLHTRPIAHA IWDPHFGQGL TDALTQAGAT SPVNIAYSGL
YHWWYTIGMR TNEQLFQGAI FINILVCWLL FAGWLHLQPK YRPSLAWFKN AESQLNHHLA
VLFGFSSIAW TGHLIHVAIP ESRGIHVGWE NWLTVMPHPE GLTPFFSGNW GAYAQNPDSI
DAVFGTSQGA GTAIFTFLGG LHPQSESLWL TDIAHHHLAI GVVFIIAGHM YRTNFGIGHS
LKEIIEAHNT SHPKDPHRGY FGIKHNGLFE TVNNSLHFQL GLALASLGVA CSLVAQHMGA
LPSYAFIARD YTTQSALYTH HQYIAMFLMV GAFSHGAIFF VRDYDPELNK DNVLARILST
KEALISHLSW VTMLLGFHTL GIYVHNDVVV AFGTPEKQIL IEPVFAQFAQ AASGKMMYGF
NALLANASSS ASIAANSMPG NHYWMDMINR PDALTNFLPI GPADFLVHHA IALGLHTTAL
ILIKGALDAR GTKLIPDKKD LGFAFPCDGP GRGGTCDSSS WDATYLAMFW ALNTIAWITF
YWHWKHLAIW MGNTAQFNES GTYLMGWFRD YLWLNSSQLI NGYNPFGVNA LSPWAWMFLF
GHLIWATGFM FLISWRGYWQ ELIETLVWAH QRTPIANLVG WRDKPVALSI VQARLVGLTH
FTVGNFVTFG AFVIASTSGK FG