Gene PMN2A_1560 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPMN2A_1560 
Symbol 
ID3606958 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL2A 
KingdomBacteria 
Replicon accessionNC_007335 
Strand
Start bp230831 
End bp231811 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content37% 
IMG OID637688438 
Productputative pseudouridylate synthase specific to ribosomal large subunit 
Protein accessionYP_292751 
Protein GI72383396 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0564] Pseudouridylate synthases, 23S RNA-specific 
TIGRFAM ID[TIGR00005] pseudouridine synthase, RluA family 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATAAAG AAATCAAGCA TGGATTTGGG CAGGGCCCAG GTGAATTATT TGAGGGCGAA 
TATAAGAAAC CTTTGCCAAT GCGGCTTGAT AGATGGCTGG TAAGTCAACG AGCTGAACAA
AGTAGAGCTC ATATTCAGAA ATTCATTGAA GCGGGGTTCG CAAGAGTAAA TGGTAAAACT
GGAAAAGCAA AAACTCCCGT TAGGCCAGGT GACATAATTC AACTTTGGGT TCCACCACCT
GAGCCTCTTC CTTACTTAAA GCCAGAAGAA ATTGATTTAG ATATTTTGTA TGAAGATGAT
CATTTAATAG TCATTAACAA AACAGCAGGA ATAGCGGTTC ATCCTGCACC AGGTAATAAA
TCAGGAACAT TAGTTAATGG ATTAATCCAT CATTGCCCAG ACTTACCTGG CATTGGAGGT
AAATTAAGAC CTGGAATCGT TCATCGCTTA GATAAAGACA CAACTGGATG CATTGTGGTA
GCAAAGACTC AAGAAGCCTT AGTTAAACTG CAAATTCAAA TACAAAAGAG AGTAGCCTCA
AGAAATTATA TTGCTGTTGT TCATGGAGCT ATTAAAAACA ATGAAGGCAT GATTGTCGGG
AGTATTGGCC GACATCCAAA AGATAGAAAA AAATATGCTG TAGTTGATGA GGAATCTGGT
AGATATGCTT GTACACATTG GAAATTAATT AAAAATCTAG GAAATTTTTC TCTTCTTAAA
TTCAAGCTTG ATACAGGTAG AACGCATCAA ATTCGTGTCC ATAGCGCACA CATCGGTCAT
CCCATTATTG GAGACCAAAC TTATAGTAGA TGTAAAAAAT TACCTATTAA ACTTGGTGGC
CAAGCATTAC ATGCAATTGA ATTGGGTTTA ATACACCCAA TAACATTAGA AAAAATGAAA
TTCACAGCTC CATTGCCTGA AGACTTTGAA CGACTTTTAA AAGTTTTACA ACCTAAAAAT
AATATTAATC CGGAAGTCTA G
 
Protein sequence
MDKEIKHGFG QGPGELFEGE YKKPLPMRLD RWLVSQRAEQ SRAHIQKFIE AGFARVNGKT 
GKAKTPVRPG DIIQLWVPPP EPLPYLKPEE IDLDILYEDD HLIVINKTAG IAVHPAPGNK
SGTLVNGLIH HCPDLPGIGG KLRPGIVHRL DKDTTGCIVV AKTQEALVKL QIQIQKRVAS
RNYIAVVHGA IKNNEGMIVG SIGRHPKDRK KYAVVDEESG RYACTHWKLI KNLGNFSLLK
FKLDTGRTHQ IRVHSAHIGH PIIGDQTYSR CKKLPIKLGG QALHAIELGL IHPITLEKMK
FTAPLPEDFE RLLKVLQPKN NINPEV