Gene Synpcc7942_1112 details


Gene Information

Locus tag: Synpcc7942_1112
Symbol:
ID: 3775062
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Synechococcus elongatus PCC 7942
Kingdom: Bacteria
Replicon accession: NC_007604
Strand:
Start bp: 1129444
End bp: 1130385
Gene Length: 942 bp
Protein Length: 313 aa
Translation table: 11
GC content: 60%
IMG OID: 637799538
Product: ribosomal large subunit pseudouridine synthase D
Protein accession: YP_400129
Protein GI: 81299921
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0564] Pseudouridylate synthases, 23S RNA-specific
TIGRFAM ID: [TIGR00005] pseudouridine synthase, RluA family
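
The two length fields can be cross-checked against the coordinates. The sketch below assumes the coordinates are 1-based and inclusive and that the gene length includes one terminal stop codon; neither assumption is stated in the record itself.

```python
# Values copied from the record above.
start_bp = 1129444
end_bp = 1130385

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS length includes one terminal stop codon,
# so the protein has one residue fewer than the codon count.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 942 313
```

Under these assumptions the computed values agree with the reported 942 bp and 313 aa.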


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.0997976
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value: 0.952064
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGATC GCCTGCAACT AGAAATCACA GAAGTGCCTT CTGAGCGGCT CGATCGTTGG 
CTTGCCCAGC AGTGGCCGCA TCTGTCGCGG GCACGCCTCC AGAAACTGAT TGCAGCGGGT
CAGCTGCGGG TCAATGGAGA AGTCTGTGAT CAGAAGCGCT GGCAGCCGCG TTTGGGCGAT
CGCCTCGAAC TGGAAATGCC TCCGACGGAA GCAATCGCGC TAGCCCCTGA AGAGATTCCC
CTCGACATCC TTTATGAAGA TGCTGACCTA TTGATCGTCA ACAAAGCCGT GGGGATGGTG
GTGCATCCGG CGGCGGGACA TGACACTGGT ACGCTCGTCC ATGCTCTGCT AGCCCACTGC
GGTGACTCCC TGACGGGCAT TGGTGGGGAA CAGCGGCCGG GTATTGTCCA TCGTCTAGAC
AAGGACACCA CAGGGGCGAT GGTGGTGGCG AAAACAGAAG CCGCCCTGCT GTCTTTACAA
GATCAAATCC GCCAGAAAAC CGCCCAGCGG GAATATCTGG GTGTGGTCTT TGGTTCGCCT
CGTCAAGATA GTGGCCAGGT TGAAGAGCCG ATTGGCCGCC ATCTGCGCGA TCGCAAACGA
ATGGCGGTTG TACCGATTGA ACGGGGCGGG CGTTGGGCAC TCACCCACTG GCAGGTCAGG
GAACGGCTCG GTAACTATGC GCTGCTGCAC TATCGGCTGG CAACGGGCCG CACCCACCAA
ATCCGCGTTC ACAGTCATCA CATGGGGCAT CCACTAGTGG GCGATCCGCT CTACGGCAAT
GGGCGATCGC TCGGGGTCAA TCTGCAGGGA CAAGCCCTCC ATGCCTGGCG ACTGAGTTTG
CAACATCCCC GCACGGGCGA GGTGATCGCG GTGGAAGCCC CACTACCGGC AGAATTTCAA
CGTCTGTTGC GTGTCCTTCG CGATCGGAGT GCCCAATCGT GA
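
A GC content figure such as the one reported for this gene can be recomputed from a sequence printed in 10-base blocks like the one above. This is a minimal sketch; `gc_percent` is a hypothetical helper name, and the whitespace stripping is an assumption about how the sequence text is pasted in.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()  # drop spaces and line breaks
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Usage on the first three 10-base blocks of the gene sequence.
first_blocks = "ATGAGCGATC GCCTGCAACT AGAAATCACA"
print(round(gc_percent(first_blocks), 1))
```

Passing the full gene sequence to the same function gives a value to compare with the reported GC content.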
 
Protein sequence
MSDRLQLEIT EVPSERLDRW LAQQWPHLSR ARLQKLIAAG QLRVNGEVCD QKRWQPRLGD 
RLELEMPPTE AIALAPEEIP LDILYEDADL LIVNKAVGMV VHPAAGHDTG TLVHALLAHC
GDSLTGIGGE QRPGIVHRLD KDTTGAMVVA KTEAALLSLQ DQIRQKTAQR EYLGVVFGSP
RQDSGQVEEP IGRHLRDRKR MAVVPIERGG RWALTHWQVR ERLGNYALLH YRLATGRTHQ
IRVHSHHMGH PLVGDPLYGN GRSLGVNLQG QALHAWRLSL QHPRTGEVIA VEAPLPAEFQ
RLLRVLRDRS AQS
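
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the standard codon assignments, which translation table 11 shares for non-start codons; table 11 differs in the set of permitted start codons, which does not matter here because this gene starts with ATG. `translate` is a hypothetical helper, and it assumes the input contains only A, C, G and T.

```python
BASES = "TCAG"
# Amino acids for the 64 codons, enumerated in TCAG order per position.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a DNA string, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop spaces and line breaks
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate("ATGAGCGATC GCCTGCAACT AGAAATCACA"))  # MSDRLQLEIT
```

The printed residues match the first block of the protein sequence above; translating the full gene sequence allows the same comparison over all 313 residues.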