Gene Cyan8802_4020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_4020 
Symbol 
ID8393371 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp4138025 
End bp4138945 
Gene Length921 bp 
Protein Length306 aa 
Translation table11 
GC content46% 
IMG OID644981940 
Productpseudouridine synthase, RluA family 
Protein accessionYP_003139653 
Protein GI257061765 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0564] Pseudouridylate synthases, 23S RNA-specific 
TIGRFAM ID[TIGR00005] pseudouridine synthase, RluA family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGAATC AAGGTTGGAT CTATCGAGAA CAGGTCAACC CTTCTGATGC CGGGTTAACC 
ATTTTGGCTT ACTATACTCA ACGGTATCCT CACTCTAGTC AGGGTCAATG GCAAGAACGG
ATTATTTCTG GGCAAATTTT GCTCAATGGT CAACCAACTA CCCCTGATAC CCCGTTACAA
CCTGGACAAT GCTTAAGCTA TCATCGTCCT CCGTGGCAAG AACCCGATGT TCCCCTCGCT
TTTGAGGTGT TATATGAAGA TGTTGATGTA TTAGTGGTGG CTAAACCATC GGGACTTCCT
GTCCTACCCG GAGGAGGTTT TCTTGAACAT ACCTTATTAG GACAATTAAA GCGATTATAT
CCCCAAGAAA CCCCTACGCC CATTCATCGT TTAGGTCGGG GAACCTCTGG ACTGATGTTA
TTAGCGCGAT CGCCTTTGGC CGCTTCCCAT CTCAGTAAAC AAATGCGCCA AGGTCAGATG
ACTAAAGTGT ATCATGCTCT AGTGGGGGCG GGCGATTTAC CTCATCAGTT TTCGATTCAT
CAACCCATTG GCAAAATTCC CCATCCCGTT TTAGGCTATG TTTACGGTGC GACTCCCGAT
GGATTATTTG CCCATAGTGA TTGTAGGGTA TTACAACGAT CAACTCAAGG GACGTTAGTC
GAAGTAAGGA TTTTTACAGG ACGACCCCAT CAAATTCGCA TCCATTTAGC GTCTGTTGGC
TATCCCTTAC TCGGAGATCC TTTATATGAG GTTGGGGGTA TTCCCCGGAC TGCACCCAAA
ATTGAGGTCA ATAAACTGCC TGTCCCTGGA GATTGTGGTT ATTTGCTTCA TTCCCATCTA
CTGGGGTTTA CCCACCCTCG AACCCATGAA CCCTTACAAT TCGTTAAAAA TCGTCAATTC
TTAGTGGATT GTTGCATTTA A
 
Protein sequence
MLNQGWIYRE QVNPSDAGLT ILAYYTQRYP HSSQGQWQER IISGQILLNG QPTTPDTPLQ 
PGQCLSYHRP PWQEPDVPLA FEVLYEDVDV LVVAKPSGLP VLPGGGFLEH TLLGQLKRLY
PQETPTPIHR LGRGTSGLML LARSPLAASH LSKQMRQGQM TKVYHALVGA GDLPHQFSIH
QPIGKIPHPV LGYVYGATPD GLFAHSDCRV LQRSTQGTLV EVRIFTGRPH QIRIHLASVG
YPLLGDPLYE VGGIPRTAPK IEVNKLPVPG DCGYLLHSHL LGFTHPRTHE PLQFVKNRQF
LVDCCI