Gene Cphy_2789 details


Gene Information

Locus tag: Cphy_2789
Symbol:
ID: 5742104
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium phytofermentans ISDg
Kingdom: Bacteria
Replicon accession: NC_010001
Strand:
Start bp: 3394840
End bp: 3395880
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 35%
IMG OID: 641293880
Product: RluA family pseudouridine synthase
Protein accession: YP_001559888
Protein GI: 160880920
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0564] Pseudouridylate synthases, 23S RNA-specific
TIGRFAM ID: [TIGR00005] pseudouridine synthase, RluA family
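
The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3394840, 3395880)
print(length)                  # 1041
print(protein_length(length))  # 346
```

Both values agree with the Gene Length and Protein Length fields of this record.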


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCCATTA CTAATGGCCA TCTATATAAT ATAGTAAAGG ATGTTAATAA AGTGAGAGAA 
ATCACCATAG TACAAAATGA GGCAGGTCAA AGACTGGATA AGTTCTTAGC AAAATATCTT
AATAAAGCGC CTAAGAGTTT TTTTTATAAA ATGCTTCGAA AGAAAAATAT TACCTTGAAT
GGAAAAAAGG CAGAAGGAGC AGAAAAACTT ATAGAGGGGG ATATCGTTCG TCTTTTCCTA
GCAGAGGAGA CAATAGAAAG CTTCAAAGAA TCTTATCAGC TTGATGCTAA GGTGAATCAA
AGAGCTGTAA AACTAGATGT TTTATACGAA GATTCTCATG TAGTTATTAT TAATAAACCG
ATTGGTATGT TATCACAACG TGCAAAAGAA TCCGATGTAT CATTGGTTGA GCTATTAATT
GCTTATCTTT TAGAGATGGG GAGTTTGACA AAAGAGGAGT TATCGACATT TAAACCATCG
GTATGCAATC GATTAGACCG AAATACCAGC GGGATAGTAA TTGCCGGTAA AAGCTTACTA
GGACTTCAGG AGATGTCAGC AAAGCTACAG GATCGTAGTC TTCATAAATA TTATCGCTGT
ATTGTCAAAG GAACGATGAC TAAGGGTGCT CGAATCAATG GATATCTGGC TAAAGACGAG
AAGACGAATA AAGTAAGGAT TACTACGAAT GATCCTAACG ATGGTGAAAG TTCCTACATT
GAGACTGAAT ATCAACCGAT ATTAAGTAAG AACGGATATA CACTCTTAGA GGTATTATTA
ATCACCGGTA AGACTCATCA GATTCGTGCT CACTTAAGTA GCATTGGGCA TCCGATTATT
GGTGATACGA AATATGGTGA TGAAACGCTG AATAAAAAAA TGCAAAAACA GTATGGCTTA
AGTCATCAGT TACTTCACTC TTACCGATTA GAGTTTCCTA ATCTTCCTAA AGAGCTAGAG
AAATTAAGTA ATCAAAAGAT AATAGCACCA TATCCTAAAT TATTCAAAAA CTTAGAGAAA
AGTTTATTTA GCGATAACTA G
 
Protein sequence
MPITNGHLYN IVKDVNKVRE ITIVQNEAGQ RLDKFLAKYL NKAPKSFFYK MLRKKNITLN 
GKKAEGAEKL IEGDIVRLFL AEETIESFKE SYQLDAKVNQ RAVKLDVLYE DSHVVIINKP
IGMLSQRAKE SDVSLVELLI AYLLEMGSLT KEELSTFKPS VCNRLDRNTS GIVIAGKSLL
GLQEMSAKLQ DRSLHKYYRC IVKGTMTKGA RINGYLAKDE KTNKVRITTN DPNDGESSYI
ETEYQPILSK NGYTLLEVLL ITGKTHQIRA HLSSIGHPII GDTKYGDETL NKKMQKQYGL
SHQLLHSYRL EFPNLPKELE KLSNQKIIAP YPKLFKNLEK SLFSDN
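
The sequences above are printed in blocks of ten characters, so whitespace has to be removed before any analysis. The following is a minimal sketch, not the database's own tooling, of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared with the protein sequence. It assumes the sequence contains only A, C, G, and T. The codon table is the standard amino acid assignment, which translation table 11 shares for internal codons; the alternative start codons of table 11 are not handled. All function names are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# amino acid mapping for internal codons (alternative starts not handled).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and newlines used for block formatting."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence from this record.
first_line = "ATGCCCATTA CTAATGGCCA TCTATATAAT ATAGTAAAGG ATGTTAATAA AGTGAGAGAA"
print(translate(first_line))  # MPITNGHLYNIVKDVNKVRE
```

The translated prefix matches the first twenty residues of the protein sequence. Passing the full gene sequence to `gc_content` and `translate` would allow the same comparison against the GC content and Protein Length fields.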