Gene Cphamn1_1044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_1044 
Symbol 
ID6374715 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp1127202 
End bp1128242 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content51% 
IMG OID642683545 
Productpseudouridine synthase, RluA family 
Protein accessionYP_001959466 
Protein GI189499996 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0564] Pseudouridylate synthases, 23S RNA-specific 
TIGRFAM ID[TIGR00005] pseudouridine synthase, RluA family 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAAAAC AGCAGGTAAA AAGCGAACAG GAAGAACAGG ACAGAGAACA TCAGGAACCG 
AAAAAAATGA CGCTTCAGGT TGCACAGACC CAGAAACCGA TGCGTATCGA TGTCTATCTC
GCCCAGCAGG TTGAAAACGC CACCAGAAAC AAGGTTCAGG AAGCAATTTC GGAACACCGC
GTACGGGTTA ACGGAAAAAC CGTCAAAGCC AATTACAAGA TAAAATCTCT CGATTCCATA
GAGATCACCT TTCTCCGCCC TCCCGCACCG GAACTCGCTC CTGAAGATAT CCCCGTCGAC
ATCATCTATG AGGACAACGA TCTTATGGTA ATCAATAAAG CTCCCGGCAT GGTGGTCCAT
CCCGCATTCG GCAACTGGAC GGGAACGCTT GCCAACGCCA TCCTTCACCA TCTCGGCACG
GATGCAGAAA AACTCGATAC AACGGAATTA CGTCCCGGCA TCGTTCACCG GCTGGACAAA
AACACCTCGG GACTGATCAT TGTCGCCAAA CACGCTACGG CCCTGCACCG TCTGGCAAAA
CAGTTCGCGG AGCGTCAGGT CGAAAAAAAA TATCAGGCGA TTGTCTGGGG CGTTCCGGAG
CCTCCTGAAG GAATCGTCAA AACAAACATA GGCCGTTCGA TACGCGACCG TAAAGTAATG
ACCTCCTACG ATTTTGAAGG AAAGGAAGGA AAAACAGCGG TAACAGAGTA CCGTGTAGTG
GAAAACCTGC GCTATTTCTC ACTTGTCGAG ATGATCCTCC ACACAGGCCG AACGCATCAG
ATCAGAGTTC ACCTCAAACA TATAAACGCG CCTATTCTCG GAGACGAAAC CTATGGAGGG
GCCGGAGTAC AGTCCCTTCC CTTCAGCAAA AGCGAAAGCT TCGTCAAGAA CCTCCTGGAG
CGTATCCCGC GCCAGGCGCT CCACGCCGCG AGCCTGAGCT TTTTCCAGCC CACAACCCGA
GAAAGGATTA CCCTGTCAGC CCCACAGCCG GAAGATATGC AGGCGGCACT GGATAAGATT
AAAAGAATGT TGAACTGTTG A
 
Protein sequence
MQKQQVKSEQ EEQDREHQEP KKMTLQVAQT QKPMRIDVYL AQQVENATRN KVQEAISEHR 
VRVNGKTVKA NYKIKSLDSI EITFLRPPAP ELAPEDIPVD IIYEDNDLMV INKAPGMVVH
PAFGNWTGTL ANAILHHLGT DAEKLDTTEL RPGIVHRLDK NTSGLIIVAK HATALHRLAK
QFAERQVEKK YQAIVWGVPE PPEGIVKTNI GRSIRDRKVM TSYDFEGKEG KTAVTEYRVV
ENLRYFSLVE MILHTGRTHQ IRVHLKHINA PILGDETYGG AGVQSLPFSK SESFVKNLLE
RIPRQALHAA SLSFFQPTTR ERITLSAPQP EDMQAALDKI KRMLNC