Gene BBta_5034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_5034 
Symbol 
ID5150443 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp5261342 
End bp5262718 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content69% 
IMG OID640559812 
Productribosomal large subunit pseudouridine synthase C 
Protein accessionYP_001240941 
Protein GI148256356 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0564] Pseudouridylate synthases, 23S RNA-specific 
TIGRFAM ID[TIGR00005] pseudouridine synthase, RluA family 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCGCC GCATCAAGCG AACCCAGACC CGATCCGACC GACCGACCGA CCGTCGCAAA 
GGCGAGCGGC CGAAGGCCGA AGCCGTGCGG AGCGCGCCTG CCAAGCGCGA TTCGCCGAAG
CGTGAGGCGA GCAAGCGCCC GGAGAGCGAC CGTCCGCGGC GCGAGCGTCT TGGCGGGGAA
CGCGAGGCCG CGTCGCGCAG CGAGTTCGGG CGCGGCAAGG CACGTCCGCC CCGTGCCGAG
CGCGACGAGC GTCGGGAGAC GTTCGAGCCG CGTGGCAAGC GCGTCGCCGC CGGCAAGCCC
GTGCGGTTCG GCGCCGAGCG GGCCGAACGC AAGCCCGTCG CCGCACCGCC ACCGCAGAAG
GCCGAGCCCG AGACGCCGCT GCTGCCGACC AAGGTGCAGA CCGTCGTGGT GACGGCGGAC
GAGAACAACA TGCGCGTCGA CCGCTTCCTC GAGGCGCGCT TTCCCGGCCT GTCATTCTCC
CACATCCAGC GCATCGTCCG TAAAGGCGAG TTGCGCGTCG ACGGCAAGCG CGCCGACAGC
AAGGATCGTC TGGAGGAGGG CCAGACCGTC CGCATTCCGC CGCTGAAGCT CGACGCGCCG
AAGGAGCGTG CCGGCCTCTC CGAGGCGGAG CGCAAGACGC TCGAGAGCCT CAAGGCGATG
ACGCTGTACG AGGACGACGA CGTGCTCGTC CTCAACAAGC CGGTCGGGCT TGCGGTGCAA
GGCGGCTCCG GCATGACGCG CCATATCGAC CAGATGCTGG AGGTGATGCG CGACGCCAAG
GGCCAGAAGC CGCGGCTCGT GCACCGCATC GACCGCGAGA CCTCGGGCTG TCTCCTGATC
GCCAAGACCC GCTTCGCCGC GACGCATCTG ACCGGCGCCT TCCGCAGCCG TTCGGCGCGC
AAGATCTATT GGGCGCTGGT GGCCGGCGTG CCCAAGCCGA AGCAGGGCCG CATCTCGACC
TATCTCGCCA AAGACGAGGG CGAGGACGAC ACCATCATGC GGGTCGCCGC CCATGGCGAC
GAGGGCGCCA GCCACGCCGT GACCTATTAT GCGGTCGTCG AGGCCTCGGC CAACAAGCTC
GCTTGGGTGT CGCTGAAGCC GGTGACCGGC CGCACGCATC AGCTGCGCGC GCATATGGCC
CATATCGGCC ATCCCATCGT CGGCGACCCC AAATATTTCA ACATCGAGAA CTGGGCGCTG
CCCGGCGGCC TGCAGAACCG GCTGCATCTC TTAGCCCGCC GCATCGTCAT CCCGCATCCG
CGCGGCGGTG TCATCGACGC GACCGCGCCG CTGCCGCCGC ATATGCTGCA ATCTTGGAAC
CTGCTCGGCC TCGAGCACGA CCGCTTCGAC CCGATCGAGA ACGCGCCGGA GGAGTGA
 
Protein sequence
MSRRIKRTQT RSDRPTDRRK GERPKAEAVR SAPAKRDSPK REASKRPESD RPRRERLGGE 
REAASRSEFG RGKARPPRAE RDERRETFEP RGKRVAAGKP VRFGAERAER KPVAAPPPQK
AEPETPLLPT KVQTVVVTAD ENNMRVDRFL EARFPGLSFS HIQRIVRKGE LRVDGKRADS
KDRLEEGQTV RIPPLKLDAP KERAGLSEAE RKTLESLKAM TLYEDDDVLV LNKPVGLAVQ
GGSGMTRHID QMLEVMRDAK GQKPRLVHRI DRETSGCLLI AKTRFAATHL TGAFRSRSAR
KIYWALVAGV PKPKQGRIST YLAKDEGEDD TIMRVAAHGD EGASHAVTYY AVVEASANKL
AWVSLKPVTG RTHQLRAHMA HIGHPIVGDP KYFNIENWAL PGGLQNRLHL LARRIVIPHP
RGGVIDATAP LPPHMLQSWN LLGLEHDRFD PIENAPEE