Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4639 |
Symbol | |
ID | 5902101 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 5016808 |
End bp | 5018313 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641565158 |
Product | RNA-binding S4 domain-containing protein |
Protein accession | YP_001686257 |
Protein GI | 167648594 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG1187] 16S rRNA uridine-516 pseudouridylate synthase and related pseudouridylate synthases |
TIGRFAM ID | [TIGR00093] pseudouridine synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGAGA AATACCAGCC CCACCTGCAT GACAACGCCA TTCCCGGTCA AGACGACGAC GGGGAAGGCG CGCGGGTCGC CAAGATGCTG GCCCGGGCCG GCGTGGCCTC GCGGCGCGCG GTCGAACGGC TGATCGAGGA CGGCCGGGTC GCCCTGAACG GCGAGGTTCT GACCACCCCG GCCATCAAGG TCCGCCCCGG CGACATCCTG ACCGTCGACG GCAAGATGAT CGACGAGCCC GAGGCGACGC GCGTCTTCCG CTACCACAAG CCCTCCGGGC TGATGACCAC CCACAACGAT CCCAAGCAGC GCCCGACGGT GTTCCAGGCC CTGCCGCGCG ACCTGCCGCG CCTGATCTCG GTCGGACGAC TGGACCTCAA CTCCGAAGGC CTGCTGCTGC TGACCAACGA CGGGGCTCTG TCACGGGCCC TGGAGATGCC GCAGAACGCC TGGGTGCGCC GCTATCGGGC CCGCGCGTTC GGCGACACCA CCCAGGCCAA GCTGGACAAG CTGAAGGACG GCTGCACCGT CGAGGGCGTC CGATACGGCC CGATCGAGGC GCGGCTCGAC AAGGCCCAGG AAAAGGCCGG CGGCGGCAAG AACATCTGGA TCACCCTGAC CCTCAGCGAG GGCAAGAACC GCGAAGTGCG GCGGGTGCTG GAATCCATCG GCCTGAAGGT CAACCGCCTG ATCCGCCTGT CCTACGGCCC GTTCGCGCTC GGAACCCTGC TGCCGGGCCA GGTCGAGGAG GTCGGTCCCC GGGTGATCCG CGAGCTGCTG GAAGGCATCG TCGCCGAAGA GAACATGCCC AAGGGCGACA AGCCGCAATT CATCGGCGTG GCCGATCCGC TGAAGGCCGT CGGCACCGCG GGCGGCGGCG ACATGCAGCG GCGCGGCGTG CCGCGCACCA ACAAGCTGAC CCAGGTCTCG ATCATCACGC CCGAGGAGCC GGTCGAGGAA GAGAAGTTCG TCCGCAAGCC GGGCTGGGCC AAGCCCAAGA AGAAGCCGGC GATCGTCGGT CGCGAGCCCG TGCGCACAGC CAAGAAGTCG ATCGAGAGCA AGATGATCGG CCCAAAGCCC CTGTCCTACC GCGACGCCGC CGCCAAGCGG GTCCGCGACA AGGGCATGGC CGACAAGAAC GCGGCCGACA AGCGGGCGGC GAGCGGCAAG CCGGCGCGGC CCGACAAGCC TGCCGGCCAG CCCTCCGGCG GGCACACGTC CAAGCCGAAG CTCGGCGCGC TGCGGGCCAA TGGCTACAAG CCGCTGACCG AGGGTCCGGC AAGGTCGGCG GGCAAGCCCG GCGGCGCGCG TCCGGGCGGC AAGCCCAGCA CGGTCAAGGC CGCCGGCGCG GGCAAGCCTG GCGAACCGGG CAAGGTGTGG TCCAAGCCCG GCATGGCAAA GTCGGCCGGG CCTCGCCCCG ACGGCCCCAA GGGTCCGCCG CGCGCCGGCG GCAAGCCGGG CGGCCCACGT CCGGGCGGTT CGAGCGCGCC GCGCGGCAAG CGATAG
|
Protein sequence | MTEKYQPHLH DNAIPGQDDD GEGARVAKML ARAGVASRRA VERLIEDGRV ALNGEVLTTP AIKVRPGDIL TVDGKMIDEP EATRVFRYHK PSGLMTTHND PKQRPTVFQA LPRDLPRLIS VGRLDLNSEG LLLLTNDGAL SRALEMPQNA WVRRYRARAF GDTTQAKLDK LKDGCTVEGV RYGPIEARLD KAQEKAGGGK NIWITLTLSE GKNREVRRVL ESIGLKVNRL IRLSYGPFAL GTLLPGQVEE VGPRVIRELL EGIVAEENMP KGDKPQFIGV ADPLKAVGTA GGGDMQRRGV PRTNKLTQVS IITPEEPVEE EKFVRKPGWA KPKKKPAIVG REPVRTAKKS IESKMIGPKP LSYRDAAAKR VRDKGMADKN AADKRAASGK PARPDKPAGQ PSGGHTSKPK LGALRANGYK PLTEGPARSA GKPGGARPGG KPSTVKAAGA GKPGEPGKVW SKPGMAKSAG PRPDGPKGPP RAGGKPGGPR PGGSSAPRGK R
|
| |