Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_5019 |
Symbol | rho |
ID | 5902481 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 5421889 |
End bp | 5423310 |
Gene Length | 1422 bp |
Protein Length | 473 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641565540 |
Product | transcription termination factor Rho |
Protein accession | YP_001686637 |
Protein GI | 167648974 |
COG category | [K] Transcription |
COG ID | [COG1158] Transcription termination factor |
TIGRFAM ID | [TIGR00767] transcription termination factor Rho |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.000339489 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGACC AAACAGAGAA CCAGACCGAC AGCGCCAACG AGGCCGAAGA GCCGATCGTC GATACGACGA CCCTGGCCGC CTCGGTCGAT CCGCAAGGCG ACGACAATGG CGGCGACGAC GAATCCGAAG TGGGCGCCAC CGTAGCGGCC ATGGGCCTGA AGACGATGTC GCTGCAGGAG CTGAAGGAGA AATCCCCGGC CGACCTGCTG GCCTTCGCCG AGACCTTCGA GGTCGAGAAC GCCAACTCCA TGCGCAAGCA GGACATGATG TTCGCGATCC TCAAGACCCT CGCCGAAGAA GGCGTGGAAA TCTCGGGCTC GGGGACCATG GAAGTGGTGC AGGACGGCTT TGGCTTCCTG CGCTCGCCGG AAGCCAACTA TCTTCCGGGT CCGGATGATA TCTACGTGTC GCCCTCGCAA ATCCGCAAGT TCGGCCTGCG CACCGGCGAC ACCATCGACG GCGCCATCCG CGCGCCCCGC GAGGGCGAGC GCTACTTCGC CCTCACCGGC GTGACCTTGA TCAATTTCGA GAGCCCGGAC AACGTCAAGC ACAAGGTCCA CTTCGACAAC CTGACCCCGC TCTATCCCGA GGAGCGGCTG AACATGGAAC TGCCCGATCC GACCATCAAG GATCGCTCGG GCCGGGTCAT CGACATCGTC GCCCCGCTGG GCAAGGGTCA GCGCTGCCTG ATCGTCGCCC CGCCGCGCGT CGGCAAGACG GTGATGCTGC AGAACATCGC CAAGTCGATC GAGACCAACC ACCCCGAGTG CTACCTGATC GTCCTGTTGA TCGACGAGCG CCCGGAAGAA GTCACCGACA TGCAACGCAC GGTAAAGGGC GAGGTCATCG CCTCGACCTT CGACGAACCG GCGACCCGCC ACGTGCAGGT GGCCGAGATG GTCATCGAAA AGGCCAAGCG CCTGGTCGAG CACAAGCGCG ACGTGGTCAT CCTGCTGGAC TCGGTCACCC GCCTGGGCCG CGCCTACAAC ACCACCGTCC CGTCGTCGGG CAAGGTGCTG ACCGGCGGCG TCGACGCCAA CGCCTTGCAG CGCCCCAAGC GCTTCTTCGG CGCGGCGCGG AACGTCGAGG AGGGCGGCTC GCTGTCGATC ATCGCCACCG CCCTGATCGA CACCGGCAGC CGGATGGACG AAGTGATCTT CGAAGAGTTC AAGGGCACCG GTAACTCGGA AATCGTTCTT GATCGTAAGG TGGCGGACAA GCGCATCTTC CCGGCCATCG ACGTGTTGAA GTCGGGCACC CGCAAGGAAG AGCTGATCAC GCCGCGGGAC CAATTGCAGA AGACCTACGT TCTGCGCCGG ATCCTCAACC CGATGGGCGC CTCGGACGCC ATCGAGTTCC TGCTCGAGAA GATGCGCCAG TCAAAGACCA ACGGCGATTT CTTCCAGTCG ATGAACACCT AG
|
Protein sequence | MTDQTENQTD SANEAEEPIV DTTTLAASVD PQGDDNGGDD ESEVGATVAA MGLKTMSLQE LKEKSPADLL AFAETFEVEN ANSMRKQDMM FAILKTLAEE GVEISGSGTM EVVQDGFGFL RSPEANYLPG PDDIYVSPSQ IRKFGLRTGD TIDGAIRAPR EGERYFALTG VTLINFESPD NVKHKVHFDN LTPLYPEERL NMELPDPTIK DRSGRVIDIV APLGKGQRCL IVAPPRVGKT VMLQNIAKSI ETNHPECYLI VLLIDERPEE VTDMQRTVKG EVIASTFDEP ATRHVQVAEM VIEKAKRLVE HKRDVVILLD SVTRLGRAYN TTVPSSGKVL TGGVDANALQ RPKRFFGAAR NVEEGGSLSI IATALIDTGS RMDEVIFEEF KGTGNSEIVL DRKVADKRIF PAIDVLKSGT RKEELITPRD QLQKTYVLRR ILNPMGASDA IEFLLEKMRQ SKTNGDFFQS MNT
|
| |