Gene Information |
Locus tag | Noca_1747 |
Symbol | |
ID | 4597929 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 1857894 |
End bp | 1859630 |
Gene Length | 1737 bp |
Protein Length | 578 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639776347 |
Product | transcription termination factor Rho |
Protein accession | YP_922947 |
Protein GI | 119715982 |
COG category | [K] Transcription |
COG ID | [COG1158] Transcription termination factor |
TIGRFAM ID | [TIGR00767] transcription termination factor Rho |
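The coordinate fields above are consistent with one another, which a few lines of arithmetic can confirm. This is a minimal sketch that assumes Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon not counted in Protein Length; the function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Number of amino acids for an unspliced CDS that ends in a stop codon (assumed)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon, which encodes no amino acid.
    return gene_len // 3 - 1


length = gene_length(1857894, 1859630)
print(length)                  # 1737
print(protein_length(length))  # 578
```

Both values match the Gene Length and Protein Length rows of the record.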
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.179757 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAGG CTCAGCTGGT CGAGGCGATC AAGGCCCACC AGAGCGGCGG CCGGCCGGCC AAGGAGCGCG GCGAGCAGCA GCAGGCCCAG CAGGAGCGGA GCGAGCAGCA GCGGACCCAG CCGGAACAGG CCCCGCAGGA GCGGACCCAG CCGGAGCGAC CGCAGCAGCA GCGGTCGGAG GAGCAGCCGC GGCGCGACCA GCGGCCCGAC GAGGCCCAGC GGGACCAGCA GCGGGACCAG CAGCGCGAGC GCAACCGCGG CCAGGGTCAG GGGAACCACG ACCAGAAGCA GGACCAGAAC AAGCAGGGCC AGCCCAAGCG GCAGGACCAG AAGCAGGCCC AGAACAAGCA GGACCAGAAG CAGGACCAGG GCAAGCAGGA CCAGAAGCAG GACCAGGGTC ACCAGGATCA GGACCATCAC GACCAGGGAC ACCAGGACCA GGGGCACCAG GACCAGGGCC GGGCGGGCGA GGACCAGGGT GAGGGCAGTC GCCGCAACCG GCGGCGTCGC GGTCGCGACC GTGACCGTAC CGGTCGGGGG GTCACCGGCG GTGGTCAGCG CAACGAGCCG GACACCACGA TCCTCGAGGA CGACGTCCTG GTGCCGGCCG CGGGCATCCT CGACGTTCTC GACAACTACG CGTTCGTGCG GACCAGCGGC TACCTCCCCG GCCCTGACGA CGTGTACGTG TCGCTCTCGA TGGTGCGCAA GTTCGGGCTG CGCCGCGGCG ACGCGCTCGT CGGGCAGGTG CGCCAGCCCC GGGAGGGCGA GCGCAAGGAG AAGTTCAACC CGATGGTCCG CATCGACAGC GTCAACGGCG CCGATCCGGA GATCGCGAAG GGGCGGGTCG ACTTCGCCAA GCTGACCCCG CTCTACCCCT CCGAGCGGTT GCGGCTGGAG ACCGAGCCGA CGAACCTGAT CGGTCGGGTC ATCGACATCG CGGCCCCGAT CGGCAAGGGC CAGCGCGGCC TGATCGTGTC CCCGGCGAAG GCCGGCAAGA CCATGATCAT GCAGTCGATC GCGAACTCGA TCACCACCAA CAACCCCGAG TGCCACCTGA TGGTGGTGCT GGTCGACGAG CGGCCCGAGG AGGTCACCGA CTTCGAGCGC TCGGTCAAGG GTGAGGTCAT CTCCTCGACC TTCGACCGTC CGGCCAGCGA CCACACGATG GTCGCCGAGC TCGCCATCGA GCGGGCCAAG CGGCTGGTCG AGCTCGGCCA CGACGTCGTC GTACTGCTCG ACGGCATCAC CCGGTTGGGG CGCGCCTACA ACCTCGCGAT GCCGGCGAGC GGCCGGATCC TCTCCGGTGG TGTGGACTCG GCCGCGCTCT ACCCACCGAA GAAGTTCTTC GGTGCGGCGC GCAACATCGA GAACGGCGGC TCGCTGACCA TCCTCGCCAC GGCCCTGATC GAGAGCGGCT CGAAGATGGA CGAGGTGATC TTCGAGGAGT TCAAGGGCAC CGGGAACATG GAGATCCGGT TGCGCCGCGA CCTTGCCGAC AAGCGACTGT TCCCCGCGAT CGACGCGGTC CAGTCCGGCA CCCGCCGCGA GGAGCTCCTG ATGAGCAAGG AGGAGCTGGC CATCGTCTGG AAGCTGCGCC GGGTGCTCTC CGGGCTCGAC GGCCAGCAGG CGCTCGAGCT CCTGCTGGAG CGGCTGAAGA AGTCCCAGAC CAACATCGAG TTCCTGATGC AGGTCCAGAA GACGACCCCG ACCCCGACCG GCGGGCGCGA AGACTGA
|
Protein sequence | MKKAQLVEAI KAHQSGGRPA KERGEQQQAQ QERSEQQRTQ PEQAPQERTQ PERPQQQRSE EQPRRDQRPD EAQRDQQRDQ QRERNRGQGQ GNHDQKQDQN KQGQPKRQDQ KQAQNKQDQK QDQGKQDQKQ DQGHQDQDHH DQGHQDQGHQ DQGRAGEDQG EGSRRNRRRR GRDRDRTGRG VTGGGQRNEP DTTILEDDVL VPAAGILDVL DNYAFVRTSG YLPGPDDVYV SLSMVRKFGL RRGDALVGQV RQPREGERKE KFNPMVRIDS VNGADPEIAK GRVDFAKLTP LYPSERLRLE TEPTNLIGRV IDIAAPIGKG QRGLIVSPAK AGKTMIMQSI ANSITTNNPE CHLMVVLVDE RPEEVTDFER SVKGEVISST FDRPASDHTM VAELAIERAK RLVELGHDVV VLLDGITRLG RAYNLAMPAS GRILSGGVDS AALYPPKKFF GAARNIENGG SLTILATALI ESGSKMDEVI FEEFKGTGNM EIRLRRDLAD KRLFPAIDAV QSGTRREELL MSKEELAIVW KLRRVLSGLD GQQALELLLE RLKKSQTNIE FLMQVQKTTP TPTGGRED
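The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a rough sketch of how the record could be cross-checked: it strips the spacing, computes the GC fraction, and translates the coding sequence. The helper names are hypothetical. The codon lookup string is the standard amino acid assignment in TCAG order; translation table 11 uses the same assignments and differs only in which codons may act as start codons, which does not matter here because this gene starts with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        a, b, c = (BASES.index(base) for base in seq[i:i + 3])
        amino_acid = AMINO_ACIDS[16 * a + 4 * b + c]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
first_blocks = "ATGAAGAAGG CTCAGCTGGT CGAGGCGATC"
print(translate(first_blocks))  # MKKAQLVEAI
```

The printed peptide matches the first block of the Protein sequence row. Passing the full Gene sequence string to `gc_fraction` and `translate` would allow the GC content and Protein Length rows to be compared in the same way.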