Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_0321 |
Symbol | |
ID | 9244156 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 397922 |
End bp | 399910 |
Gene Length | 1989 bp |
Protein Length | 662 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | transcription termination factor Rho |
Protein accession | YP_003678275 |
Protein GI | 297559301 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.440265 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCGACA CCACCGAACT CCGAACGGAC GCGGCGGTCG AAGACAAAGC CACCACCGGC GTTTCCGTGA CGGAGGCTGG CGATGGCGCG CCGACAGCCA CGCCCTCGCG GTCCCGCACG GCGTCGCGCG GCACCGGTCT CGCGGCCCTG AAGCTCCCCG AGCTCCAGAA GCTCGCGTCC AGCCTGGGTA TCACCGGTAC GGGGCGCATG CGCAAGAGCG ACGTCATCGC CGCGATCGAG GCCAAGCAGG GCGGCCCGGT CGGCGGCCCC ACGAAGGCCA AAACAGCAAA GAGCGCAGAG ACGGCGTCCA AGGCCGGTAA GGTCGAGGCA ACCGACGGCC GGGCCGAGGC TCCCGAGACG CCGGGCACGG ACGAGGCCCC GCGCAAGCGC GCGGACAAGC AGCCGTCGGA CCAGGCCGTG ACCGACCCCC AGGGCGGACG GGGCGACAAG CCCTCCCGCG GTCGTCGTTC GTCCCGCCGA CGCGGTGACG AGCCCGGCGA CCAGCCCCGG GCCGTGGACG GCGCCGAATC CTCTACCTCC GCGAGCAGCG TGACCAAGAC ATCCAGCACC CCCCAGAAGA ACGGCCCCGA CTCCGCCGAG GACCGTGACA ACAGGTCCGG GCAGGGTCGC GAGCGCCAGC GCAACCGCCG CAACCGCAAC CGCGGCGGTG ACGACCAGAA CGCGAACAGC AACGCCCAGC AGGGTGGTGG CGGCCAGAAC CAGGGCCGCG GCTCCGGTGG TGGCGGTGGC GACGACGACG ACTTCGGCGG ACGCCGCCGC GGACGCCGCC GGGACCGTCG GGACCGCCGC GGACGCGGCG GGGGCCAGGA GCCGGAGCCG GTGATCGGCG AGGACGACGT CCTGCTGCCG GTCGCGGGCA TCCTCGACAT CCTGGACAAC TACGCCTTCG TGCGCACCAC CGGCTACCTC CCCGGCCAGA GCGACGTCTA CGTCTCCCTG GCCCAGGTCC GCAAGCACGG CCTGCGCAAG GGCGACCACA TCATCGGCGC GGTCCGCCAG CCCAAGGACG GCGAGCGCAG GGAGAAGTTC AACGCCCTGG TCCGCCTGGA CTCGGTCAAC GGCATGTCGC CCGACCAGGC CAGGGGCCGC CAGGAGTTCT CCAAGCTGGT CCCCCTGTAC CCCCAGGAGC GCCTGCGCCT GGAGACCGAG CCGCAGATCC TCACCACGCG CATCATCGAC CTGGTGGCGC CCATCGGCAA GGGCCAGCGC GGGCTGATCG TCTCCCCGCC CAAGGCGGGC AAGACGATGG TGGTGCAGGC GATCGCCAAC GCCATCACCG AGAACAACCC CGAGTGCTAC CTGATGGTGA TCCTGGTCGA CGAGCGGCCC GAGGAAGTCA CCGACATGCA GCGCACGGTC AAGGGCGAGG TCATCCACTC GACCTTCGAC CGGCCCGCCG AGGACCACAC GGTCGTCGCC GACCTGGCCA TCGAGCGCGC CAAGCGGCTC GTGGAGATGG GCATGGACGT CGTCGTCCTG CTGGACTCCA TCACCCGCCT GGGCCGCGCC TACAACCTGG CCGCCCCGGC CAGCGGGCGC ATCATGTCCG GCGGTGTGGA CTCCACGGCG CTCTACCCGC CCAAGCGCTT CTTCGGCGCG GCCCGCAACA TCGAGGGCGG CGGCTCGCTG ACCATCCTGG CCACGGCGCT GGTCGAGACC GGCTCGCGCG CCGACGAGGT GATCTTCGAG GAGTTCAAGG GCACCGGCAA CATGGAGCTC AAGCTCAACC GGAGCCTGGC CGACAAGCGG ATCTTCCCGG CGGTGGACGT GGACGCGTCC AGCACCCGCA AGGAGGAGAT CCTCATGTCC TCCGAGGAGC TGGGCGTGGT CTGGAAGCTG CGCCGGGTGC TGCACGCGCT CGACACCCAG CAGGCCATCG AGCTGCTCCT GGACAAGATG AAGGAGTCCA AGAGCAACGC CGAGTTCCTG CTCCAGATCC AGAAGACCAC CGTGGGCCCC GAGCGCTGA
|
Protein sequence | MSDTTELRTD AAVEDKATTG VSVTEAGDGA PTATPSRSRT ASRGTGLAAL KLPELQKLAS SLGITGTGRM RKSDVIAAIE AKQGGPVGGP TKAKTAKSAE TASKAGKVEA TDGRAEAPET PGTDEAPRKR ADKQPSDQAV TDPQGGRGDK PSRGRRSSRR RGDEPGDQPR AVDGAESSTS ASSVTKTSST PQKNGPDSAE DRDNRSGQGR ERQRNRRNRN RGGDDQNANS NAQQGGGGQN QGRGSGGGGG DDDDFGGRRR GRRRDRRDRR GRGGGQEPEP VIGEDDVLLP VAGILDILDN YAFVRTTGYL PGQSDVYVSL AQVRKHGLRK GDHIIGAVRQ PKDGERREKF NALVRLDSVN GMSPDQARGR QEFSKLVPLY PQERLRLETE PQILTTRIID LVAPIGKGQR GLIVSPPKAG KTMVVQAIAN AITENNPECY LMVILVDERP EEVTDMQRTV KGEVIHSTFD RPAEDHTVVA DLAIERAKRL VEMGMDVVVL LDSITRLGRA YNLAAPASGR IMSGGVDSTA LYPPKRFFGA ARNIEGGGSL TILATALVET GSRADEVIFE EFKGTGNMEL KLNRSLADKR IFPAVDVDAS STRKEEILMS SEELGVVWKL RRVLHALDTQ QAIELLLDKM KESKSNAEFL LQIQKTTVGP ER
|
| |