Gene Information |
Locus tag | Dshi_3561 |
Symbol | nusA |
ID | 5713792 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 3746464 |
End bp | 3748098 |
Gene Length | 1635 bp |
Protein Length | 544 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641269490 |
Product | transcription elongation factor NusA |
Protein accession | YP_001534895 |
Protein GI | 159046101 |
COG category | [K] Transcription |
COG ID | [COG0195] Transcription elongation factor |
TIGRFAM ID | [TIGR01953] transcription termination factor NusA; [TIGR01954] transcription termination factor NusA, C-terminal duplication |
|
|
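The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS ending in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(3746464, 3748098)
print(length, protein_length(length))  # prints "1635 544"
```

Both values agree with the Gene Length and Protein Length rows above.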
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAATCA CCTCTGCCAA CCAGTTGGAG CTGTTGCAGA CCGCCGAGGC GGTGGCGCGC GAGAAGATGA TCGACCCGGG CCTCGTGATC GAGGCCATGG AAGAGAGCCT CGCCCGTGCC GCCAAGTCGC GCTACGGCTC GGAGATGGAC ATCCGTGTGT CGATCGACCG TAAGACCGGC GTGGCGACCT TCACCCGCGT GCGCACCGTG GTCGAGGACG AGGAGCTGGA GAATTACCAG GCCGAGCTGA CCGTGGAGCA GGCCAAGCCC TATCTCGACG ATCCCAAGGT CGGTGACACC ATCGTCGACC AGGTGCCGCC GGTGGAGATG GGCCGGATCG CCGCCCAGTC GGCCAAACAG GTGATCCTGC AGAAGGTCCG CGAGGCCGAA CGCGACCGCC AGTACGAGGA ATTCAAGGAC CGCGCGGGCA CGATCATCAA CGGGGTCGTC AAGCGCGAGG AATACGGCAA TGTCATCGTC GACATCGGCC GGGGCGAGGG CGTGCTGCGC CGCAACGAGA AGATCGGGCG CGAGAGCTAC CGGCCCAATG ACCGCATCCG CTGCTACATC AAGGATGTGC GCCGCGAAGT GCGCGGGCCG CAGATCTTCC TCAGCCGCAC CGCGCCGGAA TTCATGGCCG AGCTGTTCAA GATGGAAGTG CCGGAAATCT ATGACGGCAT CATCGAGATC AAGGCCGTGG CCCGGGACCC GGGCAGCCGC GCGAAGATCG CCGTGATCTC CTATGACGGG TCCATCGACC CGGTGGGCGC CTGCGTCGGT ATGCGCGGCT CCCGCGTGCA GGCGGTGGTC AACGAGCTTC AGGGCGAGAA GATCGACATC ATCCCGTGGA ACGAGGACCA GCCGACCTTC CTGGTGAACG CATTGCAGCC CGCCGAGGTG AGCAAGGTGG TCCTGGACGA GGATGCGGAA CGGATCGAGG TCGTGGTCCC GGACGAGCAG CTCAGCCTCG CCATCGGGCG GCGCGGGCAG AACGTGCGTC TGGCCAGCCA GCTGACCGGG CTGGACATCG ACATCATGAC CGAGGCGCAG GAATCGGAGC GTCGTCAGGC CGAGTTCGCC GAGCGCACCA ACATGTTCGT CGAAGCGCTG GACGTGGACG AGGTGCTGGC GCAGCTGCTG GTCTCCGAAG GGTTCACCAA CCTCGAAGAG GTGGCCTATG TCGAGCAGGA AGAGCTGCTG GTGATCGACG GGTTCGACGA GGACACCGCC GAGGAGCTGC AGACCCGGGC CCGCGAATTC CTCGAAGAGC AGGCCAAGAA GGCCCTGGAG CGGGCGCGCG AGATGGGCGT CGAGGACAGC CTGGTTGAAT TCGAGGGCCT GACGCCCCAA ATGCTGGAAG CCCTCGCCGC AGACGGGGTC AAGACGCTCG AAGACTTCGC GACCTGTGCC GACTGGGAGC TGGCCGGCGG CTGGACCACG GTCGATGGCG AGCGCGTGAA GGACGAAGGC GTGCTGGAGA AGTTCGACGT GTCCCTCGAA GAGGCGCAGC TTCTGGTGAT GACGGCACGC GTGCAGCTCG GCTGGGTCAA CCCGGAGGAC CTGGAGATCG AGGAAGACGC CGATCCCGAG GCCGAAGGTG AAGCCGAGAC CGAAGAGGGG GCGCCGCAGG TCTGA
|
Protein sequence | MAITSANQLE LLQTAEAVAR EKMIDPGLVI EAMEESLARA AKSRYGSEMD IRVSIDRKTG VATFTRVRTV VEDEELENYQ AELTVEQAKP YLDDPKVGDT IVDQVPPVEM GRIAAQSAKQ VILQKVREAE RDRQYEEFKD RAGTIINGVV KREEYGNVIV DIGRGEGVLR RNEKIGRESY RPNDRIRCYI KDVRREVRGP QIFLSRTAPE FMAELFKMEV PEIYDGIIEI KAVARDPGSR AKIAVISYDG SIDPVGACVG MRGSRVQAVV NELQGEKIDI IPWNEDQPTF LVNALQPAEV SKVVLDEDAE RIEVVVPDEQ LSLAIGRRGQ NVRLASQLTG LDIDIMTEAQ ESERRQAEFA ERTNMFVEAL DVDEVLAQLL VSEGFTNLEE VAYVEQEELL VIDGFDEDTA EELQTRAREF LEEQAKKALE RAREMGVEDS LVEFEGLTPQ MLEALAADGV KTLEDFATCA DWELAGGWTT VDGERVKDEG VLEKFDVSLE EAQLLVMTAR VQLGWVNPED LEIEEDADPE AEGEAETEEG APQV
|
| |
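The GC content and the protein sequence can be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method. It assumes the 10-base blocks are separated only by whitespace, and it uses the standard codon assignments, which translation table 11 shares for amino acids (the tables differ in permitted start codons, which does not matter here because the gene starts with ATG). All names are illustrative.

```python
from itertools import product

# Codon table in TCAG order; amino acid assignments are those of the
# standard code, which translation table 11 also uses.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa for (a, b, c), aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence above (six complete codons).
print(translate("ATGGCAATCA CCTCTGCCAA"))  # prints "MAITSA"
```

Passing the full Gene sequence value to `translate` and `gc_fraction` would allow comparison with the Protein sequence and GC content rows.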