Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dde_3161 |
Symbol | nusA |
ID | 3758136 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio desulfuricans subsp. desulfuricans str. G20 |
Kingdom | Bacteria |
Replicon accession | NC_007519 |
Strand | + |
Start bp | 3147059 |
End bp | 3148348 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637784072 |
Product | transcription elongation factor NusA |
Protein accession | YP_389650 |
Protein GI | 78358201 |
COG category | [K] Transcription |
COG ID | [COG0195] Transcription elongation factor |
TIGRFAM ID | [TIGR01953] transcription termination factor NusA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00111335 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCATGG AATTGAAAAA AGCCATAGAC CAGATCAGCA AAGACCGGGG GCTGGACCGG GCACTGCTGA TCGAAACGCT GGAAGAAGCG GTACGTACCT CCGTCATCCG CAAATACGGC GATGAGCTGG ACCTTGAAGT CAACTTCAAT GAAGAAACAG GCGAAATCGA GGTTTACCAG TTCAAAATCG TGACGGAAGA AGTGGAGGAC GAGGCTTCGC AGATTTCTCT GGAAGACGCC CGCGCACACG ACCCCAGCGT CCAGCTTGAC GACGAGATGG GCTTTCGCCT GAAAATCGAA GATCTGGGCC GTATTGCCGC CCAGTCCGCC AAGCAGGTCA TCATCCAGCG CATGCGGGAT GCCGAGCAGG AAATCATATA CGAAGAATTC AAGGACCGCC TGAACGAAGT CGCCAGCGGC ATCATTCAGC GCAGAGACAA AACCGGCTGG ATCATCAATC TGGGCCGTAC CGAAGCGCTG CTGCCCAAAG ACGAACAGAT TCCGCGCGAA CACTACAAGC GCGGCGACCG CGTGCAGGCC ATCATCATAG AAGTGCGCAA GGAAGGCCGC GGCCCGCAGG TTGTGGTATC CCGTTCGCAC CCCGACTACA TGGCCGCGCT GTTCAAACGC GAAGTGCCCG AAGTGGATGA CGAAACCGTT CAGGTGATGG GTGTTGCCCG CGATCCGGGC AGCCGCGCCA AAGTGGCTGT CACCTCGCGG GACCGCGATG TCGATCCGGT GGGTGCCTGT GTGGGCATAC GCGGTTCGCG CATTCAGAAT ATCGTGCAGG AACTGCGCGG CGAACGCATC GACATAGTGG TGTGGAGTCC CGACATCGCA ACCTATGCAC GCAACGCCCT CAGCCCCGCC GTGATCAGCC GCATCATTGT GGACGAAGAG GAAAACATGC TCGAAGTGGT TGTGCCTGAC GATCAGCTGA CCAATGCCAT CGGCAGAAAG GGTCAGAACG TCAAACTGGC ATCCAAACTG CTGGGCTGGA AAATAGATAT TTACACCGAG ACCCGCTACA ACGAAGCCAA CGCCATAGGG CGCGGTCTCG AACAGCTGGC CAGCGTGGCC GAAGTGCCCA TCGAGCAGTT TGTGGCCGCG GGCTTCTCGT CCATAGAAGA GCTGCGCGAC GCCACAGACG AAGAGCTGAT GGCCGTGGAA GGCCTTACTC CCGGAAAGAT CTCGGATCTG CGGGCTGCCA TCAACTTCCT TGCCCCTGTA CAGCGCAACG CTGAAGACGG AGAGGAAGAA GAAGCCGCTG ACGGGGAAAA CGCCGAATAG
|
Protein sequence | MSMELKKAID QISKDRGLDR ALLIETLEEA VRTSVIRKYG DELDLEVNFN EETGEIEVYQ FKIVTEEVED EASQISLEDA RAHDPSVQLD DEMGFRLKIE DLGRIAAQSA KQVIIQRMRD AEQEIIYEEF KDRLNEVASG IIQRRDKTGW IINLGRTEAL LPKDEQIPRE HYKRGDRVQA IIIEVRKEGR GPQVVVSRSH PDYMAALFKR EVPEVDDETV QVMGVARDPG SRAKVAVTSR DRDVDPVGAC VGIRGSRIQN IVQELRGERI DIVVWSPDIA TYARNALSPA VISRIIVDEE ENMLEVVVPD DQLTNAIGRK GQNVKLASKL LGWKIDIYTE TRYNEANAIG RGLEQLASVA EVPIEQFVAA GFSSIEELRD ATDEELMAVE GLTPGKISDL RAAINFLAPV QRNAEDGEEE EAADGENAE
|
| |