Gene Information |
Locus tag | HS_0820 |
Symbol | nusA |
ID | 4240312 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | + |
Start bp | 894011 |
End bp | 895492 |
Gene Length | 1482 bp |
Protein Length | 493 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 638104375 |
Product | transcription elongation factor NusA |
Protein accession | YP_719030 |
Protein GI | 113460963 |
COG category | [K] Transcription |
COG ID | [COG0195] Transcription elongation factor |
TIGRFAM ID | [TIGR01953] transcription termination factor NusA; [TIGR01954] transcription termination factor NusA, C-terminal duplication |
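
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the record does not state either convention, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 894011
END_BP = 895492
GENE_LENGTH_BP = 1482
PROTEIN_LENGTH_AA = 493


def span_length(start: int, end: int) -> int:
    """Feature length, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, ASSUMING the stop codon is
    counted in the gene length and is not translated."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, 895492 - 894011 + 1 gives 1482 bp, and 1482 / 3 - 1 gives 493 aa, matching the listed Gene Length and Protein Length.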
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAAAG AAATTTTATT AGCCGCTGAA GCGGTTTCCA ATGAAAAATT GTTACCCCGT GAGAAAATTT TTGAAGCATT GGAAAGTGCA ATTGCACTTT CGACGAAGAA AAAATATGAA CAAGAAATTG ATGTTCGTGT AGTCATCAAT ACTAAAACCG GTGAGTTTTC TACTTTTCGT CGTTGGTTAG TGGTTGAAAC AGTACTCAAT CCAACGAAAG AGATTACATT AGAAGCTGCT CAATTTGAAG ATCCGGAAAT AAAATTAGGT GACTATATTG AAGATGAAAT TGATTCGATT GCTTTTGATC GCATTGCGAT GCAAACAGCA CGTCAAGTAA TTAGTACCAA AATTCGTGAA GCAGAGCGTA ATAAAATTGT GGAACAGTTC CGCTCACAGG AAGGTGAAAT TGTTTCCGGT ACAGTGAAAA AAGTAAATCG TGATTCTATT ATTTTAGATT TAAGTCAACA GGCTGAAGCT GTTATTTTGC GTGAAGATAT GTTACCGCGT GAAAATTTCC GTCCGGGTGA TAGAGTTCGT GGTGTACTAT ACAAAGTAAG TCCGGAAAAC AAGGGTATTC AATTGTTTGT GACTCGTGCT AAACCTGAAA TGTTAGTGGA ATTGTTCCGT ATTGAAGTTC CTGAAATCGG CGAAGAGTTG ATTGAAATTA AAGGTGCCTC ACGTGATCCG GGATTACGTG CAAAAATTGC TGTCAAAAGC AATGATAAGA GAATTGATCC AGTTGGAGCT TGTGTCGGTA TGCGTGGCTC TCGAGTTCAA GCAATTACCA ATGAATTGGG GGGGGAGCGT GTAGATATTG TGTTATGGGA TGATAATCCT GCACAATTTG TCATTAATGC AATGGCACCG GCAGATGTGA GCTCAATTGT TGTTGATGAA GATAATCATT CTATGGATAT TGCCGTTGAA CCGGAAAATT TAGCACAAGC TATTGGACGC AATGGTCAAA ATGTACGTTT AGCAACACAG TTAACTGGTT GGGTCTTAAA TGTTATGACA ATAGATGATT TAAATGCTAA ACACCAAGCG GAAGATAACA AGATTTTAGC TTTATTTATG ACCGCACTAG AGATTGATGA GGAGTTTGCT CATATCCTTG TTGATGAAGG ATTCACTAAT TTGGAAGAGA TTGCTTATGT TGCTGTAAAC GAATTAACGG CAATTGATGG CTTAGAAGAT GAGGATCTTG TTGAGGAATT ACAAGCTCGA GCGAAAAATG CGATTACTGC AAGGATGTTG GCTGAAGAAG AAGCATTGAA ACAAGCTCAT GTTGAAGAAA AATTATTAAA TTTAGAAGGT ATGTCTCGTC ATATTGCATT TAGATTATCA GAAAAACAAA TTACAACTCT TGAAGAATTA GCTGAACAAG GTGTTGATGA TTTATCAGAT ATTGAAGAAT TAAGTGCTGA ACAAGCTGCA GATTTAATTA TGGCTGCACG TAATATTTGT TGGTTCAATT AG
|
Protein sequence | MSKEILLAAE AVSNEKLLPR EKIFEALESA IALSTKKKYE QEIDVRVVIN TKTGEFSTFR RWLVVETVLN PTKEITLEAA QFEDPEIKLG DYIEDEIDSI AFDRIAMQTA RQVISTKIRE AERNKIVEQF RSQEGEIVSG TVKKVNRDSI ILDLSQQAEA VILREDMLPR ENFRPGDRVR GVLYKVSPEN KGIQLFVTRA KPEMLVELFR IEVPEIGEEL IEIKGASRDP GLRAKIAVKS NDKRIDPVGA CVGMRGSRVQ AITNELGGER VDIVLWDDNP AQFVINAMAP ADVSSIVVDE DNHSMDIAVE PENLAQAIGR NGQNVRLATQ LTGWVLNVMT IDDLNAKHQA EDNKILALFM TALEIDEEFA HILVDEGFTN LEEIAYVAVN ELTAIDGLED EDLVEELQAR AKNAITARML AEEEALKQAH VEEKLLNLEG MSRHIAFRLS EKQITTLEEL AEQGVDDLSD IEELSAEQAA DLIMAARNIC WFN
|
| |
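
The Sequence fields relate to the Translation table and GC content fields listed earlier. As a minimal sketch, not the database's own pipeline, the code below strips the 10-character spacing used in the record, computes a GC fraction, and translates codons with the amino-acid assignments of NCBI translation table 11. The helper names are hypothetical, and only the first 30 nucleotides of the gene are used as sample input.

```python
# Codon -> amino acid assignments for NCBI translation table 11, written in
# the usual compact form (base order T, C, A, G at each codon position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Simplifications: alternative start codons are not converted to M, and
    ambiguous bases such as N raise KeyError.
    """
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-nt blocks of the gene sequence in this record.
GENE_PREFIX = "ATGAGTAAAG AAATTTTATT AGCCGCTGAA"

if __name__ == "__main__":
    print(translate(GENE_PREFIX))
    print(round(gc_fraction(GENE_PREFIX), 2))
```

Translating this prefix yields MSKEILLAAE, the first ten residues of the listed protein sequence. Passing the complete gene sequence to `gc_fraction` is how the listed GC content could be compared against the sequence itself.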