Gene HS_0820 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_0820 
SymbolnusA 
ID4240312 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp894011 
End bp895492 
Gene Length1482 bp 
Protein Length493 aa 
Translation table11 
GC content37% 
IMG OID638104375 
Producttranscription elongation factor NusA 
Protein accessionYP_719030 
Protein GI113460963 
COG category[K] Transcription 
COG ID[COG0195] Transcription elongation factor 
TIGRFAM ID[TIGR01953] transcription termination factor NusA
[TIGR01954] transcription termination factor NusA, C-terminal duplication 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAAAG AAATTTTATT AGCCGCTGAA GCGGTTTCCA ATGAAAAATT GTTACCCCGT 
GAGAAAATTT TTGAAGCATT GGAAAGTGCA ATTGCACTTT CGACGAAGAA AAAATATGAA
CAAGAAATTG ATGTTCGTGT AGTCATCAAT ACTAAAACCG GTGAGTTTTC TACTTTTCGT
CGTTGGTTAG TGGTTGAAAC AGTACTCAAT CCAACGAAAG AGATTACATT AGAAGCTGCT
CAATTTGAAG ATCCGGAAAT AAAATTAGGT GACTATATTG AAGATGAAAT TGATTCGATT
GCTTTTGATC GCATTGCGAT GCAAACAGCA CGTCAAGTAA TTAGTACCAA AATTCGTGAA
GCAGAGCGTA ATAAAATTGT GGAACAGTTC CGCTCACAGG AAGGTGAAAT TGTTTCCGGT
ACAGTGAAAA AAGTAAATCG TGATTCTATT ATTTTAGATT TAAGTCAACA GGCTGAAGCT
GTTATTTTGC GTGAAGATAT GTTACCGCGT GAAAATTTCC GTCCGGGTGA TAGAGTTCGT
GGTGTACTAT ACAAAGTAAG TCCGGAAAAC AAGGGTATTC AATTGTTTGT GACTCGTGCT
AAACCTGAAA TGTTAGTGGA ATTGTTCCGT ATTGAAGTTC CTGAAATCGG CGAAGAGTTG
ATTGAAATTA AAGGTGCCTC ACGTGATCCG GGATTACGTG CAAAAATTGC TGTCAAAAGC
AATGATAAGA GAATTGATCC AGTTGGAGCT TGTGTCGGTA TGCGTGGCTC TCGAGTTCAA
GCAATTACCA ATGAATTGGG GGGGGAGCGT GTAGATATTG TGTTATGGGA TGATAATCCT
GCACAATTTG TCATTAATGC AATGGCACCG GCAGATGTGA GCTCAATTGT TGTTGATGAA
GATAATCATT CTATGGATAT TGCCGTTGAA CCGGAAAATT TAGCACAAGC TATTGGACGC
AATGGTCAAA ATGTACGTTT AGCAACACAG TTAACTGGTT GGGTCTTAAA TGTTATGACA
ATAGATGATT TAAATGCTAA ACACCAAGCG GAAGATAACA AGATTTTAGC TTTATTTATG
ACCGCACTAG AGATTGATGA GGAGTTTGCT CATATCCTTG TTGATGAAGG ATTCACTAAT
TTGGAAGAGA TTGCTTATGT TGCTGTAAAC GAATTAACGG CAATTGATGG CTTAGAAGAT
GAGGATCTTG TTGAGGAATT ACAAGCTCGA GCGAAAAATG CGATTACTGC AAGGATGTTG
GCTGAAGAAG AAGCATTGAA ACAAGCTCAT GTTGAAGAAA AATTATTAAA TTTAGAAGGT
ATGTCTCGTC ATATTGCATT TAGATTATCA GAAAAACAAA TTACAACTCT TGAAGAATTA
GCTGAACAAG GTGTTGATGA TTTATCAGAT ATTGAAGAAT TAAGTGCTGA ACAAGCTGCA
GATTTAATTA TGGCTGCACG TAATATTTGT TGGTTCAATT AG
 
Protein sequence
MSKEILLAAE AVSNEKLLPR EKIFEALESA IALSTKKKYE QEIDVRVVIN TKTGEFSTFR 
RWLVVETVLN PTKEITLEAA QFEDPEIKLG DYIEDEIDSI AFDRIAMQTA RQVISTKIRE
AERNKIVEQF RSQEGEIVSG TVKKVNRDSI ILDLSQQAEA VILREDMLPR ENFRPGDRVR
GVLYKVSPEN KGIQLFVTRA KPEMLVELFR IEVPEIGEEL IEIKGASRDP GLRAKIAVKS
NDKRIDPVGA CVGMRGSRVQ AITNELGGER VDIVLWDDNP AQFVINAMAP ADVSSIVVDE
DNHSMDIAVE PENLAQAIGR NGQNVRLATQ LTGWVLNVMT IDDLNAKHQA EDNKILALFM
TALEIDEEFA HILVDEGFTN LEEIAYVAVN ELTAIDGLED EDLVEELQAR AKNAITARML
AEEEALKQAH VEEKLLNLEG MSRHIAFRLS EKQITTLEEL AEQGVDDLSD IEELSAEQAA
DLIMAARNIC WFN