Gene YpAngola_A3992 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A3992 
SymbolnusA 
ID5802470 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp4244310 
End bp4245797 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content49% 
IMG OID641341777 
Producttranscription elongation factor NusA 
Protein accessionYP_001608285 
Protein GI162418514 
COG category[K] Transcription 
COG ID[COG0195] Transcription elongation factor 
TIGRFAM ID[TIGR01953] transcription termination factor NusA
[TIGR01954] transcription termination factor NusA, C-terminal duplication 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000690041 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones63 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAAAG AGATTCTGGC TGTTGTAGAA GCAGTTTCCA ATGAGAAATC CCTTCCGCGC 
GAGAAGATTT TTGAGGCGTT GGAAACCGCT CTAGCGACAG CAACCAAGAA AAAATACGAA
CAAGAGATTG AAGTCCGCGT CAGTATTGAC CGTAAAACCG GTGATTTTGA TACTTTCCGC
CGTTGGGTCG CTGTTGACGA AGTGACTATG CCAACCCGTG AAATCACGTT GGATGCGGCT
CAATTTGAAG ATCCTTCTCT CCAATTGGGT GATTATGTCG AAGACCAGAT TGAATCGGTG
ACTTTTGACC GCATTACCAC CCAAACAGCC AAGCAAGTCA TCGTACAAAA AGTACGTGAA
GCTGAACGGG CGATGGTTGT TGAGCAGTTC CGTCAATATC TGGGGCAAAT TGTCACTGGT
ATTGTTAAGA AAGTTAGCCG TGACAGTATT GCACTGGATC TGGGCCACAA TGCGGAAGCT
GTTATTGGTC GTGAAGATAT GCTCCCGCGT GAAAATTTCC GCCCAGGTGA CCGTATCCGT
GGTGTTCTGT ATGACGTGCG TCCAGAAGCT CGTGGCGCAC AGTTGTTTGT CAGCCGTTCA
CGTTCTGAAA TGTTGGTCGA ACTGTTCCGC ATTGAAGTAC CAGAAATTGG TGAAGAGCTG
ATCGAAATTA AAGCCGCTGC CCGTGATCCT GGCTCTCGTG CTAAAATTGC GGTCAAAACC
AATGACAAGC GTATCGATCC GGTTGGTGCT TGCGTTGGTA TGCGTGGTGC CCGTGTTCAG
GCTGTGTCCA GCGAGCTTGG CGGCGAGCGC ATTGATATTG TATTGTGGGA TGATAATCCA
GCCCAGTTTG TTATTAACGC TATGGCGCCA GCTGATGTTG CGTCTATTGT GGTTGATGAA
GACAAACACA CGATGGATGT TGCCGTTGAA GCCAGTAATT TGGCCCAGGC AATTGGTCGT
AATGGCCAGA ACGTACGTTT AGCCGCGCAG CTTAGTGGCT GGGAACTGAA CGTAATGACG
GCGGACGATC TTCAGGCGAA GCATCAGGCC GAGGCTCATG CCGCTATTGA TACCTTCACT
AAATATCTTG ATATCGATGA AGACTTTGCC ACCGTATTGG TAGAAGAAGG TTTCTCTTCT
CTGGAAGAGT TGGCTTATGT GCCAATGAAA GAACTTCTGG AAATCGATGG TCTTGACGAA
GATACGGTTG AAGCGCTGCG TGATCGCGCC AAAGCTGCAT TGACCACGCT GGCCCTGGCA
CAAGAAGAAA GTTTTGGCGA CCAGAAACCC GCTGATGACC TGTTGAATTT AGCGGGTCTG
GAACGTAGCA TGGCATTCAA ATTGGCTGCG CGCGGTGTAT GTACGCTGGA AGATCTTGCC
GAGCAGGGTA TCGACGATTT GGCAGATATT GAAGGGCTTA GCGATGAGCA AGCCGGTGAG
CTGATTATGG CCGCACGTAA TATCTGTTGG TTTGGCGATA ATGCGTAA
 
Protein sequence
MNKEILAVVE AVSNEKSLPR EKIFEALETA LATATKKKYE QEIEVRVSID RKTGDFDTFR 
RWVAVDEVTM PTREITLDAA QFEDPSLQLG DYVEDQIESV TFDRITTQTA KQVIVQKVRE
AERAMVVEQF RQYLGQIVTG IVKKVSRDSI ALDLGHNAEA VIGREDMLPR ENFRPGDRIR
GVLYDVRPEA RGAQLFVSRS RSEMLVELFR IEVPEIGEEL IEIKAAARDP GSRAKIAVKT
NDKRIDPVGA CVGMRGARVQ AVSSELGGER IDIVLWDDNP AQFVINAMAP ADVASIVVDE
DKHTMDVAVE ASNLAQAIGR NGQNVRLAAQ LSGWELNVMT ADDLQAKHQA EAHAAIDTFT
KYLDIDEDFA TVLVEEGFSS LEELAYVPMK ELLEIDGLDE DTVEALRDRA KAALTTLALA
QEESFGDQKP ADDLLNLAGL ERSMAFKLAA RGVCTLEDLA EQGIDDLADI EGLSDEQAGE
LIMAARNICW FGDNA