Gene BURPS1106A_1761 details

Gene Information

Locus tag: BURPS1106A_1761
Symbol: nusA
ID: 4902040
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 1719825
End bp: 1721300
Gene Length: 1476 bp
Protein Length: 491 aa
Translation table: 11
GC content: 65%
IMG OID: 640134991
Product: transcription elongation factor NusA
Protein accession: YP_001066030
Protein GI: 126455004
COG category: [K] Transcription
COG ID: [COG0195] Transcription elongation factor
TIGRFAM ID: [TIGR01953] transcription termination factor NusA; [TIGR01954] transcription termination factor NusA, C-terminal duplication
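
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state its coordinate convention) and that the protein length excludes the stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1719825
END_BP = 1721300
GENE_LENGTH_BP = 1476
PROTEIN_LENGTH_AA = 491


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 1476
    print(expected_protein_length(GENE_LENGTH_BP))  # 491
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.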


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTCGCG AAGTGTTGAT GTTGGTGGAT GCGCTGGCGC GCGAGAAGAA CGTCGACAAG 
GACGTCGTGC TGGGCGCGCT CGAAGCGGCC CTCGCGTCGG CTTCCAAGAA GCTGTTCGAC
GAAGGCGCCG AGATCCGCGT ACATATCGAT CGCGAGAGCG GTGAACACGA GACGTTCCGT
CGCTGGCTCG TCGTGCCCGA CGAGGCGGGC CTCCAAGAGC CGGATCGCGA GATCCTGCTG
TTCGAGGCGC GCGAGCAGAA GCCCGATGTC GAGGTCGGCG ACTATATCGA AGAATCGGTG
CCGTCGATCG AGTTCGGCCG GATCGGCGCG CAGGCCGCGA AGCAGGTGAT CCTGCAGAAG
GTGCGCGACG CGGAGCGCGA GCAGATCCTG AACGATTACC TCGAGCGCGG CGAGAAGATC
ATGACGGGCA CGGTGAAGCG CCTCGACAAG GGCAACTTCA TCGTCGAATC GGGCCGTGTC
GAGGCGCTGC TGCGCCGCGA CCAACTGATT CCGAAGGAAA ACCTGCGCGT GGGCGACCGC
GTGCGCGCGT ACATCGCGAA GGTCGACCGC ACCGCGCGCG GCCCGCAGAT CGAGCTGTCG
CGCACCGCGC CCGAATTCCT GATGAAGCTC TTCGAGATGG AAGTGCCGGA AATCGAGCAG
GGGCTTCTCG AGATCAAGGC GGCGGCCCGC GATCCGGGCG TGCGCGCGAA GATCGGCGTC
GTCGCGTACG ACAAGCGGAT CGATCCGATC GGCACGTGCG TCGGCATTCG CGGCTCGCGC
GTGCAGGCCG TGCGCAACGA GCTCGGTGGC GAAAACATCG ACATCGTGCT ATGGTCGGAG
GATCCCGCCC AGTTCGTGAT CGGCGCGCTC GCGCCGGCGG CCGTCCAGTC GATCGTCGTC
GATGAAGAAA AGCATTCGAT GGACGTCGTC GTCGACGAGA ACGAATTGGC TGTCGCGATC
GGCCGCAGCG GCCAGAACGT GCGTCTTGCC AGCGAACTGA CCGGCTGGCA GATCAACATC
ATGACGCCGG ACGAATCCGC CCAGAAGCAG AACGAAGAGC GCGACGCGCT GCGCGGCCTG
TTCATGGCGC GCCTCGACGT CGACGAGGAA GTCGCGGACA TCCTGATCGA CGAAGGCTTC
ACGAGCCTCG AAGAGATCGC CTACGTGCCG CTCAACGAGA TGCTCGAGAT CGAGGCGTTC
GACGAGGACA CCGTGCACGA ACTGCGCAAC CGCTCGCGCG ACGCGCTGCT CACGATGGCG
ATCGCGAACG AGGAGAAGGT CGAGACGGCC GCCCTCGATC TGAAGAGCCT CGACGGCGTC
ACGCCCGAAC TGCTCGCGAA GCTGGCCGAG CAGGGCGTGC AGACGCGCGA CGATCTCGCG
GAGCTTGCCG TGGACGAGCT GGTCGACATG ACCGGCATGG AAGAGGAAGC CGCGAAGGCG
CTGATCATGA AAGCACGCGA ACACTGGTTC CAGTGA
 
Protein sequence
MSREVLMLVD ALAREKNVDK DVVLGALEAA LASASKKLFD EGAEIRVHID RESGEHETFR 
RWLVVPDEAG LQEPDREILL FEAREQKPDV EVGDYIEESV PSIEFGRIGA QAAKQVILQK
VRDAEREQIL NDYLERGEKI MTGTVKRLDK GNFIVESGRV EALLRRDQLI PKENLRVGDR
VRAYIAKVDR TARGPQIELS RTAPEFLMKL FEMEVPEIEQ GLLEIKAAAR DPGVRAKIGV
VAYDKRIDPI GTCVGIRGSR VQAVRNELGG ENIDIVLWSE DPAQFVIGAL APAAVQSIVV
DEEKHSMDVV VDENELAVAI GRSGQNVRLA SELTGWQINI MTPDESAQKQ NEERDALRGL
FMARLDVDEE VADILIDEGF TSLEEIAYVP LNEMLEIEAF DEDTVHELRN RSRDALLTMA
IANEEKVETA ALDLKSLDGV TPELLAKLAE QGVQTRDDLA ELAVDELVDM TGMEEEAAKA
LIMKAREHWF Q
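
The relationship between the gene sequence, the protein sequence, and the GC content field can be checked with a short script. This is a sketch under stated assumptions, not the method the database itself uses: it strips the 10-base spacing used in the listing, applies the codon assignments of translation table 11 (which are the same as those of the standard code; table 11 differs in its permitted start codons), and does not handle alternative start codons or ambiguous bases. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Assumes an unambiguous DNA sequence (A, C, G, T only); any other
    character raises KeyError.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First and last lines of the gene sequence listed above.
    first_line = "ATGAGTCGCG AAGTGTTGAT GTTGGTGGAT GCGCTGGCGC GCGAGAAGAA CGTCGACAAG"
    last_line = "CTGATCATGA AAGCACGCGA ACACTGGTTC CAGTGA"
    print(translate(first_line))  # MSREVLMLVDALAREKNVDK
    print(translate(last_line))   # LIMKAREHWFQ
```

The two translated fragments match the start and end of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.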