Gene SeD_A3644 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A3644 
SymbolnusA 
ID6871595 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp3497440 
End bp3498942 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content52% 
IMG OID642786624 
Producttranscription elongation factor NusA 
Protein accessionYP_002217260 
Protein GI198243925 
COG category[K] Transcription 
COG ID[COG0195] Transcription elongation factor 
TIGRFAM ID[TIGR01953] transcription termination factor NusA
[TIGR01954] transcription termination factor NusA, C-terminal duplication 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones75 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAAAG AAATTTTGGC TGTTGTTGAA GCCGTCTCCA ACGAAAAAGC GCTGCCACGC 
GAAAAAATTT TTGAAGCGCT GGAAAGTGCG CTGGCGACAG CAACAAAGAA AAAATATGAG
CAGGAGATCG ATGTTCGTGT AGAAATCGAT CGTAAAAGCG GTGATTTTGA TACTTTCCGC
CGTTGGTTGA TCGTTGAAGA AGTGACCATG CCGACGAAGG AAATTACGCT GGAAGCGGCG
CGTTTTGAAG ACGAAAGCCT GAATGTCGGC GACTATGTTG AAGATCAGAT TGAATCTGTC
ACCTTTGACC GTATCACCAC GCAGACTGCG AAACAGGTTA TCGTACAGAA GGTCCGTGAA
GCTGAACGCG CGATGGTTGT CGATCAGTTC CGCGACCAGG AAGGCGAAAT TGTCACTGGC
GTGGTGAAGA AAGTGAACCG CGACAATATC TCTCTGGAAA TTAAATCCGA AGGGATGGCC
GGTAACGCTG AAGCGGTGAT TCTGCGTGAA GATATGCTGC CGCGTGAAAA CTTCCGTCCG
GGCGACCGCA TCCGCGGTGT TCTGTACGCT GTACGTCCAG AAGCGCGTGG CGCGCAGCTG
TTCGTCACCC GTTCCAAGCC GGAAATGCTG ATCGAACTGT TCCGCATCGA AGTGCCGGAA
ATCGGCGAAG AAGTGATTGA AATTAAAGCG GCGGCTCGCG ATCCGGGTTC TCGTGCGAAA
ATCGCAGTGA AAACCAACGA TAAACGTATC GATCCGGTCG GCGCTTGTGT GGGGATGCGC
GGCGCGCGCG TTCAGGCGGT CTCTACCGAA CTGGGCGGTG AGCGTATCGA TATCGTGCTG
TGGGATGATA ACCCGGCGCA GTTCGTCATT AATGCGATGG CGCCGGCAGA CGTCGCGTCT
ATCGTGGTGG ACGAAGATAA ACATACCATG GATATCGCCG TTGAAGCCGG TAATCTGGCG
CAGGCGATCG GACGTAATGG TCAGAACGTC CGCCTGGCTT CGCAATTGAG CGGCTGGGAA
CTCAACGTAA TGACCGTTGA TGACTTGCAG GCTAAACATC AGGCTGAAGC ACATGCCGCT
ATCGAGATCT TTACTAAATA TCTTGATATT GATGAAGAGT TCGCGACCGT TCTGGTAGAA
GAAGGTTTCT CCACGCTCGA GGAACTGGCC TATGTGCCAA TGAAAGAACT GCTGGAAATT
GACGGCCTTG ATGAGCCGAC CGTTGAAGCA CTGCGCGAGC GTGCTAAAAA CGCACTGGCC
ACTCTGGCGC AGGACCAGGA AGCAAGCCTC GGTGATAACA AACCGGCTGA CGATCTGCTG
AATCTGGAAG GATTAGATCG CGATATGGCT TTCAAACTGG CGGCTCGTGG TGTTTGTACG
CTGGAAGATC TCGCCGACCA GGGCATTGAT GATCTGGCTG ATATCGAAGG GTTGACCGAC
GAAAAAGCCG GTGAGCTGAT TATGGCTGCC CGTAATATTT GCTGGTTCGG CGACGAAGCG
TAA
 
Protein sequence
MNKEILAVVE AVSNEKALPR EKIFEALESA LATATKKKYE QEIDVRVEID RKSGDFDTFR 
RWLIVEEVTM PTKEITLEAA RFEDESLNVG DYVEDQIESV TFDRITTQTA KQVIVQKVRE
AERAMVVDQF RDQEGEIVTG VVKKVNRDNI SLEIKSEGMA GNAEAVILRE DMLPRENFRP
GDRIRGVLYA VRPEARGAQL FVTRSKPEML IELFRIEVPE IGEEVIEIKA AARDPGSRAK
IAVKTNDKRI DPVGACVGMR GARVQAVSTE LGGERIDIVL WDDNPAQFVI NAMAPADVAS
IVVDEDKHTM DIAVEAGNLA QAIGRNGQNV RLASQLSGWE LNVMTVDDLQ AKHQAEAHAA
IEIFTKYLDI DEEFATVLVE EGFSTLEELA YVPMKELLEI DGLDEPTVEA LRERAKNALA
TLAQDQEASL GDNKPADDLL NLEGLDRDMA FKLAARGVCT LEDLADQGID DLADIEGLTD
EKAGELIMAA RNICWFGDEA