Gene Ent638_3606 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_3606 
SymbolnusA 
ID5111797 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp3910977 
End bp3912464 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content51% 
IMG OID640493810 
Producttranscription elongation factor NusA 
Protein accessionYP_001178315 
Protein GI146313241 
COG category[K] Transcription 
COG ID[COG0195] Transcription elongation factor 
TIGRFAM ID[TIGR01953] transcription termination factor NusA
[TIGR01954] transcription termination factor NusA, C-terminal duplication 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0209266 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0368341 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAAAG AAATTTTGGC TGTTGTTGAA GCCGTCTCTA ACGAGAAATC ACTGCCGCGT 
GAGAAGATTT TCGAAGCGCT GGAAAGTGCA CTGGCTACAG CAACCAAGAA AAAATACGAA
CAAGAGATCG ATGTTCGCGT AGAAATCGAT CGTAAAAGCG GTGACTTCGA TACATTCCGT
CGTTGGGTTA TTGTTGAAGA AGTGACCCAA CCGACTAAAG AAATTACGCT GGAAGCGGCT
CGTTACGAAG ACGAAAGCTT CAATGTCGGC GAATATGTTG AAGATCAGAT TGAATCGGTG
ACGTTTGACC GTATCACTAC CCAAACGGCG AAACAGGTTA TAGTACAGAA AGTACGTGAA
GCTGAACGCG CCATGGTGGT TGATCAGTTC CGTTCACACG AAGGTGAAAT CATCACTGGC
GTCGTGAAGA AAGTTAACCG TGACAACATC GCGCTTGACC TGGGTAGCAA CGCTGAAGCG
GTTATCCTGC GCGAAGATAT GTTGCCGCGT GAGAACTTCC GTCCAGGCGA CCGCATCCGC
GGTGTTCTGT ATGCGGTACG TCCAGAAGCG CGCGGTGCGC AGCTGTTCGT TACGCGTTCT
AAAGCAGAGA TGCTGATTGA ACTGTTCCGC ATTGAAGTAC CAGAAATTGG TGAAGAACTT
ATCGAGATCA AAGCAGCGGC CCGCGATCCG GGTTCACGCG CTAAGATCGC GGTAAAAACC
AACGACAAGC GTATCGACCC GGTCGGTGCT TGCGTAGGTA TGCGTGGTGC GCGCGTTCAG
GCGGTATCAA CTGAGCTGGG CGGCGAGCGT ATCGATATCA TCCTTTGGGA CGACAACCCG
GCACAATTCG TGATTAACGC GATGGCTCCG GCAGATGTTG CTTCCATTGT GGTCGATGAA
GACAAGCACA CCATGGATAT CGCTGTTGAA GCCGGTAACC TGGCGCAGGC TATCGGACGT
AATGGTCAGA ACGTACGTTT GGCTTCTCAA CTGAGTGGTT GGGATCTGAA CGTGATGACC
GTTGATGATC TGCAGGCGAA GCATCAGGCT GAAGCTCACG CCGCGATCGC AACCTTCACG
AAGTACCTGG AAATTGACGA AGATTTCGCA ACTGTCCTGG TCGAAGAAGG TTTCTCTTCG
CTTGAAGAAC TGGCCTATGT GCCAATTAAA GAACTGCTGG AAATTGACGG CCTGGATGAA
GCAACCGTTG AAGCCCTGCG TGAACGCGCT AAAAACGCAC TGACCACCCT GGCACTGGCT
CAGGAAGAAA GCCTTGGTGA TAACAAGCCG GCTGATGACC TGCTGAATTT AGAAGGTCTT
GATCGTGCGA TTGCGTTCAA GCTGGCTGCC CATGGTGTTT GTACGCTGGA AGATCTCGCT
GAGCAAGGCG TTGATGACCT GGCTGATATC GAAGGTTTAA CCGACGAGAA AGCCGGCGAA
CTCATCATGG CCGCACGTAA TATTTGCTGG TTCGGCGACG AAGCGTAA
 
Protein sequence
MNKEILAVVE AVSNEKSLPR EKIFEALESA LATATKKKYE QEIDVRVEID RKSGDFDTFR 
RWVIVEEVTQ PTKEITLEAA RYEDESFNVG EYVEDQIESV TFDRITTQTA KQVIVQKVRE
AERAMVVDQF RSHEGEIITG VVKKVNRDNI ALDLGSNAEA VILREDMLPR ENFRPGDRIR
GVLYAVRPEA RGAQLFVTRS KAEMLIELFR IEVPEIGEEL IEIKAAARDP GSRAKIAVKT
NDKRIDPVGA CVGMRGARVQ AVSTELGGER IDIILWDDNP AQFVINAMAP ADVASIVVDE
DKHTMDIAVE AGNLAQAIGR NGQNVRLASQ LSGWDLNVMT VDDLQAKHQA EAHAAIATFT
KYLEIDEDFA TVLVEEGFSS LEELAYVPIK ELLEIDGLDE ATVEALRERA KNALTTLALA
QEESLGDNKP ADDLLNLEGL DRAIAFKLAA HGVCTLEDLA EQGVDDLADI EGLTDEKAGE
LIMAARNICW FGDEA