Gene Phep_0140 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_0140 
SymbolnusA 
ID8251225 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp164881 
End bp166116 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content42% 
IMG OID644933790 
Producttranscription elongation factor NusA 
Protein accessionYP_003090428 
Protein GI255530056 
COG category[K] Transcription 
COG ID[COG0195] Transcription elongation factor 
TIGRFAM ID[TIGR01953] transcription termination factor NusA 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAATA TTAATTTAAT CGATTCATTT CAAGAGTTTA AAGACTTCAA GAACATCGAC 
CGTCCTACAG TGATCAGTGT GCTGGAAGAG GTATTTCGCA GTATGCTGCG CAAAAAATAT
GGTACTGATG AGAATTGTGA CGTAATTGTT AACCCGGATA ACGGTGATTT GGAAATCTGG
CGTACCAGAA AAGTGATGGA GGATGGTTTT TCTGAGGATG ACGATCTGGA AATTGAACTT
GCAGAGGTAA AACAACTGGA TCCGGATATG GAAGTTGGCG ACGATTACAT TGAGCAGATC
ACTTTGGAAA GCTTTGGCCG CAGGGCGATT TTAGCTGCCC GTCAGACCCT GGTTTCTAAA
GTTCTGGAAC TGGAGAAAGA CGAGATCTTT AAAAAATATA AAGACAGGGT TGGTGAGATT
GTGACCGGTG AGGTTTACCA GGTATGGAAA AAAGAAACCC TGGTGCTGGA TGATGAAGGC
AACGAGCTGA TGATGCCTAA AACAGAGCAG ATACCGGCCG ATTATTTCAA AAAAGGGGAT
ACTGTACGTG CAGTGATCCT GAAGGTGGAT ATGGTAAATG CTACACCTAA GATTATCATT
TCGAGGATTG CACCTGAATT TTTACAGCGC CTGTTTGAAA TTGAGGTTCC TGAGATCTTT
GATGGTCTGA TCACCATTAA AAAGATTGTT CGTGAGCCAG GCGAAAGAGC TAAGGTTGCG
GTAGAATCTT ACGATGACAG GATTGACCCG GTGGGTGCCT GTGTAGGTAT GAAAGGTTCG
CGTATCCATG GGATCGTAAG AGAGCTGAAA AACGAGAATA TTGACGTAAT TAACTTTACC
AATAACATTT CACTATACAT CACAAGGGCT TTGAGCCCGG CCAAGATCAC TTCTATTAAA
TTAGATGATG AAACCAAACA TGCTTCGGTT TACCTGAAGC CTGACCAGGT TTCACTGGCC
ATAGGCCGTG GTGGGCATAA CATTAAACTG GCCGGTAAAT TGACCGGTTA TGAAATTGAT
GTATACCGTG AAGCAGGTGA AGAAGACGAA GATGTGGACA TCGAAGAATT CTCGGATGAG
ATCGATAGCT GGATCATTGA TGAGCTGAAA GCGATAGGCT GTGATACGGC AAAAAGTGTG
CTGGCACTTT CTGTAGACGA ATTGGTTAAA CGTACCGATT TAGAGGAAGA AACCATCAAA
GAAGTGATGA GTATTTTAAA ATCAGAATTT GAATAA
 
Protein sequence
MSNINLIDSF QEFKDFKNID RPTVISVLEE VFRSMLRKKY GTDENCDVIV NPDNGDLEIW 
RTRKVMEDGF SEDDDLEIEL AEVKQLDPDM EVGDDYIEQI TLESFGRRAI LAARQTLVSK
VLELEKDEIF KKYKDRVGEI VTGEVYQVWK KETLVLDDEG NELMMPKTEQ IPADYFKKGD
TVRAVILKVD MVNATPKIII SRIAPEFLQR LFEIEVPEIF DGLITIKKIV REPGERAKVA
VESYDDRIDP VGACVGMKGS RIHGIVRELK NENIDVINFT NNISLYITRA LSPAKITSIK
LDDETKHASV YLKPDQVSLA IGRGGHNIKL AGKLTGYEID VYREAGEEDE DVDIEEFSDE
IDSWIIDELK AIGCDTAKSV LALSVDELVK RTDLEEETIK EVMSILKSEF E