Gene Dfer_0033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDfer_0033 
SymbolnusA 
ID8223599 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDyadobacter fermentans DSM 18053 
KingdomBacteria 
Replicon accessionNC_013037 
Strand
Start bp35354 
End bp36595 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content50% 
IMG OID644927913 
Producttranscription elongation factor NusA 
Protein accessionYP_003084470 
Protein GI255033849 
COG category[K] Transcription 
COG ID[COG0195] Transcription elongation factor 
TIGRFAM ID[TIGR01953] transcription termination factor NusA 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.122242 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.232116 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATAGCG GAATATTAAT TGAGTCATTC GCGGAGTTCG CAAGCTCTAA AAACATCGAT 
CGTCCCACGA TGATCAGTGT TTTGGAAGAG GTATTTCGTA CTATGATTCG CAAGAAATAT
GGCACCGACG ATAATTTTGA TGTGATTATC AACCCGGAAA GCGGTGACCT CGAAATGTGG
CGGACACGCG AGATCGTAGA CGACAACTCC GAGGACATCT GGGAATATAA TAAAATACCG
TTGAACGAAG CGCGCAAAAT ACAGAGCGAC TTCGAGGTGG GCGAAGAAGT GGCGGAAGAA
GTGAAGCTGG TAGAATTCGG TCGCCGCCTG GTGCAAACTG CCCGCCAGAC CCTGATCCAG
AAGATCAAGG ATATGGAAAA AGAGATCATG TATGAAAAAT ACAAGGATCA GGTGGGTGAA
ATGATCACCG GAGAGGTGTA TCAGACTTTG AGACATGAAG TAATCATCGT GGATTCGGAA
GGTAACGAAC TCTCGCTGCC CCGCACCGAG CAGATCTCGA AAGACCGTTT CCGCAAGGGC
GAATCCGTGA AATCGGTGAT CCATAAAGTG GAAATGAACA ACGGCTCGCC GAAGATCACG
TTGTCGAGAA CATCGCCGGT TTTCCTTGAA AGACTGTTCG AAGTGGAGAT CCCGGAAATC
TACGACGGAA TCATTTCTGT CCGCAAAGTT GTTCGTGAAC CAGGGGAACG TGCCAAAGTG
GCCGTGGAAT CCTACGACGA CCGGATCGAT CCGGTGGGTG CCTGCGTTGG TATGAAAGGC
TCACGCATCC ATTCTATCGT GCGTGAACTG GGCAATGAGA ATATCGACGT GATCAACTAC
ACCGATAACC TCGAACTGCT CATCAGCCGC GCATTGAGCC CTGCGAAAGT AAGCTCGATG
CAGATCGACC GCGAAGCGAA ACGCGTATCC GTGTTCCTCA AACCCGACCA GGTATCACTG
GCCATCGGTA AGGGCGGACA GAATATCAAG CTCGCGGGCA AGCTCGTGGG AATGGAAATC
GATGTATTCC GCGATATCGA AGAACAAAAT GATGAAGACG TCGATCTGAC CGAATTCTCG
GATGAAATCG ACGAGTGGAT CATCGACGAA TTGCACAAAA TCGGTCTGGA TACCGCTAAA
AGCGTACTGG CCCTGAGCAA GGAAGAACTC GTGCGCCGTA CCGACCTTGA AGAAGCGACG
GTAGAGGACG TGCTGGATAT CCTGCGGAAA GAATTTGAAT AA
 
Protein sequence
MDSGILIESF AEFASSKNID RPTMISVLEE VFRTMIRKKY GTDDNFDVII NPESGDLEMW 
RTREIVDDNS EDIWEYNKIP LNEARKIQSD FEVGEEVAEE VKLVEFGRRL VQTARQTLIQ
KIKDMEKEIM YEKYKDQVGE MITGEVYQTL RHEVIIVDSE GNELSLPRTE QISKDRFRKG
ESVKSVIHKV EMNNGSPKIT LSRTSPVFLE RLFEVEIPEI YDGIISVRKV VREPGERAKV
AVESYDDRID PVGACVGMKG SRIHSIVREL GNENIDVINY TDNLELLISR ALSPAKVSSM
QIDREAKRVS VFLKPDQVSL AIGKGGQNIK LAGKLVGMEI DVFRDIEEQN DEDVDLTEFS
DEIDEWIIDE LHKIGLDTAK SVLALSKEEL VRRTDLEEAT VEDVLDILRK EFE