Gene Dgeo_1234 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDgeo_1234 
SymbolnusA 
ID4057743 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDeinococcus geothermalis DSM 11300 
KingdomBacteria 
Replicon accessionNC_008025 
Strand
Start bp1312999 
End bp1314186 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content65% 
IMG OID641230248 
Producttranscription elongation factor NusA 
Protein accessionYP_604699 
Protein GI94985335 
COG category[K] Transcription 
COG ID[COG0195] Transcription elongation factor 
TIGRFAM ID[TIGR01953] transcription termination factor NusA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000227935 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.970518 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCAAC CTGAATTTAA TTTTGCGGAC GCGTTGCGTG AAGTGGCGCA GGCCCGCAAC 
ATCAATGAGC TGCAGCTGAT CGAGGCGTTC GAGCAGTCGC TCGCCCAGGC CTACAGCCGC
AACGTCGAAC CGGACCGCCG GGTGGAGGTG CACCTTGACC CGGTGAGCGG CGAACTCGAA
GTGCTGATTG TGCGCGAGGT GGTCGAGAAG GTCGAGGACG AGAACCTCCA AATCTCACTC
GCGGACGCCC TCGAACTCGA CCCTGGCGTC GAGATCGGCA TGGAGATGGA GTTCCCGGTC
GACCGTGAGA AGTTCTCCCG GATCGCCCTC CAAGCCGCCA AGCAAACCCT GACGCAAAAG
ATGCGCGAGA CCGAGCGCAA CGTGGTCTAC AACGAGTACA AGGACCGCGA GGGCCAGGTG
CTCACGGCGC AGGTCGTCCG CTCCGACAAC AAAGGCAACT GGTTTGTGGA GCTGGGCGCG
GGTGAGGCGA TTTTGCCGCC CCGCGAGCAG ATCCCGGGTG AAAAGCTGGT GCCCGGCAAC
CGTGTCAAGA TCTACCTCAA GGAAGTCCGC AAGACGCCCA AGGGGCCAAC CATTCTGGCA
AGCCGTGCCG ACGAGCGGCT GCTGGAGTAC CTCCTGCGGC AGGAAATTCC GGAAGTTGCC
AACGGCATCG TCGAGATCAA GGCGATCGCG CGCGAGGCGG GACAGCGCTC CAAGGTGGCG
GTCTACAGCC ATAACCCCAA CGTGGACCCC ATCGGCGCCT GTATCGGGCA CCGTGGCAAC
CGCATTCAGG CTGTGACCGG CGAGCTGGGC CGCGAGCGAG TGGACGTGAT CCTGTGGGAC
GCAAATGCGC GCGACTTCAT CCGCAACGCC CTGTCACCTG CCAAGGTGGG CCTCATCGAG
GTCCGGCCCG ATACCCGTGA GGCGACCGTC ACGGTCACAC CCGATCAGCT CTCGCTGGCC
ATCGGCAAGG GCGGGCAGAA CGTGCGCCTC GCGGCCAAGC TGACCGGCTT TAAAATCGAC
CTGCGCGAAA CCGCCGCCAT TCAGGACCTC GACGCTGCCA TGCAGCAGGC GCTGCAGGAG
GAGCAGGGGA ACACCGGGCC AAGCAGCGCT GCCGCGTCCG CCTTCGACGC GCTCTTCCGG
GACAGCAAGT CGGTGGCGAC CGCCAGCCCG GACGACGAGC AGGAGTAA
 
Protein sequence
MAQPEFNFAD ALREVAQARN INELQLIEAF EQSLAQAYSR NVEPDRRVEV HLDPVSGELE 
VLIVREVVEK VEDENLQISL ADALELDPGV EIGMEMEFPV DREKFSRIAL QAAKQTLTQK
MRETERNVVY NEYKDREGQV LTAQVVRSDN KGNWFVELGA GEAILPPREQ IPGEKLVPGN
RVKIYLKEVR KTPKGPTILA SRADERLLEY LLRQEIPEVA NGIVEIKAIA REAGQRSKVA
VYSHNPNVDP IGACIGHRGN RIQAVTGELG RERVDVILWD ANARDFIRNA LSPAKVGLIE
VRPDTREATV TVTPDQLSLA IGKGGQNVRL AAKLTGFKID LRETAAIQDL DAAMQQALQE
EQGNTGPSSA AASAFDALFR DSKSVATASP DDEQE