Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dgeo_1234 |
Symbol | nusA |
ID | 4057743 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Deinococcus geothermalis DSM 11300 |
Kingdom | Bacteria |
Replicon accession | NC_008025 |
Strand | + |
Start bp | 1312999 |
End bp | 1314186 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641230248 |
Product | transcription elongation factor NusA |
Protein accession | YP_604699 |
Protein GI | 94985335 |
COG category | [K] Transcription |
COG ID | [COG0195] Transcription elongation factor |
TIGRFAM ID | [TIGR01953] transcription termination factor NusA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000227935 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.970518 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCAAC CTGAATTTAA TTTTGCGGAC GCGTTGCGTG AAGTGGCGCA GGCCCGCAAC ATCAATGAGC TGCAGCTGAT CGAGGCGTTC GAGCAGTCGC TCGCCCAGGC CTACAGCCGC AACGTCGAAC CGGACCGCCG GGTGGAGGTG CACCTTGACC CGGTGAGCGG CGAACTCGAA GTGCTGATTG TGCGCGAGGT GGTCGAGAAG GTCGAGGACG AGAACCTCCA AATCTCACTC GCGGACGCCC TCGAACTCGA CCCTGGCGTC GAGATCGGCA TGGAGATGGA GTTCCCGGTC GACCGTGAGA AGTTCTCCCG GATCGCCCTC CAAGCCGCCA AGCAAACCCT GACGCAAAAG ATGCGCGAGA CCGAGCGCAA CGTGGTCTAC AACGAGTACA AGGACCGCGA GGGCCAGGTG CTCACGGCGC AGGTCGTCCG CTCCGACAAC AAAGGCAACT GGTTTGTGGA GCTGGGCGCG GGTGAGGCGA TTTTGCCGCC CCGCGAGCAG ATCCCGGGTG AAAAGCTGGT GCCCGGCAAC CGTGTCAAGA TCTACCTCAA GGAAGTCCGC AAGACGCCCA AGGGGCCAAC CATTCTGGCA AGCCGTGCCG ACGAGCGGCT GCTGGAGTAC CTCCTGCGGC AGGAAATTCC GGAAGTTGCC AACGGCATCG TCGAGATCAA GGCGATCGCG CGCGAGGCGG GACAGCGCTC CAAGGTGGCG GTCTACAGCC ATAACCCCAA CGTGGACCCC ATCGGCGCCT GTATCGGGCA CCGTGGCAAC CGCATTCAGG CTGTGACCGG CGAGCTGGGC CGCGAGCGAG TGGACGTGAT CCTGTGGGAC GCAAATGCGC GCGACTTCAT CCGCAACGCC CTGTCACCTG CCAAGGTGGG CCTCATCGAG GTCCGGCCCG ATACCCGTGA GGCGACCGTC ACGGTCACAC CCGATCAGCT CTCGCTGGCC ATCGGCAAGG GCGGGCAGAA CGTGCGCCTC GCGGCCAAGC TGACCGGCTT TAAAATCGAC CTGCGCGAAA CCGCCGCCAT TCAGGACCTC GACGCTGCCA TGCAGCAGGC GCTGCAGGAG GAGCAGGGGA ACACCGGGCC AAGCAGCGCT GCCGCGTCCG CCTTCGACGC GCTCTTCCGG GACAGCAAGT CGGTGGCGAC CGCCAGCCCG GACGACGAGC AGGAGTAA
|
Protein sequence | MAQPEFNFAD ALREVAQARN INELQLIEAF EQSLAQAYSR NVEPDRRVEV HLDPVSGELE VLIVREVVEK VEDENLQISL ADALELDPGV EIGMEMEFPV DREKFSRIAL QAAKQTLTQK MRETERNVVY NEYKDREGQV LTAQVVRSDN KGNWFVELGA GEAILPPREQ IPGEKLVPGN RVKIYLKEVR KTPKGPTILA SRADERLLEY LLRQEIPEVA NGIVEIKAIA REAGQRSKVA VYSHNPNVDP IGACIGHRGN RIQAVTGELG RERVDVILWD ANARDFIRNA LSPAKVGLIE VRPDTREATV TVTPDQLSLA IGKGGQNVRL AAKLTGFKID LRETAAIQDL DAAMQQALQE EQGNTGPSSA AASAFDALFR DSKSVATASP DDEQE
|
| |