Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nwi_0022 |
Symbol | nusA |
ID | 3676462 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrobacter winogradskyi Nb-255 |
Kingdom | Bacteria |
Replicon accession | NC_007406 |
Strand | + |
Start bp | 22246 |
End bp | 23862 |
Gene Length | 1617 bp |
Protein Length | 538 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637711557 |
Product | transcription elongation factor NusA |
Protein accession | YP_316642 |
Protein GI | 75674221 |
COG category | [K] Transcription |
COG ID | [COG0195] Transcription elongation factor |
TIGRFAM ID | [TIGR01953] transcription termination factor NusA [TIGR01954] transcription termination factor NusA, C-terminal duplication |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0361846 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGCCG TCAGCGCCAA TAAGCTCGAA CTGCTGCAGA TCGCAGACGC GGTGGCGCGC GAAAAGACCA TCGACCGCGG CATCGTGATC GCGGCGATGG AGGACGCCAT CGCGAAGGCG GCGCGGGCGC GGTACGGCAG CGAGACTGAC GTTCATGCGG AGATCCACCC GAAGACCGGA CAGCTCCAGC TCACCCGCCA CATGCTGGTG GTCGAGCAGG TCGAGAATGC CGCGAACCAG ATCTCGCTGA AGGACGCCCA GCGGGCCAAT CCCGGCGCAC AGATCGGCGA CACCATCGCC GATACGCTGC CGCCGCTGGA ATATGGCCGC ATCTCCGCGC AGTCCGCCAA ACAGGTGATC GTGCAGAAGG TGCGCGAGGC CGAACGCGAC CGGCAGTATC AGGAGTTCAA GGATCGCATC GGCGATATCG TCAACGGGAT TGTCAAGCGT GTCGAATACG GCAGCGTGAT CGTCGACCTC GGCCGCGGCG AAGCCGTCAT TCGCCGCGAC GAGATGCTGC CGCGCGAGGT ATTCCGCAAC GGAGACCGCG TCCGGGCCTA TATCTTCGAC GTGCGCCGCG AAACCCGCGG CCCGCAGATC TTCCTGTCGC GCACCCATCC GCAGTTCATG GTGAAGCTTT TTACGCAGGA AGTGCCGGAA ATCTACGACG GCATCGTCGA GATCAAGGCG GTGGCGCGCG ACCCCGGTTC CCGCGCCAAG ATCGGCGTGG TGTCGCGGGA TTCCTCGGTC GATCCGGTCG GCGCCTGCGT CGGCATGCGC GGCTCGCGCG TTCAGGCGGT CGTCAACGAG TTGCAGGGCG AAAAGATCGA CATCATCCCG TGGTCGCCGG ATATCGCGAC CTTCGTCGTC AACGCGCTGG CGCCGGCGGA GGTCGCGAAA GTCGTGATCG ACGAGGACCG CGAGCGGATC GAGGTCGTGG TCCCCGACAC CAACAACCAG TTGTCGCTCG CGATCGGACG GCGCGGACAG AACGTGCGGC TCGCCTCGCA ACTGACGGGC TGGGACATCG ACATCCTGAC CGAGCAGGAG GAATCGGAGC GACGCCAGGC GGACTTCGAG AATGCGACCC GCATGTTCAT GGAAACACTC AACGTCGACG AGGTCGTCGG GCAGTTGCTG GCGTCCGAGG GTTTCACCTC TGTTGAAGAA CTGACGCTGG TCGACACGCG GGAAATCGCC GGCATCGAGG GCTTTGACGA CGAAACCGCG ACCGAATTGC AGAACCGGGC CCGAGAATAT CTGGAGCAGC TCGAGGCCGA ACTGGAAAAC AGGCGCAAGG AGCTCGGCGT GGATGATGCG CTGAAGACCG TGCCCGGCGT GACCTCGAAG ATGCTGGTGA AACTCGGCGA AAACGAAGTC AGGACCATCG AGGATCTGGC CGGCTGCGCC ACCGACGATC TGGTCGGCTG GACCGAACGC AAGGAGGGCG AAGCGGTCAA GCACGCGGGT TATTTCGACG GCATCGAGAT CTCGCGGGAG GACGCGGAGG CCATCATCAT GCAGGCTCGC CTGAGTGTCG GCTGGATCAA CGAGGCCGAC CTCGCGAAAC CGGCGGAGGC TGAGGATATC GCTGCTGAAG ATCAGCCGGC CGAATGA
|
Protein sequence | MAAVSANKLE LLQIADAVAR EKTIDRGIVI AAMEDAIAKA ARARYGSETD VHAEIHPKTG QLQLTRHMLV VEQVENAANQ ISLKDAQRAN PGAQIGDTIA DTLPPLEYGR ISAQSAKQVI VQKVREAERD RQYQEFKDRI GDIVNGIVKR VEYGSVIVDL GRGEAVIRRD EMLPREVFRN GDRVRAYIFD VRRETRGPQI FLSRTHPQFM VKLFTQEVPE IYDGIVEIKA VARDPGSRAK IGVVSRDSSV DPVGACVGMR GSRVQAVVNE LQGEKIDIIP WSPDIATFVV NALAPAEVAK VVIDEDRERI EVVVPDTNNQ LSLAIGRRGQ NVRLASQLTG WDIDILTEQE ESERRQADFE NATRMFMETL NVDEVVGQLL ASEGFTSVEE LTLVDTREIA GIEGFDDETA TELQNRAREY LEQLEAELEN RRKELGVDDA LKTVPGVTSK MLVKLGENEV RTIEDLAGCA TDDLVGWTER KEGEAVKHAG YFDGIEISRE DAEAIIMQAR LSVGWINEAD LAKPAEAEDI AAEDQPAE
|
| |