Gene Information |
Locus tag | Csal_3075 |
Symbol | |
ID | 4028879 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 3425829 |
End bp | 3427322 |
Gene Length | 1494 bp |
Protein Length | 497 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637968287 |
Product | NusA antitermination factor |
Protein accession | YP_575118 |
Protein GI | 92115190 |
COG category | [K] Transcription |
COG ID | [COG0195] Transcription elongation factor |
TIGRFAM ID | [TIGR01953] transcription termination factor NusA; [TIGR01954] transcription termination factor NusA, C-terminal duplication |
|
|
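The length fields above follow from the coordinates, and a short check makes the arithmetic explicit. This is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
start_bp = 3425829
end_bp = 3427322

# An inclusive span contains (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of 3 bases; the last codon is assumed to be
# the stop codon, which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # prints "1494 497"
```

Both values agree with the Gene Length (1494 bp) and Protein Length (497 aa) fields in the record.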
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAAAG AGATTCTGCT GGTCGTAGAT GCGATCTCGA ACGAAAAGGG GGTGCCGCGT GAGGTGATCT TCGAGGCCGT GGAGTCGGCG CTGGCGAGTG CATCGCGTAA GCGCTTCGAT GGCCAGGAAG TGAGCACTCG CGTCAAGATC GATCGCGCGA CCGGTGATTA CGACACCTTC CGTCGCTGGA CTGTCGTCGA GGACGAGGCG TTCGAGAATC CGGATAGCGA AATCCCGCTG TCCGAGGCGG AGCGTCGCGA TCCGCCCCTG GCGCTGGGTG ACGTGGTCGA GGAGCAGGTC GAGTCGGTCG CATTCGGTCG CATCGCCGCC CAGACCGCCA AGCAGGTCAT CGTGCAGAAG GTGCGCGAGG CCGAGCGTGC CGAGGTCGTC CGCCAATACG CCGAGCGTGA AGGCGAACTG GTGGCCGGTA TCGTCAAGAA GACCACGCGC GAAGGATTGA TCGTCGACCT GGGCGAGAAT GCCGAAGCGT TCCTGCCGCG CAGCGAGATG ATCCCCGGCG AGCGCTATCG CATGAACGAA CGCGTGCGTG CCCTGTTGTG GAAGGTCGAC GCCGAGGCGC GCGGGTCGCA GTTGATCCTG ACGCGGACGC GCCCCGACAT GATCGTCGAG CTCTTCAAGA TCGAGGTGCC CGAGATCGCC GAGCAGCTCA TCGAGATCAA GGGCGCGGCG CGCGATCCCG GCGCTCGCGC CAAGATTGCA GTCAAGACCA ATGACAAGCG CATCGACCCG GTCGGTGCGT GTGTCGGCAT GCGCGGGTCG CGCGTGCAAG CGGTGTCCAA CGAGCTGCGC AACGAGCGCG TGGACATCAT CATGTGGGAC GACAACCCGG CGCAACTGGT GATCAATGCC ATGGCGCCCG CGGAAGTCGG CTCCATTCTC GTCGACGAAG ACGCCCATTC CATGGATGTC GCCGTCGCCG AGGACAACCT GGCGCAGGCC ATCGGCCGCA GCGGCCAGAA CGTTCGCCTG GCTTCCGAGC TGACGGGCTG GACGCTCAAC GTGATGACCG AAGAAGAGGC CGAAGGCAAG CGCGAGCAAG AAATCGACAG CCTGATCGAG TACTTCATCA ATCACCTCGA AGTCGACGAG GAACTCGCCC GTCTGCTGGT CGAGGAAGGC TTTACCTCGC TCGAGGAGCT GGCGTATGTG CCGCTCGAAG AGTTGCTCGA GATCGAGGAA TTGGATGAAG CACTGATCGA AACGCTGCGC GCTCGAGCCA AGGACGAATT GCTGACGCTG GCCATCGCTT CCGAAGAGGC GCTGGACGGT GCGCAGCCCG ATGACGACCT CCTTGAAATG GAGGGCATGG ATCGTCACTT GGCATTTACG CTCGCCAGTC GAGGCATCGT CACGCGTGAG GACTTGGCCG AGCAGTCCAT CGATGATCTC AAGGACATCG ACGATGTGGA CGAAGAGCGC GCCGCGGCGC TGATAATGAC CGCTCGTGCG CCTTGGTTCG AGAGCGAACA GTAA
|
Protein sequence | MSKEILLVVD AISNEKGVPR EVIFEAVESA LASASRKRFD GQEVSTRVKI DRATGDYDTF RRWTVVEDEA FENPDSEIPL SEAERRDPPL ALGDVVEEQV ESVAFGRIAA QTAKQVIVQK VREAERAEVV RQYAEREGEL VAGIVKKTTR EGLIVDLGEN AEAFLPRSEM IPGERYRMNE RVRALLWKVD AEARGSQLIL TRTRPDMIVE LFKIEVPEIA EQLIEIKGAA RDPGARAKIA VKTNDKRIDP VGACVGMRGS RVQAVSNELR NERVDIIMWD DNPAQLVINA MAPAEVGSIL VDEDAHSMDV AVAEDNLAQA IGRSGQNVRL ASELTGWTLN VMTEEEAEGK REQEIDSLIE YFINHLEVDE ELARLLVEEG FTSLEELAYV PLEELLEIEE LDEALIETLR ARAKDELLTL AIASEEALDG AQPDDDLLEM EGMDRHLAFT LASRGIVTRE DLAEQSIDDL KDIDDVDEER AAALIMTARA PWFESEQ
|
| |
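The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch, not the database's own method: the helper names `translate` and `gc_fraction` are hypothetical, and the codon table is the standard one, which translation table 11 shares for codon-to-amino-acid assignments (table 11 differs in which start codons it allows). Only the first 30 bases of the gene sequence are used here to keep the example short.

```python
from itertools import product

# Standard codon table built in TCAG order; translation table 11 uses the
# same codon-to-amino-acid assignments ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, ignoring spaces (hypothetical helper)."""
    seq = dna.replace(" ", "").upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (hypothetical helper)."""
    seq = dna.replace(" ", "").upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base groups of the gene sequence shown above.
prefix = "ATGAGTAAAG AGATTCTGCT GGTCGTAGAT"
print(translate(prefix))  # prints "MSKEILLVVD"
```

The output matches the first ten residues of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field; the result is not stated here because it has not been computed for the full sequence.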