Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_0976 |
Symbol | |
ID | 4026199 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 1093279 |
End bp | 1095123 |
Gene Length | 1845 bp |
Protein Length | 614 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637966153 |
Product | sigma 70 (RpoD) |
Protein accession | YP_573032 |
Protein GI | 92113104 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.741163 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGGAA ATACGCAGCA GTCACGTCTG AAGGAGTTGA TCGCCCAGGG TAAGGAACAG GGCTTCCTGA CCTATGCGGA GGTCAACGAC CACCTTCCCG AGGATATCGC CGATCCCGAT CAGGTGGAAG ACATCATCGC GATGATCAAC GACATGGGTA TCAGCGTCGT CGAGGAAGCG CCTGACGAAG ATACCCTGAT GATGTCCGAT CAATCCGCCG ACGAATCGGC GGCGGAAGAA GCCGTCGCCG CGCTGGCCGC GGTGGAAAGC GATGTCGGCC GCACCACCGA TCCCGTGCGC ATGTACATGC GCGAGATGGG TACGGTGGAA TTGTTGACGC GCGAGGGCGA GATCGAAATC GCCAAGCGTA TCGAGGAAGG CACCCGTGAG GTGATGTCGT CGCTGGCCTA TTTGCCTGGC GCCGTGGACT CCATTCTCAG CGCTTACGAT GCCACCCAGG ATGAAGAAGC GCCGGGCCGG CTGTCCGACC TGTTTTCCGG CTTCATCGAC CCCGACGAAG GGATTCCCGG CGTGGCCGAG GCGGAGGTGC CCGAGCCCGA GCCCGAGCCG AGCGCCGCGG ACGGCGATGT CTCCGAGGAC GATGACGAGG CCGAGGAAGA GGAAACCGGT GGTGGTCCGG ACCCCGAAGA AGCGCGGGCA CGCTTCGAGC AGATCCGTGA GCAGAACGAG CGCGCCAAGG CCGCGATCGA AGAGCACGGG CTCGGCTCGA AGGAAGCCAG CGCCGAGCAG GCCCGTCTGG CCGAGCTGTT CTCGCCGATC AAGCTGGTGC CCAAGCATTT CGAGCGTCTG GTGGGCCAGG TGCGGATCAG CGTCGAGCAG ATCCGCGGCC AGGAAAAGGC CATCATGCAG ATCTTCGTGA AGCAGGCCAA GGTGCCGCGC AAGACCTTCA TCGGGTCGTT TCCGGGCGCC GAGTCCGACC TCGAGTGGCT GGACCGGTTC ATGGCCAAGC ACACCAAGTT CGCCGATCGT CTCGAGCCGT TCCGCGCCGA CATCCAGCGT GCCCAGCGCA AGATCGGCTT CGAGGAAGAG ATGGTCCTGC TGCCGGTCTC GCACATCAAG GAAGTCAATC GCCGCCTGTC GATCGGCGAG GCCAAGGCGC GTCGCGCCAA GAAGGAAATG GTCGAGGCCA ACCTGCGTCT GGTCATCTCG ATCGCCAAGA AGTACACCAA CCGTGGCCTG CAGTTCCTGG ACCTCATCCA GGAAGGCAAC ATCGGCCTGA TGAAGGCGGT GGACAAGTTC GAGTATCGTC GTGGCTACAA GTTCTCGACC TATGCCACCT GGTGGATTCG TCAGGCGATC ACGCGGTCGA TCGCCGACCA GGCGCGCACC ATCCGTATTC CGGTGCACAT GATCGAGACC ATCAACAAGC TCAACCGCGT GTCGCGGCAG ATGCTGCAGG AGATGGGGCG CGAGCCGACA CCGGAAGAAT TGGGCGAGCG TCTCGAGATG CCCGAGGACA AGGTGCGCAA GGTGCTCAAG ATCGCCAAGG AACCGATCTC CATGGAGACT CCGATCGGCG ACGACGACGA CTCGCATCTC GGCGATTTCA TCGAGGATGG CACGATGCTG TTGCCGATCG ACTCCGCGAC CGGCGAAGGC TTGATCGAGG CGACCCGCAA CGTGCTGGGC GGTCTGACCG CGCGCGAGGC CAAGGTGCTG CGCATGCGCT TCGGCATCGA CATGAACACC GACCACACGC TGGAAGAGGT CGGCAAGCAG TTCGATGTCA CCCGCGAGCG TATCCGTCAG ATCGAGGCCA AGGCACTGCG CAAGTTGCGT CACCCCTCAC GCTCCGAGCC GCTGCGCACC TTCCTCGACG AGTAA
|
Protein sequence | MAGNTQQSRL KELIAQGKEQ GFLTYAEVND HLPEDIADPD QVEDIIAMIN DMGISVVEEA PDEDTLMMSD QSADESAAEE AVAALAAVES DVGRTTDPVR MYMREMGTVE LLTREGEIEI AKRIEEGTRE VMSSLAYLPG AVDSILSAYD ATQDEEAPGR LSDLFSGFID PDEGIPGVAE AEVPEPEPEP SAADGDVSED DDEAEEEETG GGPDPEEARA RFEQIREQNE RAKAAIEEHG LGSKEASAEQ ARLAELFSPI KLVPKHFERL VGQVRISVEQ IRGQEKAIMQ IFVKQAKVPR KTFIGSFPGA ESDLEWLDRF MAKHTKFADR LEPFRADIQR AQRKIGFEEE MVLLPVSHIK EVNRRLSIGE AKARRAKKEM VEANLRLVIS IAKKYTNRGL QFLDLIQEGN IGLMKAVDKF EYRRGYKFST YATWWIRQAI TRSIADQART IRIPVHMIET INKLNRVSRQ MLQEMGREPT PEELGERLEM PEDKVRKVLK IAKEPISMET PIGDDDDSHL GDFIEDGTML LPIDSATGEG LIEATRNVLG GLTAREAKVL RMRFGIDMNT DHTLEEVGKQ FDVTRERIRQ IEAKALRKLR HPSRSEPLRT FLDE
|
| |