Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_2696 |
Symbol | |
ID | 4028185 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 3022411 |
End bp | 3024213 |
Gene Length | 1803 bp |
Protein Length | 600 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637967904 |
Product | sulfite reductase (NADPH) alpha subunit |
Protein accession | YP_574742 |
Protein GI | 92114814 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0369] Sulfite reductase, alpha subunit (flavoprotein) |
TIGRFAM ID | [TIGR01931] sulfite reductase [NADPH] flavoprotein, alpha-component |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGCAAG GACCCCTGTC CGAATCGAAC AGTCCCCTCA GCGCCGATCA GGCACATCGG CTCAACGCGG CGCTGGCCGA CCTCGATGCC CGACAGCGCG CGTGGCTGCA GGGCTACCTG GCCGGCCTGG ACGCGCAGGC GCCGGTGAGC GAGGCGAGTG TCGTGTCGTC GGCCTCGCCC CCGGGCGAAC CGTTGACCGT GCTGTTCGGC AGTCAGACCG GCAATGCCGA GGGCGTGGCC GAGCAGGCCG CGGCTCGGGC ACGCGAGCGG GGCCTGGAGG TCGAATTGAA GGATATGGCG TCCTTCGGCA AGCAGGATCT CAAGCGCGCC ACACGCCTGA TGGCAGTGGT CAGCACGCAA GGCGACGGCG ATCCCCCCGA TGGTGCCCTG GGCTTTTACG AGTTGCTGGC CGGCCGCAAG GCGCCGCGCC TGGGCGAGGA TCATCATTTC GCGGTGCTGG GGCTGGGGGA CGCCAGCTAC GAATACTTTT GCGAATCCGG CAAGGTACTC GACTCGCGGC TCGAATCCCT GGGCGCCCAG CGCTTGTGCG CGCGTGTCGA TGCCGATGTC GACTACGAGG ACACCGCCAG CCAGTGGATC GAGTCCACTC TCGCGGCCTT CGCCGAGCTT GCGGGGGCCT CGGCGCCGGA TGATGGCGGG AGCGCTGCCA GCACCGAGCG TGCTGCAGCG ACCTATTCGC GTTCCCATCC CTTCCAGGCG GAAGTGCTCG AGGTACAGCC GCTCAACACC GAGGACTCCG ACAAGCAGAC CCTGCATGTC GAGCTCTCGC TGGAGGAGTC GGGGCTGGAC TACCTGCCGG GCGATGCCGT GGGCATCGTG CCGCAGAACG ATCCCGCCTA CGTCGATGAG CTGCTGGCGG CCTTGCGGCT GGATGGCGAG GCGCCGCTCG AGGAGGGGCG GCGCTTGCGG GACGCCTTGC TCCGCGATTT CGAGATTACC ACTCTGACGC GCCCCTTCCT CAATCAGTGG GCCGAGATCA GCGACGATGC CGAGCTGCGA CGCCTGCTCG ACGAGGAATC GCGTGACGAG CTGCGTGACT GGCTGCAGGG GCGGCATATC ATCGATGTGC TGGAGCGCTT TCCCGTCGAG GGCGTGGAAG CGGAGAGCTT CATTCGTGCC CTGCGCAAGC TGCCGCCACG CTTGTACTCG ATCGCCTCGA GCCAGGCCGC CGCGCCGGAC GAGGTACACC TCACGGTGGG GGTGGTGCGC TACGAGACCC ACGGCCGTGC CCGCAACGGC GTGGCCACGA CCTATCTGGC CGACCGCGTG AAGCCCGGCG ACCAGGTGCC GATTTACATC GACCACAACA AGCATTTCAA GCTGCCCGAC GACGATTCGG CACCGGTCGT GATGATCGGC CCCGGTACTG GCGTGGCGCC GTTTCGTGCC TTCCTGCAGG AGCGCGAGGC ACGGGACGCG AGCGGCGACA ACTGGTTGTT CTTCGGCGAT CGCCGACGGC GCAGCGACTT CCTGTATCAG GCCGAGTGGC TGCAGTGGCG CAAGACGGGA TTGCTGACAC GGCTCGACGT GGCGTTTTCG CGCGACCAGC AAGACAAGGT CTATGTGCAG GACCGTCTGC GCGAACAGGC CGCCACACTC TATGAATGGT TGCAGGCCGG TGCCTATCTT TACGTCTGCG GCGATGCCGA TCGCATGGCG CCGGACGTGC ATCAGGCCTT GCTCGATGTC ATCCGCGAGC AAGGCGGTCA CGATGAAGAG GCCGCTGCCG AGTACCTGCG CGACCTGCAG CAGCAAAAGC GCTATCAGCG CGACGTCTAC TGA
|
Protein sequence | MSQGPLSESN SPLSADQAHR LNAALADLDA RQRAWLQGYL AGLDAQAPVS EASVVSSASP PGEPLTVLFG SQTGNAEGVA EQAAARARER GLEVELKDMA SFGKQDLKRA TRLMAVVSTQ GDGDPPDGAL GFYELLAGRK APRLGEDHHF AVLGLGDASY EYFCESGKVL DSRLESLGAQ RLCARVDADV DYEDTASQWI ESTLAAFAEL AGASAPDDGG SAASTERAAA TYSRSHPFQA EVLEVQPLNT EDSDKQTLHV ELSLEESGLD YLPGDAVGIV PQNDPAYVDE LLAALRLDGE APLEEGRRLR DALLRDFEIT TLTRPFLNQW AEISDDAELR RLLDEESRDE LRDWLQGRHI IDVLERFPVE GVEAESFIRA LRKLPPRLYS IASSQAAAPD EVHLTVGVVR YETHGRARNG VATTYLADRV KPGDQVPIYI DHNKHFKLPD DDSAPVVMIG PGTGVAPFRA FLQEREARDA SGDNWLFFGD RRRRSDFLYQ AEWLQWRKTG LLTRLDVAFS RDQQDKVYVQ DRLREQAATL YEWLQAGAYL YVCGDADRMA PDVHQALLDV IREQGGHDEE AAAEYLRDLQ QQKRYQRDVY
|
| |