Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_1915 |
Symbol | |
ID | 4028357 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 2174964 |
End bp | 2177825 |
Gene Length | 2862 bp |
Protein Length | 953 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637967109 |
Product | formate dehydrogenase alpha subunit |
Protein accession | YP_573966 |
Protein GI | 92114038 |
COG category | [C] Energy production and conversion [R] General function prediction only |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGATTGA CTCGAAAATC CTCCCAACCC GCCGGCCAGG GCGGTCTCGG CATCAGCCGC CGCCAATTCC TCCAGCGCAG CGGCGTGGCA ACCGGCGGTC TCGCCGCGGC CGGCTTCATG GGCCATGGCA TGATGCAGGC GGCCAGTGCC AAGGAAAAGA CGGCCTATTC CGATGCCCCC GTCGAAACCA AGCGCACCAT CTGCTCGCAC TGCTCGGTCG GCTGCGGCGT CTACGCCGAG GTGCAGGAAG GCGTCTGGAC CAAGCAGGAA CCGGCGTTCG ATCACCCCAT CAATCGGGGC GCGCATTGCG CCAAGGGCGC CTCGCTGCGC GAGCACGGGC ACTCCACCCA GCGCCTGAAG TATCCCATGA AGCTGGTCGA CGGCAAGTGG CAGCGCCTCG AGTGGGACCA GGCCATCGAG GAGATCGGCG ACAAGGTACT GGAACTGCGC GAACAGCACG GCCCCGACAG CGTCTACTGG CTGGGCTCGG CCAAGTTCAG CAATGAACAA GCCTACTTGA TGCGCAAGCT CGCCTCGCTG TGGGGCACCA ACAATACCGA TCACCAGGCA CGCATCTGCC ACTCCACCAC CGTGGCGGGC GTGGCCAATA CCTGGGGATA CGGGGCGATG ACCAACTCGC TGAACGACAT GCACTTCAGC AAGTCGATCC TGTTCATCGG CTCCAACCCC AGCGAAGCGC ACCCGGTGGC CATGCAGCAC ATCCTGCATG CCAAGGAGCG CAATCAGGCG CAGATCATCG TCGTCGACCC GCGCTTCACG CGTACCGCGG CCAAGGCCAA CAAGTACGTG CGTCTGCGTC CGGGGTCCGA CGTGGCCTAC ATCTGGGGCC TGCTCTGGCA TATCTTCGAA AACGGCTGGG AAGACCAGCA GTTCATCGAT CAGCGCGTGT TCGGCATGGA CGAAGTGCGC CGTGAAGTCG CCCAGTTCAC GCCGGACGTG GTAGAGCGCA TCACCGGCGT CAGCGAAGCC GACATGTACG ATGTCGCCAA GCGCATCAGC GAGAATCGCC CCGGCTGCGT AGTGTGGTGC ATGGGCGGGA CCCAGCACAC CACCGGCAAC AACAACACTC GTGCCTACTG CATTCTCGAG CTGGCGCTGG GCAATATCGG CGTTTCCGGC GGCGGTGCCA ACATCTTCCG TGGTCACGAC AACGTCCAGG GCGCGACCGA CCTGGGGCTG GGCTCCGACT CGCTGCCCGG TTACTACGGG CTCAGCGAAG GCGCCTGGCA GCACTGGGCC AAGGTCTGGA ACGTCGATTA CGAGTGGATC AAGAACCGTT ACGACCAGAC CGAATACAAC GGTGCCCTGC CGATGAACTC CAACGGCATC ACCGTCTCGC GCTGGGTCGA CGGGGTCCTC GAGGAAGACG AGCACATCGC TCAGCGCTCC AGCCTCAAGG CGATGTTCTA CTGGGGGCAC GCGGTCAACT CCCAGACCCG CGGCCCGGAA ATGAAAAAGG CCATGAGTCA GCTCGAACTG ATGGTCGTGG TCGATCCCTA CCCCACCGTC GCTGCCGTCA TGCACGACCG CACGGATGGC GTGTACCTGC TGCCCGCGGC CACGCAGTTC GAGACCACCG GTAGCGTGAC CGCCACCAAT CGCTCCCTGC AATGGCGCGA TCAGGTCATC GAACCCATGT TCGATTCCAA GCCCGACCAC GAGATCATGT ATCTCATGGC CCAGAAGCTG GGCTTCGGCG ACGAGTTCAC CCGCAACTTC GCCATCGAGG ACGGCCGTCC GGTCATCGAG GACGTGTTGC GCGAGATCAA CGCGGGCATG TGGACCGTGG GCTATACCGG GCAAAGCCCC GAGCGCCTCA AGGAACACCA GAAGAACTGG CACACCTTCA ACTTCGATAC CCTGAAGGCG GAAGGCGGCC CCTCGGATGG CGACTTCTAC GGCCTGCCGT GGCCGTGCTG GGGCAAGCCC GGCATGAAAC ACCCGGGCTC GCCCAACCTC TACGACATCA GCAAGAGCGT CGCCGAGGGT GGCATGCCGT TCCGCGCCCG CTTCGGCATC GAGCATGAAG GGCAACCACT GCTCGCCGAT GGCTCCTACT CCAAGGATTC GGAACTCAAC GACGGATATC CCGAGTTCAC CTCCGACATG CTCAAGCAGC TGGGCTGGTG GGACGACCTC ACCGCCGAGG AAAAAGCCGC CGCGGAGGGC AAGAACTGGA AGACCGACCT GTCCGGCGGG ATTCAGCGCG TAGCCATCGC GCACGGTTGC GCGCCCTACG GCAACGCCAA GGCCCGCTGC CGGGTCTGGA CCTTCCCGGA TGAAGTGCCC AAGCATCGCG AACCGCTCTA TACCAGCCGG CGCGATCTCG CCGACGAGTA CCCGACCTGG GACGACAAGG CATCGCTGTT CCGTCTGCCG ACGCTGTATC GCTCGATTCA GGAGAAGGAC TTCAGCAAGG AGTTCCCGAT CATCCTCACC TCGGGACGTC TGGTCGAGTA CGAAGGCGGT GGCGAGGAAA CACGCTCCAT GTCGTGGCTC GCCGAGCTGC AGCAGGAGAT GTTCGTCGAG ATCAACCCGG CGCAGGCCAA CGACCTGGGC ATCAAGAACG ACGACATGGT CTGGGTCCAC GGTCCCGAGG GCGGCAAGGT CAAGGTCAAG GCCATGGTGA CGCCGCGTGT CGCACGTGAC GTGGCCTTCA TGCCCTTCCA TTTCGGCGGT GTCTATCAGG GCGAGAGCCT GGCCGACAAG TACCCCGAAG GCGCGCGCCC GTACGTGCTC GGCGAAGCGG CCAATACCGC CACGACCTAC GGCTACGATC CGGTGACCCT GATGCAGGAA ACCAAGTGCA CCCTGTGCCG GATCGAAAAA GCCTCCGCCT GA
|
Protein sequence | MRLTRKSSQP AGQGGLGISR RQFLQRSGVA TGGLAAAGFM GHGMMQAASA KEKTAYSDAP VETKRTICSH CSVGCGVYAE VQEGVWTKQE PAFDHPINRG AHCAKGASLR EHGHSTQRLK YPMKLVDGKW QRLEWDQAIE EIGDKVLELR EQHGPDSVYW LGSAKFSNEQ AYLMRKLASL WGTNNTDHQA RICHSTTVAG VANTWGYGAM TNSLNDMHFS KSILFIGSNP SEAHPVAMQH ILHAKERNQA QIIVVDPRFT RTAAKANKYV RLRPGSDVAY IWGLLWHIFE NGWEDQQFID QRVFGMDEVR REVAQFTPDV VERITGVSEA DMYDVAKRIS ENRPGCVVWC MGGTQHTTGN NNTRAYCILE LALGNIGVSG GGANIFRGHD NVQGATDLGL GSDSLPGYYG LSEGAWQHWA KVWNVDYEWI KNRYDQTEYN GALPMNSNGI TVSRWVDGVL EEDEHIAQRS SLKAMFYWGH AVNSQTRGPE MKKAMSQLEL MVVVDPYPTV AAVMHDRTDG VYLLPAATQF ETTGSVTATN RSLQWRDQVI EPMFDSKPDH EIMYLMAQKL GFGDEFTRNF AIEDGRPVIE DVLREINAGM WTVGYTGQSP ERLKEHQKNW HTFNFDTLKA EGGPSDGDFY GLPWPCWGKP GMKHPGSPNL YDISKSVAEG GMPFRARFGI EHEGQPLLAD GSYSKDSELN DGYPEFTSDM LKQLGWWDDL TAEEKAAAEG KNWKTDLSGG IQRVAIAHGC APYGNAKARC RVWTFPDEVP KHREPLYTSR RDLADEYPTW DDKASLFRLP TLYRSIQEKD FSKEFPIILT SGRLVEYEGG GEETRSMSWL AELQQEMFVE INPAQANDLG IKNDDMVWVH GPEGGKVKVK AMVTPRVARD VAFMPFHFGG VYQGESLADK YPEGARPYVL GEAANTATTY GYDPVTLMQE TKCTLCRIEK ASA
|
| |