Gene Information |
Locus tag | Csal_0959 |
Symbol | |
ID | 4026736 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 1076922 |
End bp | 1078823 |
Gene Length | 1902 bp |
Protein Length | 633 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637966136 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_573015 |
Protein GI | 92113087 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
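The coordinate and length fields in the table above agree with each other, and a little arithmetic confirms it. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive (suggested by the numbers, not stated in the record) and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1076922
end_bp = 1078823

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in codons of three bases each.
codon_count, leftover_bases = divmod(gene_length_bp, 3)

# Assumption: the final codon is a stop codon and encodes no residue.
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1902, matching "Gene Length"
print(leftover_bases)     # 0, so the length is a whole number of codons
print(protein_length_aa)  # 633, matching "Protein Length"
```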
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACATCCC CTCAGTTTCT CTCCGACACC GCGCACGTCG ATGCCTCGGC CATTCGTCCG CTGCCCGGTT CATGCAAGCG ATACGTTCAG GGCTCGCGGC CGGACCTGCG CGTCCCCTTC CGTGAAATCG CCCTGTCGCC GACCACGACC TCCGGCGCCG CCGAAGAGAA TCCACCGCTG CTGGTCTACG ATACCTCGGG GCCGTACACC GATCCCGAGT GCGCCATCGA CCTGCGCAAG GGCCTCCCGG CCCTCAGAAA GGACTGGATC GACGAGCGCG ACGACACTCG ATGGCTCGAC GGCCCCACCA GTCGCTACGG GCAGCGCCGC GCCAACGACC CGAGGCTCGC ACCGCTGCGC TTCGACCTGA CCCGCACGCC GCGGCGAGCC AAGGAAGGTC GCAACGTCAC CCAGCTGCAT TACGCGCGCC AGGGCATCAT CACGCCGGAG ATGGAGTTCA TCGCCATCCG CGAGAATCAG CGTCGTCAGG CGCTGGGTCC CGAGGAAGTC GAACGCATCC TCGGCCACCA GCACCCGGGC CAGGGGTTCG GTGCACGCCT TCCGGAGGAG ATCACGCCGG CATTCGTGCG GGATGAAGTC GCCCGCGGCC GGGCGATCAT CCCCTGCAAC ATCAACCACC CGGAGTCCGA GCCGATGATC ATCGGCCGCA ACTTCCTGGT GAAGATCAAC GGCAACCTCG GCAATTCGGC GGTGACCTCG TCCATCGAGG AAGAGGTCGA CAAGATGACC TGGGGCATCC GCTGGGGCGC GGACACGATC ATGGATCTTT CCACCGGCCA GAACATCCAC GAGACGCGGG AATGGATCAT CCGCAACTCG CCGGTGCCCA TCGGCACGGT GCCGATCTAT CAGGCGCTGG AGAAGGTCGG CGGCGTGGCC GAGAACCTGA CCTGGGAGAT CTTCCGCGAC ACCCTGATCG AGCAGGCGGA ACAGGGCGTG GACTACTTCA CCATCCATGC CGGCGTACGC CTGCATCACG TGCCGATGAC GGCCAGGCGC GTCACCGGCA TCGTCTCCCG GGGCGGCTCG ATCATGGCCA AGTGGTGCCT GTACCACCAC CGGGAAAGCT TCCTCTACAC CCACTTCGAG GACATCTGCG AGATCATGAA GGCCTACGAC GTCGCGTTCT CGCTGGGTGA CGGCCTGCGT CCCGGTTCCA TCGCCGATGC CAACGACGAG GCCCAGTTCG CCGAGCTGGA AACCCTGGGC GAGTTGACGC AGCTCGCCTG GAAGCACGAC GTGCAGGTGA TGATCGAGGG CCCCGGCCAT GTCCCGATGC ACCTGATCAA GGAGAACATG GACAAGCAAC TGGCGTGCTG CGAGGAGGCG CCGTTCTACA CGCTCGGACC GCTGACCACC GATATCGCCC CCGGCTACGA TCACATCACT TCGGGCATCG GCGCGGCACA GATCGGCTGG TACGGCTGTG CCATGCTCTG CTACGTCACC CCCAAGGAGC ACCTGGGCCT GCCCAACAAG GACGACGTCA AGACCGGCAT CATCACCTAC AAGATTGCCG CCCATGCTGC CGACCTCGCC AAGGGGCACC CCGGCGCGCA ACGCCGCGAC AATGCGCTGT CCAAGGCGCG CTTCGAATTT CGCTGGGAAG ATCAGTTCAA CCTGGGGCTC GACCCGGATA CGGCACGCGC CTTCCATGAC GAGACACTGC CCAAGGACTC GGCCAAGGTG GCACACTTCT GCTCGATGTG CGGACCGAAG TTCTGCTCGA TGAAGATCAG CCAGGAAGTG CGCGACGCCG TCTCCCGGGA GGGGGACTGG 
AGCGACCCGC TCGAGAACAC CCAGGAGGCC ATCGAACAAG GCCTGGAGGA ACAGGCCGAA CGCTTCCGCC GCTCGGGAAA AACGCTCTAT CGGAAGGTGT GA
|
Protein sequence | MTSPQFLSDT AHVDASAIRP LPGSCKRYVQ GSRPDLRVPF REIALSPTTT SGAAEENPPL LVYDTSGPYT DPECAIDLRK GLPALRKDWI DERDDTRWLD GPTSRYGQRR ANDPRLAPLR FDLTRTPRRA KEGRNVTQLH YARQGIITPE MEFIAIRENQ RRQALGPEEV ERILGHQHPG QGFGARLPEE ITPAFVRDEV ARGRAIIPCN INHPESEPMI IGRNFLVKIN GNLGNSAVTS SIEEEVDKMT WGIRWGADTI MDLSTGQNIH ETREWIIRNS PVPIGTVPIY QALEKVGGVA ENLTWEIFRD TLIEQAEQGV DYFTIHAGVR LHHVPMTARR VTGIVSRGGS IMAKWCLYHH RESFLYTHFE DICEIMKAYD VAFSLGDGLR PGSIADANDE AQFAELETLG ELTQLAWKHD VQVMIEGPGH VPMHLIKENM DKQLACCEEA PFYTLGPLTT DIAPGYDHIT SGIGAAQIGW YGCAMLCYVT PKEHLGLPNK DDVKTGIITY KIAAHAADLA KGHPGAQRRD NALSKARFEF RWEDQFNLGL DPDTARAFHD ETLPKDSAKV AHFCSMCGPK FCSMKISQEV RDAVSREGDW SDPLENTQEA IEQGLEEQAE RFRRSGKTLY RKV
|
| |
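The sequences above are displayed in blocks of ten characters separated by spaces. Although the record lists the gene on the minus strand, the gene sequence as displayed begins with ATG and ends with TGA, so it appears to be given in coding orientation already. The sketch below, in plain Python with no external libraries, shows one way to strip the block spacing, compute a GC fraction, and translate a fragment. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical and not part of any database tool, and only the first 30 bases of the gene are used for brevity.

```python
# Standard amino-acid assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses these same assignments; it differs from
# table 1 only in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(displayed: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(displayed.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three display blocks of the gene sequence listed above.
fragment = clean_sequence("ATGACATCCC CTCAGTTTCT CTCCGACACC")
peptide = translate(fragment)
print(peptide)  # MTSPQFLSDT, the first ten residues of the listed protein
print(round(gc_fraction(fragment), 3))
```

Passing the full gene sequence through the same helpers should allow the listed GC content and protein sequence to be cross-checked, provided the two display lines of the gene sequence are joined first.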