Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_1268 |
Symbol | |
ID | 4710601 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 1374356 |
End bp | 1375966 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639855741 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_001002845 |
Protein GI | 121998058 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.438746 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATTCCG CAACCACCCG TCCGCCGACC CCGGACCAGA CCCGCGACAC CAGCACCTGC CCGCTCAGCC AGGCTGTCCG AGAGCGCGCC TGCGCTGCGG CCGGCCCGTT CCCGGCCTCG CATCGGGTCT ACCTGCAGGG AAGCCGGGAT GATCTGCGGG TGGCCCTGCG AGAGATCGTC CAGAGCCCAA CCCAGACCCC GAACGGACCG CGCGAGAATC CGCCGCTGCG GGTCTACGAC ACCGGCGGCC CGTACACGGA CCCCGAGGCG GAGCTCGACC CGGCGCGGGG GATCCACCCG ATCCGTGCCC CCTGGATCGC CGACCGCGGC ACCGACGCGT GCACCCAGCT CGACTACGCG CGGGCTGGAG TGACCACCCC GGAGATGGAG TACATCGCCC TGCGTGAGGG CCTGTCGGCC GAGTTCGTGC GCGATGAAGT CGCCCGCGGG CGGGCCGTGA TCCCGGCGAA CCACTGCCAC CCGGAGGCCG AGCCAATGGC CATCGGCCGC CATCTCAAGG TCAAGGTGAA CGCCAATATC GGCAACTCCC CGCTGAGCTC CGGCATCCCG GAGGAGCTGG AGAAGCTCAT CCACGCCGTG CGCTGGGGGG CCGATACGGT CATGGACCTA TCCACCGGCG AGGCCATCCA CGAGACCCGC GCCTGGCTGC TCCGCAACTC GCCGGTGCCG ATCGGCACCG TACCGATCTA CCAGGCGCTC GAGAAGGCCG GCCGCCCCGA AGACCTGACC TGGGAGGTGT TCCGCGACAC CCTCATCGAA CAGGCCGAAC AGGGCGTGGA CTACTTCACC ATCCACGCCG GCGTGCGGCA TGGCCACGTC GATCTGGCCC GCGGGCGGTT GACCGGAATC GTCTCCCGCG GCGGGTCGAT CATGGCCAAG TGGTGCAGCC ACCACCAGGC CGAGAGCTTC CTCTACGAAC GTTTCGATGA GATCTGCGAG ATCCTGGCCC GCTACGACAT CACCGTCTCG CTCGGTGACG GCCTGCGCCC TGGTTCTGTC GCCGATGCCT CCGACGAGGC GCAGCTGGCC GAGCTGCGCA CCCTGGGCGA ACTCACGGAA CGCGCCTGGG CGCGCGGCGT GCAGGTCATT ATCGAGGGGC CGGGTCACAT CCCCATGAAC CAGATCGAGG AGAACATGCG GCTACAGAGC GAGGCCTGCC TCGAGGCGCC GTTCTACACC CTCGGCCCCA TCGTGACCGA CATCGCTCCC GGCTACGACC ACATCACCTC GGCCATCGGC GCCGCCCAGA TCGGCTGGCA CGGCACAGCG ATGCTCTGCT ACGTGACCCC CAAGGAGCAC CTGGGCCTGC CCAATGCCGA CGATGTGCGC ACCGGCATCG TCACGTACAA GGCGGCGGCC CATGCGGCCG ACGTGGCCCG CGGGCACCCC GGGGCACGCG ATCGTGACGA TGCCCTCTCG CGCGCCCGTT ACGAGTTCCG CTGGGAGGAT CAGTTCAACC TCTCCCTGGA TCCGGAACGG GCCCGCGCCT ATCACGACGA GACCCTGCCC AAGGCGTCCC ATAAGGAGGC GGCGTTCTGC TCGATGTGCG GGCCCCGCCA CTGCGCCATG GCGATCAGTC AGGAACTCTG A
|
Protein sequence | MDSATTRPPT PDQTRDTSTC PLSQAVRERA CAAAGPFPAS HRVYLQGSRD DLRVALREIV QSPTQTPNGP RENPPLRVYD TGGPYTDPEA ELDPARGIHP IRAPWIADRG TDACTQLDYA RAGVTTPEME YIALREGLSA EFVRDEVARG RAVIPANHCH PEAEPMAIGR HLKVKVNANI GNSPLSSGIP EELEKLIHAV RWGADTVMDL STGEAIHETR AWLLRNSPVP IGTVPIYQAL EKAGRPEDLT WEVFRDTLIE QAEQGVDYFT IHAGVRHGHV DLARGRLTGI VSRGGSIMAK WCSHHQAESF LYERFDEICE ILARYDITVS LGDGLRPGSV ADASDEAQLA ELRTLGELTE RAWARGVQVI IEGPGHIPMN QIEENMRLQS EACLEAPFYT LGPIVTDIAP GYDHITSAIG AAQIGWHGTA MLCYVTPKEH LGLPNADDVR TGIVTYKAAA HAADVARGHP GARDRDDALS RARYEFRWED QFNLSLDPER ARAYHDETLP KASHKEAAFC SMCGPRHCAM AISQEL
|
| |