Gene Hhal_1268 details


Gene Information

Locus tag: Hhal_1268
Symbol:
ID: 4710601
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 1374356
End bp: 1375966
Gene Length: 1611 bp
Protein Length: 536 aa
Translation table: 11
GC content: 69%
IMG OID: 639855741
Product: thiamine biosynthesis protein ThiC
Protein accession: YP_001002845
Protein GI: 121998058
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0422] Thiamine biosynthesis protein ThiC
TIGRFAM ID: [TIGR00190] thiamine biosynthesis protein ThiC
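
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1374356
END_BP = 1375966
GENE_LENGTH_BP = 1611
PROTEIN_LENGTH_AA = 536


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1375966 - 1374356 + 1 gives 1611 bp, and 1611 / 3 - 1 gives 536 aa, matching the listed values.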


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.438746
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATTCCG CAACCACCCG TCCGCCGACC CCGGACCAGA CCCGCGACAC CAGCACCTGC 
CCGCTCAGCC AGGCTGTCCG AGAGCGCGCC TGCGCTGCGG CCGGCCCGTT CCCGGCCTCG
CATCGGGTCT ACCTGCAGGG AAGCCGGGAT GATCTGCGGG TGGCCCTGCG AGAGATCGTC
CAGAGCCCAA CCCAGACCCC GAACGGACCG CGCGAGAATC CGCCGCTGCG GGTCTACGAC
ACCGGCGGCC CGTACACGGA CCCCGAGGCG GAGCTCGACC CGGCGCGGGG GATCCACCCG
ATCCGTGCCC CCTGGATCGC CGACCGCGGC ACCGACGCGT GCACCCAGCT CGACTACGCG
CGGGCTGGAG TGACCACCCC GGAGATGGAG TACATCGCCC TGCGTGAGGG CCTGTCGGCC
GAGTTCGTGC GCGATGAAGT CGCCCGCGGG CGGGCCGTGA TCCCGGCGAA CCACTGCCAC
CCGGAGGCCG AGCCAATGGC CATCGGCCGC CATCTCAAGG TCAAGGTGAA CGCCAATATC
GGCAACTCCC CGCTGAGCTC CGGCATCCCG GAGGAGCTGG AGAAGCTCAT CCACGCCGTG
CGCTGGGGGG CCGATACGGT CATGGACCTA TCCACCGGCG AGGCCATCCA CGAGACCCGC
GCCTGGCTGC TCCGCAACTC GCCGGTGCCG ATCGGCACCG TACCGATCTA CCAGGCGCTC
GAGAAGGCCG GCCGCCCCGA AGACCTGACC TGGGAGGTGT TCCGCGACAC CCTCATCGAA
CAGGCCGAAC AGGGCGTGGA CTACTTCACC ATCCACGCCG GCGTGCGGCA TGGCCACGTC
GATCTGGCCC GCGGGCGGTT GACCGGAATC GTCTCCCGCG GCGGGTCGAT CATGGCCAAG
TGGTGCAGCC ACCACCAGGC CGAGAGCTTC CTCTACGAAC GTTTCGATGA GATCTGCGAG
ATCCTGGCCC GCTACGACAT CACCGTCTCG CTCGGTGACG GCCTGCGCCC TGGTTCTGTC
GCCGATGCCT CCGACGAGGC GCAGCTGGCC GAGCTGCGCA CCCTGGGCGA ACTCACGGAA
CGCGCCTGGG CGCGCGGCGT GCAGGTCATT ATCGAGGGGC CGGGTCACAT CCCCATGAAC
CAGATCGAGG AGAACATGCG GCTACAGAGC GAGGCCTGCC TCGAGGCGCC GTTCTACACC
CTCGGCCCCA TCGTGACCGA CATCGCTCCC GGCTACGACC ACATCACCTC GGCCATCGGC
GCCGCCCAGA TCGGCTGGCA CGGCACAGCG ATGCTCTGCT ACGTGACCCC CAAGGAGCAC
CTGGGCCTGC CCAATGCCGA CGATGTGCGC ACCGGCATCG TCACGTACAA GGCGGCGGCC
CATGCGGCCG ACGTGGCCCG CGGGCACCCC GGGGCACGCG ATCGTGACGA TGCCCTCTCG
CGCGCCCGTT ACGAGTTCCG CTGGGAGGAT CAGTTCAACC TCTCCCTGGA TCCGGAACGG
GCCCGCGCCT ATCACGACGA GACCCTGCCC AAGGCGTCCC ATAAGGAGGC GGCGTTCTGC
TCGATGTGCG GGCCCCGCCA CTGCGCCATG GCGATCAGTC AGGAACTCTG A
 
Protein sequence
MDSATTRPPT PDQTRDTSTC PLSQAVRERA CAAAGPFPAS HRVYLQGSRD DLRVALREIV 
QSPTQTPNGP RENPPLRVYD TGGPYTDPEA ELDPARGIHP IRAPWIADRG TDACTQLDYA
RAGVTTPEME YIALREGLSA EFVRDEVARG RAVIPANHCH PEAEPMAIGR HLKVKVNANI
GNSPLSSGIP EELEKLIHAV RWGADTVMDL STGEAIHETR AWLLRNSPVP IGTVPIYQAL
EKAGRPEDLT WEVFRDTLIE QAEQGVDYFT IHAGVRHGHV DLARGRLTGI VSRGGSIMAK
WCSHHQAESF LYERFDEICE ILARYDITVS LGDGLRPGSV ADASDEAQLA ELRTLGELTE
RAWARGVQVI IEGPGHIPMN QIEENMRLQS EACLEAPFYT LGPIVTDIAP GYDHITSAIG
AAQIGWHGTA MLCYVTPKEH LGLPNADDVR TGIVTYKAAA HAADVARGHP GARDRDDALS
RARYEFRWED QFNLSLDPER ARAYHDETLP KASHKEAAFC SMCGPRHCAM AISQEL
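
The relationship between the gene sequence and the protein sequence, and the GC content figure, can be reproduced with a short script. This is a rough sketch, not the database's own pipeline: it assumes the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code (table 11 differs only in its permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
# Codon table built from the conventional TCAG ordering of the three
# codon positions; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First 30 bases of the gene sequence as displayed in this record.
    print(translate("ATGGATTCCG CAACCACCCG TCCGCCGACC"))  # MDSATTRPPT
```

Pasting the full gene sequence into `translate` and `gc_percent` should allow a comparison against the listed protein sequence and the reported GC content.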