Gene Hhal_1892 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1892 
Symbol 
ID4710690 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp2079079 
End bp2080467 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content64% 
IMG OID639856365 
ProductThiS, thiamine-biosynthesis 
Protein accessionYP_001003458 
Protein GI121998671 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGATGC AAATAGAGCG GGGGGCCCGA AGTCGGTCCT CTGCGGCCAG TTGCGCAGAT 
CGCCCCGGGC GGGGTACTCC GCGCCGGCTC CAACAGTGCT CGCTCGGTGT CGTTACAGCG
CTAACCATGG CGGTGTCGTC CAGTGTTTCC GCTGCGCCCG GATTGGTCAC GACGCCCGGC
ATGGGCCTCG GCGACGGCTC CCACCCAGCG ACGCTGCACA CGGTTGCGGG TAACCCCGCA
GGGGCGGCAG CAGCGGATCG GGTCGGTGTC CAGTTTGGGC TTGGCAGCGT AGGCGTGGGC
TATGAACTCG GGCCGGTTGA TGGGCTGGTC GATGAAATCG ACGACATCAT CGACATCCTT
GACCGGGATG ATCTGGGCCC GAGCGAGGCG GATGACCTGA TTGAACGTAC CGAGGGCGTC
CTCGCTCAGC TGGGGCAGGA CGGTTGGGGG AAGCTCCAGT TTGGTGGCCG TCCGCCGCTT
GCCCCGCTGG TTGTTGGCAG TGCCCGCTCC GGGTGGTCGG TGGCTCTGGA CGCTGAAGCC
ACGGGCCACC TCGGGTTCAG CATACTCGAT GACGAGTTGC GGTTTGTCCG CGAGACCGAG
CAAATCCAGA GCAATACCGC GGCATACCTG AAAGGCGGCG GTATCCTCCG GCTCTCTGCA
GCCCCGAGCC TGCGCGTTGC AGAGTGGGAG GGAGGGCGCG AGCTGTTCGT CGGTGCTCGG
GTCAGCCATT ACCAGGCGGA ACTCTCCAAG GCGGTGGTCG CCCTTGCCGA GGATACGAAC
CGGGATTTCG GGGATATCGT CGAGGATGAG ATCGACCGCC AGCAGGAGAC CTCCAGCGCG
GTTGGCCTCG ACCTGGGTGT TATGTACCAG ACCCGTTTCT TCCGTGCCGG CGGCGCCTGG
AAGAACATCA ATGAACCCAC CTTCGACTTT CCGGCAGTTG GTACCGATTG CTCCGGGGAA
GCGGACCCGG ATCTCCAGGC CAACTGCCTC ACGGCGGCCA ATTTCTCAGA CCGGATTACG
CGCGAGGAGG CGTTTCGTCT CAACGAGCAG GTGACTCTGG AAGGCGCTCT CCACGATCCC
GCCCAGCGGC TCGTCCTGGC CGCCAGTTAC GATGCGAATA CCGTTCGGGA TATCAGCGGC
GATGAGTACC AGTGGCTGGC GTTCAGCCTC TCCTACCGCA TGCCGTGGTA TCTGAAGTGG
GTTCCCGATC TCCGCGTGGG CTACCGGGAG AATATGAGCG GTTCGGAGCT GAGCTACACC
ACTGCCGGCC TGACCTGGCT GGGTGCGGTG ACCCTGGACG TGGCCGTCGC GGATCAGGAC
CTGGAACACG ACGGCGAATC GATCCCGCGT AGCGCGATGG CACACCTGGG CTTTCAATTG
CGCTTCTGA
 
Protein sequence
MSMQIERGAR SRSSAASCAD RPGRGTPRRL QQCSLGVVTA LTMAVSSSVS AAPGLVTTPG 
MGLGDGSHPA TLHTVAGNPA GAAAADRVGV QFGLGSVGVG YELGPVDGLV DEIDDIIDIL
DRDDLGPSEA DDLIERTEGV LAQLGQDGWG KLQFGGRPPL APLVVGSARS GWSVALDAEA
TGHLGFSILD DELRFVRETE QIQSNTAAYL KGGGILRLSA APSLRVAEWE GGRELFVGAR
VSHYQAELSK AVVALAEDTN RDFGDIVEDE IDRQQETSSA VGLDLGVMYQ TRFFRAGGAW
KNINEPTFDF PAVGTDCSGE ADPDLQANCL TAANFSDRIT REEAFRLNEQ VTLEGALHDP
AQRLVLAASY DANTVRDISG DEYQWLAFSL SYRMPWYLKW VPDLRVGYRE NMSGSELSYT
TAGLTWLGAV TLDVAVADQD LEHDGESIPR SAMAHLGFQL RF