Gene Hhal_1497 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1497 
Symbol 
ID4709150 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1617924 
End bp1619225 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content64% 
IMG OID639855964 
Producthypothetical protein 
Protein accessionYP_001003066 
Protein GI121998279 
COG category[I] Lipid transport and metabolism 
COG ID[COG0439] Biotin carboxylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGATC GCAAGCATAT CTACGTCGTC GGACTCGACG ATTTCCACCT TGAACACCTC 
AAGACCGTGC GCGGCAGCGA GGGGTACCTA TTCCATCCCC TGGCCGAATA CAGCGCCATC
GTGCTCCCTG AGCGCTACGA TATCCCGGCC ATCCTCGATC ATGCGCGTCA AACGCTCGAT
CTGGCCCCGC GCGTGGACGC GATCATCGGC CACTGGGACT TTCCCACGAC TTCAATCGTC
CCCGTCCTAC GCCGGGAGTA CGGCCTTCCC ACGCCGTCCC TGGAGAGCAT CCTGCTTTGC
GAGAACAAGT ATTGGAACCG CCTGGCCTGT GAAGCGTCCG TGCCGGAGTG CACCCCTCCG
TTCCAGGCCA TCGATCCCTA CGGCGAGGAT CCGCTCGGCG CCCTCAAGCT CGGCTACCCG
TTCTGGCTGA AACCCGCGGT GGCCTTCTCG TCCCTGCTCG GTTTCCGCGT CGAGGACGAC
GCGCAGTTCC AGGATGCCAT CGCTGCCATC GCCCAGGGCA TCCCCACCTT CGCCGAGCCG
TTCCAGGCGT TCACCGACCT TGTTGAAAAC CCGAAACGCC TGCCGCGGAC GGGGAGCGGC
GCCACGGCCC TGGCCGAGGG CATCATCCAG GGCCGCCTCT GCACGCTCGA AGGCTACGTC
TACAACGGCG AGGTGGTGAC CTACGCCATC CTCGACTCCC TGCGCGGGGC CAACCAGGTC
AGCTTCGTCA GCTACCAGTA CCCGTCCAGC CTGCCGATAC CAGTCCAGGA GCGGATGAAG
GACTACGCGC GCCGCCTGCT GACGCACATC GGCCTCGATC AGACCGCGTT CAACATGGAG
TTCTTCTGGG ACGAGGATGT CGACAAGATC TGGTTGCTTG AGGTCAACCC ACGGATCTCC
AAGTCGCACT GCCCGATCTT CGAGATCGCC ACGGGCAGCT CCCACCACGA GGTCGCCATC
GACGTAAGCC TGGGCCGTCG ACCCCAATTT CCTCGCGCCG AGGGGCGCTT CCCCATGGCG
GCGAAATTCA TGCCGCGGGT GTACGGCGAC GCTCGGGTCC TGCGCATACC CGACCCGGCG
CAGATCCACG CTCTGCAGCT CACCCACCCG GAACTCTCCA TCCACATTGC AGTGGCCGAG
GGTATGCAGC TCTCCGAACT ACGGGGCCAG GACAGCTACA GCTACGAGAT CGCCGAGCTG
TTTATCGGCG GCGAGGACGA ACAGCACCTC CACGACAAGT TCCGGACGAT CATGAGGCAG
CTCGACTTCC GCTTCTCCGC GCCGCTGCCA ACCAACTACT GA
 
Protein sequence
MDDRKHIYVV GLDDFHLEHL KTVRGSEGYL FHPLAEYSAI VLPERYDIPA ILDHARQTLD 
LAPRVDAIIG HWDFPTTSIV PVLRREYGLP TPSLESILLC ENKYWNRLAC EASVPECTPP
FQAIDPYGED PLGALKLGYP FWLKPAVAFS SLLGFRVEDD AQFQDAIAAI AQGIPTFAEP
FQAFTDLVEN PKRLPRTGSG ATALAEGIIQ GRLCTLEGYV YNGEVVTYAI LDSLRGANQV
SFVSYQYPSS LPIPVQERMK DYARRLLTHI GLDQTAFNME FFWDEDVDKI WLLEVNPRIS
KSHCPIFEIA TGSSHHEVAI DVSLGRRPQF PRAEGRFPMA AKFMPRVYGD ARVLRIPDPA
QIHALQLTHP ELSIHIAVAE GMQLSELRGQ DSYSYEIAEL FIGGEDEQHL HDKFRTIMRQ
LDFRFSAPLP TNY