Gene Hlac_1100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1100 
Symbol 
ID7400172 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1105107 
End bp1106366 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content68% 
IMG OID643708166 
ProductL-carnitine dehydratase/bile acid-inducible protein F 
Protein accessionYP_002565765 
Protein GI222479528 
COG category[C] Energy production and conversion 
COG ID[COG1804] Predicted acyl-CoA transferases/carnitine dehydratase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCGCG ACGCGGCCAC CGACGGCGGG ACCGCTTCCG ACGACGCGTC GGCCGACGAT 
GACTCCCCCG CCGGCCCCCT CGACGGCGTG ACCGTGATCG AGGCCGGGTC GATGATCTCG
ATCGGGACGG TCGGACGCCT ACTCGCGGAC TTCGGCGCGG ACGTGATCAA GGTCGAACAC
CCGGAGACCG GCGACCACCT GCGCCACTTC GGTCCCCAGA AAGAAGGCGT CGGGCTCTGG
TGGAAGTACC TCGGTCGCAA CAAGCAGTCG GTGACCCTCG ACATCTCCAC CGAGGAGGGG
AAAGTCGTCT TCGAGGACCT CGTCGCCGAG GCCGACGCGC TCATCGAGAA CTTCCGCCCG
GGCACCTTAG AGCGGTGGGG ACTCGGCTAT GATCACCTCT CCGATCTCAA CTCCGGGCTC
GTGATGCTCC GGCTGAGCGG GTTCGGGCAG ACCGGCCCGT ACAGCGACCG CCCCGGATTC
GGCACACTCG CCGAGGCGAT GTCGGGGTTC GCGTACCTCA ACGGCTACCC CGATCAGGAG
CCGCTGTTGC CGCCGACCGG GCTGGCAGAC GGGATCGCAG CGATGTTCTC CACGATGGCG
GTCGCGTTCG CGCTGTACAA CCGCGACGCG AACGGCGGGA CCGGCCAGTA CATCGACACG
AGCCTCATCG AGCCGATCTT CTCGCTCATC GGTCCCCAGC CGCTCCGCTA CCAGCAGCTC
GACGAGATCG AAACGCGGTC GGGGAACCGC TCGACGTCGT CCGCGCCGCG GAACGTGTAC
CAGACGGGCG ACGGACGGGC GGTCGCCATC TCGGCGAGCG CGCAGCCGAT CGCGATGCGG
GTGTTCGACG CGATCGAGCG GCCAGATTTA AAAGACGATC CCCGCTTCGC GGACAACGAA
AAGCGGCTGG AGAACGTCGA GGCGCTCGAC GCGGCCATCC AAGACTGGAT GGACGACCAC
ACCCGCGAGG CGGTCATCGA CCGTTTCGAG GAGTACGAGG CGACGATCGC CCCGATCTAC
AACGTCGCCG ATATCCTCGC AGACGAGCAC TACCAGGCCC GCGACGCGGT CGTGGAGGTC
CCGGACGACC AGCTCGGCGC CGGTGCGGTC CAGAACACCG TGCCCCGCTT CTCGGAGACG
CCGGGGGAGA TCACCCACCT CGGTCCGCAG CTCGGCGCGC ACAACGAGGC GGTGTACGGC
GAGCGCCTGT CGTACGACGA CGAGACGCTT GCGGAGCTCG ACTCGGAGGG CGTGATATGA
 
Protein sequence
MSRDAATDGG TASDDASADD DSPAGPLDGV TVIEAGSMIS IGTVGRLLAD FGADVIKVEH 
PETGDHLRHF GPQKEGVGLW WKYLGRNKQS VTLDISTEEG KVVFEDLVAE ADALIENFRP
GTLERWGLGY DHLSDLNSGL VMLRLSGFGQ TGPYSDRPGF GTLAEAMSGF AYLNGYPDQE
PLLPPTGLAD GIAAMFSTMA VAFALYNRDA NGGTGQYIDT SLIEPIFSLI GPQPLRYQQL
DEIETRSGNR STSSAPRNVY QTGDGRAVAI SASAQPIAMR VFDAIERPDL KDDPRFADNE
KRLENVEALD AAIQDWMDDH TREAVIDRFE EYEATIAPIY NVADILADEH YQARDAVVEV
PDDQLGAGAV QNTVPRFSET PGEITHLGPQ LGAHNEAVYG ERLSYDDETL AELDSEGVI