Gene Hlac_0221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0221 
Symbol 
ID7402150 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp238970 
End bp240190 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content71% 
IMG OID643707284 
Productacyl-CoA dehydrogenase domain protein 
Protein accessionYP_002564896 
Protein GI222478659 
COG category[I] Lipid transport and metabolism 
COG ID[COG1960] Acyl-CoA dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.338225 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0253718 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGTACG ACGATATCGG TCGCGGGCCG GAGATCGCCG AGGCCGTCAG GGAGTTCGTC 
GACGAGACGG TGCTTCCGGT CGAGCGGGAG TGGCTCGGCC GAGGACCGAT CCCGCCGGCG
GACATAGAGG CGCTGCGCGA CGCCGCGCGC GACGAGGGGA TCTACGCCCC GCAGGTCGCC
GAGGAGTACG GCGGGCTCGG ACTCGGGTTC CGCGAGATGC TGCCGGTGTT CGAAGAGGCG
GGCCGGAGCC TGCTCGGGCC GACGGCGCTC CGGTGTGCGG CGCCCGACGA GGGGAACATG
CACACCCTCG AGATCGCGGC CACCGACGCA CAGAAGGAGC GCTGGCTCCG CCCGTTGGCG
GCCGCGGAGA TCGACTCGGG GTTCGCGATG ACCGAGCCGA TGCAGGGCGG GGGGTCGGAC
CCGAAAATGC TGGCGACGAC CGCCGAGAAA GACGGCGACG AGTGGGTCAT CGACGGCCAC
AAGTGGTGGA CGACCGGCGG CGTCGAGGCG AACCTCCTCC TCGTGTTCGC CCGCACCGAT
CAGGAGGCGC ATCCGTACGC CGGTTGTTCG GTTATCCTCG TGCCCGCCGA CGCCGACGGC
GTCGAGGTCG TTCGGAACAT CCCGCACCTC GGCGAGGGGC TGGTCGGGAC GACGCACGCC
GAGATCCGGT TCGACGACGT GCGCGTGCCG GTCGAGAACA CGCTCGGCGA GGAGAACGAG
GGGTTCACCC TCGTCCAACA GCGGCTGGGT CCGGCCCGGC TCACCCACTG CATGCGGTAC
GCCGGGATGG CCGATCGCGC GCTCGACATC GCGACCGCCT ACCTCTCCGA GCGGGAGGGG
TTCGGCGAAC CGCTCTCGGA GAAGCAGGGG CCGCGGTTCC GGATCGCCGA CCGCCGCACC
GAGCTCCACG CCGCGCGCAC GATGGTCCGG CACGCCGCCG GGCGGATCGC CGACGGTCAC
GAGGCGCGCA TCGAGGTCGC GATGGCAAAG ACGTTCGCGG CGAACGTGAC GCAGGAGGCG
ATCGACGACG CGCTCCAGTT CTGCGGCGGC AACGGGATCG CGTACGACCT GCCGATCGCG
CGCTTCCACG AGAACGTCCG GCAGTTCCGC CTCGTCGACG GCGCCGACGA GGTCCACCGC
CGGTCGATCG CGCGGGACGC CTTCGAGGAC CCGCCGGCCG AGGAGCTTGA GACCGTCACG
CGGTTCGGCG AGTTCGACTA A
 
Protein sequence
MEYDDIGRGP EIAEAVREFV DETVLPVERE WLGRGPIPPA DIEALRDAAR DEGIYAPQVA 
EEYGGLGLGF REMLPVFEEA GRSLLGPTAL RCAAPDEGNM HTLEIAATDA QKERWLRPLA
AAEIDSGFAM TEPMQGGGSD PKMLATTAEK DGDEWVIDGH KWWTTGGVEA NLLLVFARTD
QEAHPYAGCS VILVPADADG VEVVRNIPHL GEGLVGTTHA EIRFDDVRVP VENTLGEENE
GFTLVQQRLG PARLTHCMRY AGMADRALDI ATAYLSEREG FGEPLSEKQG PRFRIADRRT
ELHAARTMVR HAAGRIADGH EARIEVAMAK TFAANVTQEA IDDALQFCGG NGIAYDLPIA
RFHENVRQFR LVDGADEVHR RSIARDAFED PPAEELETVT RFGEFD