Gene Hlac_0407 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0407 
Symbol 
ID7401024 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp423835 
End bp425229 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content65% 
IMG OID643707471 
Productprotein of unknown function DUF21 
Protein accessionYP_002565080 
Protein GI222478843 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.426667 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCTTGT CGTCTAGCGT GCCGGTGGCC GACCTCCTCC AGATACCGAT GCCCGGCGAC 
GGCGTCGTGC TCGCTCTCGG CGTCGCCGCG ATCCTCTTTT TGATCGGGCT GTCGGCGTTC
TTCTCCTCGT CGGAGATCGC GATGTTCTCG CTACCGCAAC ACCGCGTCGA CAGCCTCGTC
GACGAGGGCG TAAAAGGGGC AGAGACGATC CGCGGCATGA AACAGAACCC CCATCGCCTG
TTGGTGACGA TCCTCGTCGG CAACAACATC GTCAACGTGG CGATGACCTC CATCGCGACC
GCGCTGTTCG GGATCTACCT CTCGCGGGGG GAGTCGGTGC TGGCGACGAC GTTCGGCATC
ACGACGCTCG TGTTGATCTT CGGCGAGAGC GCGCCGAAGT CGTACGCCGT CGAGAACACC
GAGTCGTGGG CGCTCCGGAT CGCCCGCCCG CTGAAGCTCT CCGAGTACGC GTTGTACCCG
CTCGTCGTCC TCTTCGATTA CATCGTCAAG GGTATCAACA AGATCATCGG TGGCTCGGCC
GCCATCGAGT CGACGTACGT CACCCGTGAC GAGATCCAAG ACATCATCGA GACGGGCGAA
CGCGAGGGCG TCATCGAGGA GGAGGAACGC GAGATGCTCG ACCGCATCTT CCGATTCAAC
AACACCATCG CCAAGGAGGT GATGACGCCC CGTCTCGACG TCACCGCGGT GGCGAAGGAG
TCCTCGGTCG AGGAGGCGAT CGAGACGTGC ATCCAAGCGG ACCACGAGCG CGTCCCCGTC
TACGAGGGGA ACCTCGACAA CATCATCGGC GTGGTGACCG TCCGGGATCT CGTCCGCGAA
CTGCGCTACT CCGAGGGTGA GCCGTCGCTG GAGCGCGTCG TGAAGCCGAC GCTGCACGTC
CCCGAGTCGA AGAACGCGGA CGAGCTGCTC GCGGAGATGC AGGACAACCG CCTCCAGATG
GTCACCGTCA TCGACGAGTT CGGGACCACG GAGGGGATCA TCACCTTAGA GGACATGGTC
GAGGAGATCG TCGGCGAGAT CTTGGAGGGC GACGAGGAGG CTCCGGTGGA GTTCTTAGAA
GACAACGTCG CCGTCGTGCA GGGCGAGGTA AACATCGACG AGGTCAACGA GATGCTCGGG
ATCGACCTCC CCGAGGGCGA GGAGTTCGAG ACGCTCGCCG GCTTCGTGTT CAACCGCGCC
GGGCGCCTCG TCGAGGAGGG CGAGGAGATC GAGTTCGACG AGATCCGGAT CCGGATCGAG
CGCGTGGACA ACACCCGGAT CATGTCCGCG CGGGTCACCG TGCTCGACGG CGCGGAGGCG
GCCGACGTGG TCGCCGAGGA CGACGCGCTC GAGTCGAGCG GCGAGCCCGA GGCGCCTCCG
AACGACGCGG AGTGA
 
Protein sequence
MGLSSSVPVA DLLQIPMPGD GVVLALGVAA ILFLIGLSAF FSSSEIAMFS LPQHRVDSLV 
DEGVKGAETI RGMKQNPHRL LVTILVGNNI VNVAMTSIAT ALFGIYLSRG ESVLATTFGI
TTLVLIFGES APKSYAVENT ESWALRIARP LKLSEYALYP LVVLFDYIVK GINKIIGGSA
AIESTYVTRD EIQDIIETGE REGVIEEEER EMLDRIFRFN NTIAKEVMTP RLDVTAVAKE
SSVEEAIETC IQADHERVPV YEGNLDNIIG VVTVRDLVRE LRYSEGEPSL ERVVKPTLHV
PESKNADELL AEMQDNRLQM VTVIDEFGTT EGIITLEDMV EEIVGEILEG DEEAPVEFLE
DNVAVVQGEV NIDEVNEMLG IDLPEGEEFE TLAGFVFNRA GRLVEEGEEI EFDEIRIRIE
RVDNTRIMSA RVTVLDGAEA ADVVAEDDAL ESSGEPEAPP NDAE