Gene Hlac_0400 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0400 
Symbol 
ID7401017 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp418413 
End bp419930 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content70% 
IMG OID643707464 
Productprotein of unknown function DUF58 
Protein accessionYP_002565073 
Protein GI222478836 
COG category[R] General function prediction only 
COG ID[COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) 
TIGRFAM ID[TIGR01451] conserved repeat domain 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGCCCG ATATCGCCGC TCTCCGTCGC TGTCTCGGTG GTTCAGAGCG TCCCGAACGA 
ACCGCAGTCG ACGGGGGATC CCGGTGGGGC TCCGCGGCCA GAACCGACGG CGGGGCCACA
GCCGAACCGG CGTCCGAGAC GAACCGACCG TCAGCAGTTC GGTGGACCGG CCGCTGGCGG
GGGATCGCCG CCGTCACGCT TCTCGCGGTG GCGATCGGCG TCCTCGCGAA ACGGCCTCCG
TTGTTGCTGG TCGGCGCTCT CACCGGCGCG TACGTCGCCT ACCCGCGTCT CACCGCGACT
CCCAACCCCG ATCTGTCCGT CCGACGGGAA ATAGATCCGG CGTCGCCGGC GGACGGGGAG
TGGGTGTCCG TGCGGACGAC GATCACGAAC GAGGGAGACT CAGCGCTCGC CGACGTCCGA
GTCGTCGACG GACCCCCGGC GATGCTCTCG GTCTCGGACG GATCGCCGCG ATGTGCGACG
GCGCTGCTCC CGGGTGGTGA GGTGACGATA CGCTACGAGC TCCGGGCGCG CCCGGGTCGT
CACGCGTTCC AGCCGACGAC CGTGCTCTGT CGCGACGCGA GCGGGTCGGT AGAGGTCGAA
CTCTCACTCA CTGCGGCCAG TTCCTTCGAG TGCGAAGCGG AGATCCCGAC CGTCCCGCTA
CGCGCCCAGA CCGGTCACCA TCCGGGCCCG CTGGTCACTG ACGACGGGGG CGAGGGGATC
GAGTTCCACT CCGTCGAGGA GTACCGGCGC GGGGACCCCG CGAACCGGAT CGACTGGCGA
CGATACGCGC GGACGGGCGA ACTCACCTCG ATGACGTTCC GCACCGAACG GCTCGCAGAG
GTAGTGGTGT GCGTGGATGC ACGACCCGCG TCGTATCGAG CCGCGGACGC GACCGAACCC
CACGCGGTCG CGCTCGCTGT CGACGCCGCC GGCCGAATCG GCGACGCGCT GTTCGACGCG
AATCACCAGG TCGGCCTCGC CGGGTTCGGA CGGCACGTTT GCGTGCTTCC GACCCGGAAC
GGTCCCGATC ACGCCAGCCG GTTCCACCGT CGGCTGGCGA CGGATCCGGC CTTCGGGCTG
GATCCGCCGG AAACGGCACG CACGACCGAT CGTGAGGGCG GAACAGCTCT GCCCGGATCC
GTCGCTGGGG ACGGACCGGA CGGCGGTTGG GTGACGAGAG GTGATGGGGC CGACGGGCCC
GACGGGGCCG TCCCGGTCGA CACCCAGCTG TCCAGAATTC GGGCCGAGAT CGGGGCGAAC
ACACAGGTCG TGTTGATCAC GCCGCTGTGT GACGACGAGG CCTCCCGCAT CGCCCAGCGG
TTCGAGAGCG GTGGGACCGC CGTCAGGGTT GTCAGTCCGG ACGTCACCAC GACGGAGACC
GCGGGGGGAC GGCTCGCCCG GCTGGAGCGA ACGCATCGGC TGAGCATGCT ACGGAACGCC
GGTATTCCCG TCGTCGACTG GACACCCACC CAACCGCTGG GCGCGGCGAT GGCGGCCGTC
GAGGGGTGGG GTCGATGA
 
Protein sequence
MTPDIAALRR CLGGSERPER TAVDGGSRWG SAARTDGGAT AEPASETNRP SAVRWTGRWR 
GIAAVTLLAV AIGVLAKRPP LLLVGALTGA YVAYPRLTAT PNPDLSVRRE IDPASPADGE
WVSVRTTITN EGDSALADVR VVDGPPAMLS VSDGSPRCAT ALLPGGEVTI RYELRARPGR
HAFQPTTVLC RDASGSVEVE LSLTAASSFE CEAEIPTVPL RAQTGHHPGP LVTDDGGEGI
EFHSVEEYRR GDPANRIDWR RYARTGELTS MTFRTERLAE VVVCVDARPA SYRAADATEP
HAVALAVDAA GRIGDALFDA NHQVGLAGFG RHVCVLPTRN GPDHASRFHR RLATDPAFGL
DPPETARTTD REGGTALPGS VAGDGPDGGW VTRGDGADGP DGAVPVDTQL SRIRAEIGAN
TQVVLITPLC DDEASRIAQR FESGGTAVRV VSPDVTTTET AGGRLARLER THRLSMLRNA
GIPVVDWTPT QPLGAAMAAV EGWGR