Gene Hlac_0983 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0983 
Symbol 
ID7401877 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp975003 
End bp976883 
Gene Length1881 bp 
Protein Length626 aa 
Translation table11 
GC content61% 
IMG OID643708048 
Productprotein of unknown function DUF839 
Protein accessionYP_002565650 
Protein GI222479413 
COG category[R] General function prediction only 
COG ID[COG3211] Predicted phosphatase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAAACA ACTATACCCG GCGCACAGTC GTATCGAGTC TAGCCGCGTT AGCGGCGGCG 
AGCCAGACAG CGGGCAGTGT GGCTGGCAAA GAGGCAGAAG GCCAAGGCCA CGGCATCGAA
CAGGAAGGAG CGACGCTCAA CCGCTTTGCG ACGACCATCA TCGGTGCTGA GATCACTGGC
ATGTTCATCA CCGAGGACGG GCGGTTCTTC TTCAACGTCC AGCACCCGGA CGCGAACCTC
GACGGGGAGG ACGAACCGGG AATTCTCGGC GCAGTCACGG GAGTAGACAT GAACCAGCTC
CCCAGGGATT TCCAGAGCGT CCAGATTCCC GAGGGAGACG ACGACGATTA CAGCGACGAC
GGCGACGGAG TACCCGAGCC GTACGACCAG CGGGTGCGGA CGGCACTGGG TGACTATCAG
CGGCTTGCGA CCGGCGGCGA CGAGACCGAC GACGGCGAGG AACTGGGATC GGTCTACACA
CCTGAGGGCG ACTCGCTCAC CGGACAGATC AACCCCGATT TCAACGGCTA CGTCCCATCG
AGCGAGGAAC CCGACGAAGG CTACCTGTTC ACCAACTGGG AACACCGTCC GGGAGCGATG
ACGCGAGTTC ACCTGCAGCA GAACGGCCGT AACGGCACGT GGCGGGTTCT CGGCATGGAG
AATCTCGACT TTTCCGCCGT GGAAGGAACC TGGGTCAACT GCTTCGGGAC CGTCTCTCCG
TGGGGCACCC CGCTGACCTC CGAGGAGAAC TACTCCATTC CGGATACGCC GGTGTGGAAC
AACCCTGACT GGCAATACAA AGGCGGTGTC GAGCGGCTTG CACGGCACCT CGGCCACGAA
CGAAACGATG ACGGCATCTT TGCCGATAAG TTCCCGAACC CGTACCGCTA CGGGTACATC
GTCGAACTGA AAGAGCCGGA AAGCGAGGAG CCGATACCCG AGAAGCGGTT CGCACTCGGT
CGCTCGACGC ACGAGAACGC GGTCGTCATG CCGGACGAGA AGACCGCCTA CACCACCTCC
GACGGGACCG CCCGTGGCTT CTACAAATTT GTCGCCGACG AGAAGGGTGA CCTTTCCTCA
GGAACGCTAT ACGCTGCGAA GGCCACTCAA AAGGGACCGC TCGGCGGCGA TCCCGACAAG
GTCAGCTTCG GCATCGAGTG GATCGAACTC GGGCACGCCA GCGACGAGGA AATCGAGAAG
TGGATTGCCG AGTACGACGA CATCACCCAG GCGGACTACG AGGACGGTGA GAACTCGTAT
ATCTCCGAAG GGGAGATGGA CGAGTGGGCC GCAGGGGACG CAGACGACGA CCGCGTCGCC
TTCCTCCAGT GTCGACAGGC CGCAATGCGG AAAGGCGCAA CGACGGAATT CCGCAAGATG
GAGGGGATCA ACATCCGGCG CGGTGCCGAA GCGGGCGAGG ACTACATGTA CGTCGCCATG
TCGAACACCA ACCGAACGAT GGGCGACGAC GAGGGCGACA TCCAGCTTAA CGGCGACGAA
TGGGGTGCCG TCTACCGAAT GCCACTGGAG AGCGATTACG ACATCAGCGA GATGGAGCCG
ATCGTCACCG GTGGGCCGGA GGCCAATATC TGTGGTGGCT GTCCCTACGA CGCGAATCCG
AACGCTAACG ACAAGGCGTG CCAATCGTGC GCGTTCAACC CGACAAAGGA CGACGAAGAC
CAAGGTCGTT TAAAGGGCAC GATGAATCTG GCAAAATCGA TGGCCATGAG TGGGCAAACC
TCACTCGACG TGGAGAACAC GATTGCCGAA CCTGACAACA TCGTTGTCAT GGACGACGGA
CGGGTCGTCA TTGGCGAGGA TACGGGTAAT CGTGGTCACG AGAACAACAT GATTTGGGTG
TTCGATCCAG GTTCTGCTTG A
 
Protein sequence
MPNNYTRRTV VSSLAALAAA SQTAGSVAGK EAEGQGHGIE QEGATLNRFA TTIIGAEITG 
MFITEDGRFF FNVQHPDANL DGEDEPGILG AVTGVDMNQL PRDFQSVQIP EGDDDDYSDD
GDGVPEPYDQ RVRTALGDYQ RLATGGDETD DGEELGSVYT PEGDSLTGQI NPDFNGYVPS
SEEPDEGYLF TNWEHRPGAM TRVHLQQNGR NGTWRVLGME NLDFSAVEGT WVNCFGTVSP
WGTPLTSEEN YSIPDTPVWN NPDWQYKGGV ERLARHLGHE RNDDGIFADK FPNPYRYGYI
VELKEPESEE PIPEKRFALG RSTHENAVVM PDEKTAYTTS DGTARGFYKF VADEKGDLSS
GTLYAAKATQ KGPLGGDPDK VSFGIEWIEL GHASDEEIEK WIAEYDDITQ ADYEDGENSY
ISEGEMDEWA AGDADDDRVA FLQCRQAAMR KGATTEFRKM EGINIRRGAE AGEDYMYVAM
SNTNRTMGDD EGDIQLNGDE WGAVYRMPLE SDYDISEMEP IVTGGPEANI CGGCPYDANP
NANDKACQSC AFNPTKDDED QGRLKGTMNL AKSMAMSGQT SLDVENTIAE PDNIVVMDDG
RVVIGEDTGN RGHENNMIWV FDPGSA