Gene Hlac_1865 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1865 
Symbol 
ID7400058 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1869740 
End bp1871707 
Gene Length1968 bp 
Protein Length655 aa 
Translation table11 
GC content67% 
IMG OID643708935 
Producthypothetical protein 
Protein accessionYP_002566513 
Protein GI222480276 
COG category[R] General function prediction only 
COG ID[COG0433] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0131162 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCGGG CGGTGTCTCT CATAAGTATG ACCGAGCAGG AGGCGATCCA CGTCGCAGAC 
GTCAGTGAGG GGATCGGCGG CGACGCGACG GCGGAGCCGG GCTCGCCGGT AGAGCTTCCG
GTCGTCGACG TGCTCACCGG CCGATCGTTT ATTACGGGAA AAAGCGGATC CGGCAAGAGC
AACACGGCGT CCGTCGTCAT CGAGAACCTC CTCGATAACG GCTTCCCCGT GATGATCGTG
GACACGGACG GGGAGTATTA CGGCCTTAAG GAAGAGTACG AAATTCTGCA CGCCGGCGCC
GACGACGAGT GCGACATCGT CGTCTCACCC GAACACGCCG AGAAGCTCGC CAATCTCGCC
TTAGAGCAGA ACGTTCCGAT CATCCTCGAC GTCTCGGGCT ACCTCGAAGA AGACACCGCC
AACGAGCTCC TCTTGGAGGT CGTTAAACAG CTGTTCGCGA AGGAGAAGCG CCTCAAGAAG
CCGTTCCTGC TCGTCGTCGA GGAGTGTCAC GAGTACATCC CGGAGGGCGG CGGGATGGAC
GAGACGGGGA AGATGCTGAT CAAGGTGGGC AAGCGCGGCC GGAAACACGG CCTCGGCATC
GTCGGGATCA GCCAGCGCCC GGCCGACGTC AAAAAGGACT TCATCACCCA GTGCGACTGG
CTCTGCTGGC ACCGGCTCAC CTGGGACAAC GACACGAAGG TCGTGGGACG CATCCTCGGC
TCGAAGTACG CGAGCGCCGT CGAGGATCTG GGCGACGGCG AGGCCTTCCT GATGACCGAC
TGGGACGAGT CGATCCGCCG GATCCAGTTC CACCGCAAGC AGACGTTCGA CGCGGGCGCG
ACGCCCGGGC TCGACGACTT CGAGCGCCCG GAGCTCAAGT CGGTCTCCAG CGATCTCGTC
GGGGAGCTCC AGTCCATCTC CGACGAGGAG GAGCGTCGCG AGTCCGAGAT CGCCGACCTC
AAACAGGATC TCGACAAACG GGACCAGCGC ATCCAGGAGC TCGAACGCGA GCTAGAGGAC
GCCCGCGACC TGAGCAACAT GGCCGACCAG TTCGCGCAGG CGCTTCTCGG GAAGGCCGAA
GCGCCGTACC GCGGCGGAAA CGGCCCGTCC GTGGCCGCCG GCGGAGAGGG AACGACTGGC
GACGACGGCG ACCAGTCCGT GCTCAAGTCG TACGACGAGG CGGTCGCCGC GACCGAGAGA
AGCGAGGACG CCGACGGAGA CGCCGATGTC GACACAGACA CCACCGCGGG CAACCGCGAC
ACAGCCCCGA ACGACACCGA CTCCGACGCC CCGGCCGACG ACTCCCCGAC CGGCGAGACC
CCAACCGACG ACTCGAGCGA CGGCGCCCCC GTCGATGACG TCGATCCGGT CACGCGCGCC
GACGTGGCCG AGAGCGCGCT CCGGTTCGAC GACGCGGTCG AGCTCGGCAC CCGCGAGGCC
GTTATCGAGG AGCTTCGGAG TCGGATCGAG GCACTTCCGG AGCTCTCACA GGGGATGCTC
CGACACTACC GACGCGAGGG CGTCAGCGAC CCCGTCGCCG CCCACATCGA CGCCGGCGGC
GACGGCGACT CCGGCCACGC GTACAGCCGT CACCGCCCCA TCCGACGGGC GGGGGTCATT
CGTCACGCCG GTCGAGGCCA CTACGCCTAC GCCGTCCCCG ACCTGATCCG CGAGGCGTAC
GCCGACCGAC TCGACGAGGA GAGCGTCCGT GAGATCGTCC GCGCGGTCGA GACCGCCTTC
GTCCCGCCGG CCGAGCGGAG CTACCCGCCG GACGCCGATC CGGCCGAGGT CGGCGTCGGA
AACGGGTCGG CAGAGACCGA TTTGGACGGT GAGGTCGTTT CGCCCGGCGA AGGTGACGAC
AGCGACGGAA GCGCGTCGGA AACCGAGGCG AGTGCGGATC AGCCGACGAG CCACCTGAGC
GACGCCGCAC GGAAGATCGC GGAGCGCGGG CACACCGTCG ACGAGTAA
 
Protein sequence
MTRAVSLISM TEQEAIHVAD VSEGIGGDAT AEPGSPVELP VVDVLTGRSF ITGKSGSGKS 
NTASVVIENL LDNGFPVMIV DTDGEYYGLK EEYEILHAGA DDECDIVVSP EHAEKLANLA
LEQNVPIILD VSGYLEEDTA NELLLEVVKQ LFAKEKRLKK PFLLVVEECH EYIPEGGGMD
ETGKMLIKVG KRGRKHGLGI VGISQRPADV KKDFITQCDW LCWHRLTWDN DTKVVGRILG
SKYASAVEDL GDGEAFLMTD WDESIRRIQF HRKQTFDAGA TPGLDDFERP ELKSVSSDLV
GELQSISDEE ERRESEIADL KQDLDKRDQR IQELERELED ARDLSNMADQ FAQALLGKAE
APYRGGNGPS VAAGGEGTTG DDGDQSVLKS YDEAVAATER SEDADGDADV DTDTTAGNRD
TAPNDTDSDA PADDSPTGET PTDDSSDGAP VDDVDPVTRA DVAESALRFD DAVELGTREA
VIEELRSRIE ALPELSQGML RHYRREGVSD PVAAHIDAGG DGDSGHAYSR HRPIRRAGVI
RHAGRGHYAY AVPDLIREAY ADRLDEESVR EIVRAVETAF VPPAERSYPP DADPAEVGVG
NGSAETDLDG EVVSPGEGDD SDGSASETEA SADQPTSHLS DAARKIAERG HTVDE