Gene Hlac_0125 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0125 
Symbol 
ID7401646 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp132752 
End bp134467 
Gene Length1716 bp 
Protein Length571 aa 
Translation table11 
GC content67% 
IMG OID643707189 
Productvon Willebrand factor type A 
Protein accessionYP_002564801 
Protein GI222478564 
COG category 
COG ID 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.178394 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGACA ACGACAACAT CGACACAATC GGACTTTCGC GGCGCACGCT GCTGGCGGGC 
CTCGGCGCGA TCGGCGTGGC CTCAGCGGGG GCTGGGCTCG GGACGACGGC GTACTTCAAC
GACACCGAGT CGTTCGAGAA CAACACGCTC ACCGCCGGCC AGCTCGACCT GCTCGTCGAC
TGGCAGCAGA CGTACGACTT CGGCGATGGC CATCAGTTCG TCAGCGCGCA CCCGGACCAC
GACGGCGACG GGGAGCAGTC GATCGCCGCT GACAACGACG CCGGAGAGAT CAAGTACAGC
GACTTCCCGG ACGACGAGGA CGAAGACAGC AACGGGGCGA ACATCCCCGT CCTCGACTGC
GAGACCATCC CGCCGCTTTC GGAGGCGGAC TTCGGCACGG ACCCCGTCAC CGGCGAGGAG
ATGGAGACGC TCGTCCAGTT CACCGACGTG AAGCCCGGCG ACTCGGGCGA GATCACCTTC
TCGTTACACC TCTGTGACAA CCCCGGCTAC ATCTGGATGC AGGCGGGCAA CGTCAGCGAT
GAGGGCGGCG CGTTCACGGA GCCGGAAGAC GTCGCTGCCG GCGAGAACGC CGCCGACCTC
GCGGACGCCA TCGACGCGAA GCTCTGGTAC GACGAGGACT GCGACAACGT TCACGACGAC
GCGGGACCGA TCGACATCAT GCTCACGCTC GACTTCTCGG GGTCGATGCT GTACGACCAG
TACGGCGGCG TGGTGAGCAC GGACCCGATC CAGGTCGACG GCCAGAGCTA CGGCGAGACG
ACGAAGATCG ACCTCGTCGA GCTCGGCACG CGGCAGTTCA TCGACTACCT GCAGGCGCAG
AATGCCGACG TGCAGGTCGG CGTCGTCTAC TTCGACGGCG AGGGGAGCGG CGAGAACACC
CCTCGGACCG GCATCCTCGA ACCGCTAACG ACCAACCTCT CGGCGGTCGA CACCGCGCTG
TCGAACCTCC GTCAGAAGCT CGCGAACGTT GTGAGCGACG CGGCCCCGTC GACGCCGTTC
GACAACGACG GGAACCCGGA CCCGTACTCG AACGCCGACG GCATCGCCAC GGGCACCTAC
ATCAGCGAGG GGCTCGACGA CGCGCAGACG GAACTCGCCA GCAACGGTCG AGCGAGCGCG
GAAAAGCGGA ACATCGTCCT CTCGGACGGC GAGTCGTTCA ACGGCGACGG GAACACGAAC
TACGCGCCGC CCGCGAGCGC CGCGGCCAAC GCCCGCGCCG CGTCCCCGGC GCCCGCGACC
GACGTGTACA CGATCAACGT CAACGGCAGC GCCAGCACGC TCCAAGCGAT GGCCGGCCCG
GCCGGCGGCT CGGGCGGCGA TCCCGTGTTC TTCAACGACA TCAACGACCC GCTCAACATC
CCGACGGTGT TCGGCAACCT CGCCGCACAG ACCGTCGCGG AGAAGGTCAT CATGGAGGAC
ACGCTCGCGA ACGTCCTCGA CGCGCTCGCG GACGGGAACG GCGTCCCGCT CGACGGGAAC
CGTGCGACGC TGTACGACGA ACTCAGCGAC GACCCCAACG ACCCGGACCG CGAGGCGTTC
CGCGGTGACG GCGTGATGCA CTGCGTCGCG CTCTCGTGGG AGCTCCCGTT CGACGTGGGC
AACGAGATCC AAGGCGACAC GCTCAGATTC GACCTCGGGT TCTACACCGA ACAGGCGCGC
CACAACGACG GTACCGGTCC CGATCTGCCG GCCTGA
 
Protein sequence
MADNDNIDTI GLSRRTLLAG LGAIGVASAG AGLGTTAYFN DTESFENNTL TAGQLDLLVD 
WQQTYDFGDG HQFVSAHPDH DGDGEQSIAA DNDAGEIKYS DFPDDEDEDS NGANIPVLDC
ETIPPLSEAD FGTDPVTGEE METLVQFTDV KPGDSGEITF SLHLCDNPGY IWMQAGNVSD
EGGAFTEPED VAAGENAADL ADAIDAKLWY DEDCDNVHDD AGPIDIMLTL DFSGSMLYDQ
YGGVVSTDPI QVDGQSYGET TKIDLVELGT RQFIDYLQAQ NADVQVGVVY FDGEGSGENT
PRTGILEPLT TNLSAVDTAL SNLRQKLANV VSDAAPSTPF DNDGNPDPYS NADGIATGTY
ISEGLDDAQT ELASNGRASA EKRNIVLSDG ESFNGDGNTN YAPPASAAAN ARAASPAPAT
DVYTINVNGS ASTLQAMAGP AGGSGGDPVF FNDINDPLNI PTVFGNLAAQ TVAEKVIMED
TLANVLDALA DGNGVPLDGN RATLYDELSD DPNDPDREAF RGDGVMHCVA LSWELPFDVG
NEIQGDTLRF DLGFYTEQAR HNDGTGPDLP A