Gene Hlac_1032 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1032 
Symbol 
ID7400103 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1025053 
End bp1026708 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content66% 
IMG OID643708099 
Producttype II secretion system protein E 
Protein accessionYP_002565699 
Protein GI222479462 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0630] Type IV secretory pathway, VirB11 components, and related ATPases involved in archaeal flagella biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.621216 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.536815 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAGTG ACGCACGCGA GGACGACGCC GGGGATCGGA TCGAGGCCCT CCGGCGGCGG 
CTCTCGCGGA CGTGGGAGGT GCTACGGGGG TCCGACATCG ACGTTCGGGC CTTCAGACCG
GGCGATGACG GCCCGCTCGC CGACTTCGCG ATCCCCGACG GCGAAAGCGA GGTCGACCGG
TACTGGGTGA ACGCCCCGTA CGCGTACGTC GTGATCACCT ACCACGACGC GGAAAGCGAA
CACCGGTACT ACGCGGTTGA GCCGGAGCTG GACCGGTTCG AGCGCGACCT CCTCGACCGC
GTGGTCGACG ACATCCGGGA CCCGCTGCTG TATCGCGAGG GTACCGGACG GACCGACGAG
GAAACGCTCA GAACCGAGCT TGAGGGGCTG TTAGAGGGGT ACGGCGTCGA CATCGGGATG
GATACGTTTC ACGCGCTCGC GTACTACCTC TACCGCGACT TCCGCGGCTA CGGGAAGGTC
GACCCCTTCC TCAACGACCG CCACATCGAG GACATCTCGT GTGACGGTTA CGACCTCCCG
ATCTTCGTGT ACCACGACGA CTACACCGAC ATCGAGACCA ACGTCTCGTT CGGGAAGTCG
GCGCTCGACA ACTACGTGAT CCGCCTCGCT CAACAATCTG GGCGCCACAT CTCCGTCGGG
GACCCCATGG TGGAGACGAC GCTGCCGGAC GGGTCGCGCG CGGAACTCGC GCTCGGCGAA
GAGGTGACCC CGCGTGGCTC CGCGTTCACG ATCCGCCAGT ACGCCGAGGA CCCGCTCACG
CCGATCGATC TGGTGGAGTA CGGCACGTTC TCGATCGAGC AGATGGCGTA CTTCTGGCTC
TGTATCGAGC ACAACAAGAG CCTCATCTTC GCGGGCGGCA CCGCCTCCGG GAAGACCACC
TCGATGAACG CGGTGTCGAT GTTCGTCCCG CCGCGCGCGA AGATCCTCAC CATCGAGGAC
ACCCGCGAGC TCTCTCTGTA CCACGACAAC TGGCTCTCCT CGATCACCCG CGAGCGTCGC
TACGAGGGCG CCGACATCGA CATGTACGAT CTGCTCCGGT CCGCGCTGCG CCACCGCCCC
GAGTACATCG TGGTCGGGGA GGTCCGTGGG GCGGAAGCCA TCACCCTGTT TCAGGCGATG
AACACGGGCC ACACCACCTT CTCAACGATG CACGCCGACT CGATCGAGAC GGTCATCAAC
CGGTTGGAGA ACGAGCCGAT CAACGTCCCG CGCGCGATGG TCCAGTCGCT CGACATGCTG
TCGATCCAGA CGCTGACCCG GTCGGGCGAC CAGCGCGTCC GGCGCGCGAA GACGATCGGC
GAGATAGGCG GAATCGACCA GCGCACCGGG GAGCTCGACT ACTCCTCGGC GTTCGAGTGG
GACGCGGAGA CCGACGAGTT CAGCCGGAAC GACTCGTCGC TCATGGAGGA GATCGCCGAC
GAGCGCGGCT GGTCCCGGTC GGAGCTACTT CGCGAGGTCC GGCGCCGCGA GCGCTTCCTC
GAACTGCTCC GCGGGCTCGG CGTCACCGAC TACCGGGCGT TCACCGCCCT CGTCAACGAG
TACTACGCCG ATCCTGAACG CGTGATGGAC CGGCTCGAAG AACGGAGCGA CGGCGCGGCC
AAACCCTCGG GACCGGTCGG TGACGCCGCC GACTGA
 
Protein sequence
MASDAREDDA GDRIEALRRR LSRTWEVLRG SDIDVRAFRP GDDGPLADFA IPDGESEVDR 
YWVNAPYAYV VITYHDAESE HRYYAVEPEL DRFERDLLDR VVDDIRDPLL YREGTGRTDE
ETLRTELEGL LEGYGVDIGM DTFHALAYYL YRDFRGYGKV DPFLNDRHIE DISCDGYDLP
IFVYHDDYTD IETNVSFGKS ALDNYVIRLA QQSGRHISVG DPMVETTLPD GSRAELALGE
EVTPRGSAFT IRQYAEDPLT PIDLVEYGTF SIEQMAYFWL CIEHNKSLIF AGGTASGKTT
SMNAVSMFVP PRAKILTIED TRELSLYHDN WLSSITRERR YEGADIDMYD LLRSALRHRP
EYIVVGEVRG AEAITLFQAM NTGHTTFSTM HADSIETVIN RLENEPINVP RAMVQSLDML
SIQTLTRSGD QRVRRAKTIG EIGGIDQRTG ELDYSSAFEW DAETDEFSRN DSSLMEEIAD
ERGWSRSELL REVRRRERFL ELLRGLGVTD YRAFTALVNE YYADPERVMD RLEERSDGAA
KPSGPVGDAA D