Gene Hlac_1031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1031 
Symbol 
ID7400102 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1022906 
End bp1025053 
Gene Length2148 bp 
Protein Length715 aa 
Translation table11 
GC content70% 
IMG OID643708098 
Producttype II secretion system protein 
Protein accessionYP_002565698 
Protein GI222479461 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1955] Archaeal flagella assembly protein J 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.549235 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGCGT ACCTCCCCCT GCTCGCCGCG TTCGGGTGCT GTCTGGCGCT GCTGCTGCCC 
GCGGTCGACG ACGGTGCCGA CCTGGTCGTG ACCCGCGTCG CGCTGTCGCT CTTCGGCGAC
TACGTCGGCG AGGACGGGCC GCGGCGCCAG CGCCAGCGCG ACCTGATGCG CGCCGCTCAC
GTCGCGGGCA CCCACCGGAC GTACGCCGCC AAGACGCTCC TGTACGCGGG CGTCCTCGGC
GTCGCCGGCA GCGTCATCGG CGTGTACGCC GCGGCCGGGC TCCTCTCGGT GCTCGACGTG
AGCGAGGCGG CCCTCCGTGG GACGCTCCCC GCGTCGCTCG GGTTCGTCGC CGGCGTGACC
CGGCTCACGG AGCTCGGGCT CCCCAAACTC TTCCTCCTGT TGACGCTGGC GTCGGCGACC
CTCGGCGCCG CGCTCGCGGT CGGGATGTAC TACGGGCGGT GGGAGCTGCT CGACCAACGC
GCTCACGCTC GGGGAGCGGA GATCGACGCC ACCCTGCCCC GGACGGTCGC GTTCATGTAC
GCGCTCTCGC GGTCGGGGAT GCCGTTCCCA CGCGTCATGG ACACGCTCGC GGAGAACGAG
GCGGTGTACG GCGAGGCGGC GACGGAGCTG TCGGTCGCCG TCCGCGACAT GAACGCCTTC
GGTACCGACG CGCTGACCGC GCTCCAGCGC ACCTCCCGGC GCACCCCGAG CGACGATCTG
GCCGACTTCG CGGAGAACCT CGCGTCCGTG CTCGGCACCG GCCAACCCAT CTCGACGTTC
CTCAGCGACC AGTACGAGCT GTATCAGGAG GAGGCCGAGT CGAAACAGCA ACAGTACCTC
GAACTCCTGT CGACGTTCGC GGAGGCGTAC GTCACCGCGC TGGTAGCGGG ACCGCTGTTT
TTCATCACCA TTCTCGTCGT GATCGGGCTC GTGTTGGAGG ACACGCTCCC CCTGTTGCGC
GTCGTGGTCT ACCTCGGGGT ACCGCTGGCG ACGTTCGGGT TCGTGGTCTA CGTCGACAGC
GTCACGCAGG GAATCGGCGG CACCGAGACT GTTGACCTCT CAGAGGCGAT CGAAGACGAA
TCAGGGATGA GCGACACCGC AAGCGCGCAG GCGGACGGGA ATCCCCAGCC CGACGGCGGC
GTGGCGGGCG GGTCGCCGTC GGCGGACCGC TGGGCGGCGA GCCGCGAGCG CCTCCGAGCG
TACGATCGGA TCCGAACGCT GCGGCGGTGG GCTGCGTCGC CGGTCGAGAG CGTGCTCGGC
GCGCCGCGAA CGGTGTTTCT CCTGTCCGTT CCGCTCGCGG TCGTCGTCCT GCTCGTGACC
GCCTTCCCGA TCACCCTCGG ACCGCCGACC GAGATGGTCG CACAGGTCGA GACCCCGATC
GTCGCCGCGA CCGTGGTCGT GCTCGCGAGC TACGCCGTCG TCTACGAGGC CCACAAGCGC
CGAGTCCGCC GGATCGAGTC GGCGGTTCCA GACTTTCTCG ACCGCCTCGC CAGCGTCAAC
GAGGCGGGCA CGTCCGTGGT CGGGAGCGTC CGGCGCGTCG CGGACTCGAA CCTGGAGGCC
CTGACCGACG ACCTGCAGCG CACCCGGCGC GACATCGACT GGGGCGCCGA CGTGGGCACC
GCTCTCCGGC GGCTAGAGCG ACGAGTCCGA TCGCCGATGA CCTCGCGGGC GGTCGCGCTC
ATCACGAACG CGATGCGTGC CAGCGGCGAT GTCGGCCCCG TGCTCCGAAT CGCGGCCGAC
GAGTCGCGCG CGACGTGGTC GCTCCGTCGG GAGCGCCGAC AGGTAATGCT CACGTACCTC
ATCGTGATCT ACATCTCCTT CCTCGTGTTC CTCGGAATCA TCGCCTCGCT GTCGGTGTCG
TTCATCCCCG CGATCGAGGA GGCGGCGATC CCCGGTGCCG GCGCCGGCAC TAGTGCGAGC
GACCTTCCGG GCGCGCCGAG CGGTCCCGGA GGGATCACGG ACGGGCTTGG GGACATCAAC
ACGACCGCCT ACGAGCAGCT GTTCTTCCAC GCCGCCGCGA TACAGGCGGT CTGTTCCGGA
CTCGTCGCCG GACAGCTCGG CGAGGGCTCG GTCAGAGACG GCGTGAAACA CGTCGTCGCC
CTGCTGTTGT TGACGCTCGC CACGTTCCTC GTCATCGACT TGGTGTGA
 
Protein sequence
MIAYLPLLAA FGCCLALLLP AVDDGADLVV TRVALSLFGD YVGEDGPRRQ RQRDLMRAAH 
VAGTHRTYAA KTLLYAGVLG VAGSVIGVYA AAGLLSVLDV SEAALRGTLP ASLGFVAGVT
RLTELGLPKL FLLLTLASAT LGAALAVGMY YGRWELLDQR AHARGAEIDA TLPRTVAFMY
ALSRSGMPFP RVMDTLAENE AVYGEAATEL SVAVRDMNAF GTDALTALQR TSRRTPSDDL
ADFAENLASV LGTGQPISTF LSDQYELYQE EAESKQQQYL ELLSTFAEAY VTALVAGPLF
FITILVVIGL VLEDTLPLLR VVVYLGVPLA TFGFVVYVDS VTQGIGGTET VDLSEAIEDE
SGMSDTASAQ ADGNPQPDGG VAGGSPSADR WAASRERLRA YDRIRTLRRW AASPVESVLG
APRTVFLLSV PLAVVVLLVT AFPITLGPPT EMVAQVETPI VAATVVVLAS YAVVYEAHKR
RVRRIESAVP DFLDRLASVN EAGTSVVGSV RRVADSNLEA LTDDLQRTRR DIDWGADVGT
ALRRLERRVR SPMTSRAVAL ITNAMRASGD VGPVLRIAAD ESRATWSLRR ERRQVMLTYL
IVIYISFLVF LGIIASLSVS FIPAIEEAAI PGAGAGTSAS DLPGAPSGPG GITDGLGDIN
TTAYEQLFFH AAAIQAVCSG LVAGQLGEGS VRDGVKHVVA LLLLTLATFL VIDLV