Gene Hlac_2682 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2682 
Symbol 
ID7400889 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp2669892 
End bp2671721 
Gene Length1830 bp 
Protein Length609 aa 
Translation table11 
GC content70% 
IMG OID643709756 
Productconserved repeat domain protein 
Protein accessionYP_002567323 
Protein GI222481086 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1361] S-layer domain 
TIGRFAM ID[TIGR01451] conserved repeat domain 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.549235 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCGTA TCCGGAAGAT CGCGGTCGTC GCGCTTGTCT GCTGTCTGCT AGTGGCCGGC 
ACCGCGCTGG TCGCGGCCAA CGACACGGTC AGCGGCGATC CCGACATCGA GGCGTTCGCG
CCCGAGACGG CGTTCGTTCC CGGCGAGGAG TCGACGCTGC AGGTGAGTCT GAACAACCGC
GGCGACGTGA GCGAGGAGGG GTTCGACGAC CTCGAAAGCG AGGTGGTGAC CGCCCACGAG
ACCACCGCGC GGATCCTCTC CGGCGACGAG ACCAACCGCG ACGTTCCGTT CGACGTGCGC
ACCGGCGAAC AGACGGTGGG CGACGTGCCT CGGGGCGTCA CCGGTCCGAT CGACTTCACC
GTCGTCCCCG ACGAGGACGC GGAACCGGGC GTCTACCAGG TCCCGATCCG ACTGGAGTAC
CGGAACGTCT ACAACGCCGA GGACGACAAC GGCGCGACCC TCCGCGACGA GCGCGTCGAG
ACCGAGACCG TGGTCGTCGA CGTCGAGATA ACCGACCGCG CGCAGTTCGC GGTCACCGCC
GTCGACGGCG CGGTCCAAGC CGGCGACACC GGCGTCGTCG ACGTGACGAT GCGAAACGTC
CAGAACGAAA CCGCCCGCGA GGCGTCGGTG GCGGCGACCC CGGTCGACCC TGATCTCACC
TTCACCACCG AGGCCGGCAC CACCGAGACG TACGTCGACG ACTGGGCGCC CGGCGAGAAC
CGCACCTTCA CCTACCGGTT CGACGCCGCC GCCGACGCGA CGCCACGGGC TTCGACGCTG
GAGTTCGACG TGGAGTACCG AGACGCCGAG CGTGCGGACG CTACCGCCCG AACGGTTCGG
ACCGGCGTGA CCCCGCTCTC GCGGCAGGCG TTCGACGTGA CCGGCCTCAA CAGTTCGCTG
GAGGTCGGCA AGGACGGCTC GTTCACGGTC GCGGTCCGCA ACGACGGCCC GCGTCCGGTG
GAGAACGCCG TCGTCGCCTT CGACAACGAG GCGCCGGCGC CGGAGGGGGT CGGGGCGGAC
ACGATTCCGA CCGACGAGAA CGTCGTCCCC CGCGAGACGC GGGTGACCGT CGGCGACCTC
GGGGTCGGCG AGACCGCGAC GGCGACGTTC GACGCCGGGA TCCGCACCGA CGCAACCCCG
GGCAATCGGA CGCTCAACCT CGTCGTGCGC TATCGCGGGC TCGACGACGA CGTGGTCGTC
TCCGACGCGT ACGACGCCGT CGTCGATGTG CGCCCGGAAC AGGAGACCTT CGCCGTCTCA
CCGGTCGAAC CGCGTGTCGA CGAGGACGGT GCGGGCGGTA ACGAGACCGG CAGTGACGGT
AACGGGACTG ACGAGGGCGA CAACAGAACC GCCGCCGTCG CCGGCGCCGC GACCGGAATC
GCGCCGGGTG AGACCGCCCG GTACGACGTG GTCGTTCGTA ACACGGGCTC CGAGCCGGTC
TCGGACGTGC AGGCGAAGCT GTTCGTGGAC GAGCCGATCT CCTCGGACGA CGACGAGGCG
TTCGTCACGT CGCTGGATCC GGGCGAGGAG ACGACCCTGC GCTTCGAAGT AAGCGCTGAC
GGCGGGGCCG CGCCGAAGAC GTACTCCGCC GCGGTCGACT TCCGGTACGA CGACGCCGAG
GGCGACGAGC AGCTCTCGGA CACCTACCGG CTCCCCGTCG AGGTCGCCAC CGACGACGGT
GGCAGCGTGC TCCGGTCACC GGTCGGAATC GTCGCCCTGC TGAGCGCCCT TGCGATCGGC
GCCGCGGCGG TCTGGAAGCG CGGCCGGGTG AGCCGAGCGA TCTCGCGGTT CCGCCGGCGG
GCCCGCGACC GGTTCGGCGG CAACCGGTAG
 
Protein sequence
MSRIRKIAVV ALVCCLLVAG TALVAANDTV SGDPDIEAFA PETAFVPGEE STLQVSLNNR 
GDVSEEGFDD LESEVVTAHE TTARILSGDE TNRDVPFDVR TGEQTVGDVP RGVTGPIDFT
VVPDEDAEPG VYQVPIRLEY RNVYNAEDDN GATLRDERVE TETVVVDVEI TDRAQFAVTA
VDGAVQAGDT GVVDVTMRNV QNETAREASV AATPVDPDLT FTTEAGTTET YVDDWAPGEN
RTFTYRFDAA ADATPRASTL EFDVEYRDAE RADATARTVR TGVTPLSRQA FDVTGLNSSL
EVGKDGSFTV AVRNDGPRPV ENAVVAFDNE APAPEGVGAD TIPTDENVVP RETRVTVGDL
GVGETATATF DAGIRTDATP GNRTLNLVVR YRGLDDDVVV SDAYDAVVDV RPEQETFAVS
PVEPRVDEDG AGGNETGSDG NGTDEGDNRT AAVAGAATGI APGETARYDV VVRNTGSEPV
SDVQAKLFVD EPISSDDDEA FVTSLDPGEE TTLRFEVSAD GGAAPKTYSA AVDFRYDDAE
GDEQLSDTYR LPVEVATDDG GSVLRSPVGI VALLSALAIG AAAVWKRGRV SRAISRFRRR
ARDRFGGNR