Gene Hlac_2723 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2723 
Symbol 
ID7401334 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp2713362 
End bp2714678 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content70% 
IMG OID643709798 
Productprotein of unknown function DUF21 
Protein accessionYP_002567364 
Protein GI222481127 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCTCA CGATCGCGCT CGTCGGCGTG GCCGCAATCC TCGTGTTGAC GGGGATCAGC 
GCCTTCTTCT CCAGTTCGGA ACTCGCCGTG TTCTCGGTCC CGAGCCATCG CGTCGACTCG
CTGCTCGCCG CCGACGTACC CGGCGCGCGG GCGCTATCGG CGCTGCGCGC AGACTCGCAT
CGCTTCCTCG TTACCGCCCT GGTGAGCAAC AACGTCGCCA ACATCGCGGC CGCGTCGGTC
GCGACTGCGG TGTTCGTCCG GTTCGGCTTC TCCGGTGGCG AGGCGGCCAC GGGCTCGACG
CTGGTCACCT CCGTGTTCGT CATCGTGTTC GGCGAGATCG CGCCGAAATC GTACGCGGTG
GCGAACGCGG AGAAGCACGC CCTCCGCGTG TCGCGACCGG TGGTCGCGAT CCAGCGCCTT
ATCCGACCCG TGTTATACAT CTTCGAGGCG CTCTCCGGCG TCGTCAATCG CTTCACCGGC
GGCGAATCCG ACATCGAATC GTACCTCACG CGCGAGGAGA TCGAGACGCT CGTGCTCTCC
GGCGAGGCGG CGGGCGCGCT CGACCCGGAC GAGGGCGCGA TGATCCGCGG CGTCCTCGAT
CTGGAGTCGA CCCGCGTGTC GGCGGTGATG GTCTCTCGGA CCGACATGGT CGCGCTGCCG
GACACCGCCA CGCCCGCCGA GGCGGTCTCG ACCGCCGCCG CGGAGGGCGT CACGCGGATG
CCGGTGTACA GCCAGAACCG CGACGACGTG GTCGGCGTCG TCGACCTGCG CGACGCGATC
GGCGCCAACG AGCGCGGAGA GCCCCTCGCA AGCGCGCTCC ACGAGCCGAC CTTCGTCCCG
GAGACGCAGC CGGTCGACGA GCTGTTCGCG ACAATGCGGT CGAGCGCTCT CCGGATGGCG
ATCGTCGTCG ACGAGTTCGG CGCGGTGGTA GGGATCGTGA CCTTGGAGGA TGTACTCGAG
GAGATCGTCG GCGAACTGGT CGGGGGATGG GAGACCGACC ACGTCGACGT GGTCGCGCCC
GACGCCGCGG TGGCCCGCGG GTGGACCACC GTCGCGCACC TCAACGAGAC GCTCGGGCTG
GACCTCCCGA TCGACGGGGG CACCGAGACC GTCGCGGGAC TGGTGACCCG GCAGCTCGGG
CGGGTCCCCG CTGAGGGCGA CCGCGTCGAG ATCGGCGACG TGACGCTCGC GGTCACGGGC
GCGACCGCGA CGCGAGTGAC CCGAGTGCGG GTGGAGCATC CGGGGATCGG AACCGAGGGC
GAAAGCGGAT CGGACATCTC CGGGTCCCCG GACGCGACGG GCGACGACAC CGAGTGA
 
Protein sequence
MDLTIALVGV AAILVLTGIS AFFSSSELAV FSVPSHRVDS LLAADVPGAR ALSALRADSH 
RFLVTALVSN NVANIAAASV ATAVFVRFGF SGGEAATGST LVTSVFVIVF GEIAPKSYAV
ANAEKHALRV SRPVVAIQRL IRPVLYIFEA LSGVVNRFTG GESDIESYLT REEIETLVLS
GEAAGALDPD EGAMIRGVLD LESTRVSAVM VSRTDMVALP DTATPAEAVS TAAAEGVTRM
PVYSQNRDDV VGVVDLRDAI GANERGEPLA SALHEPTFVP ETQPVDELFA TMRSSALRMA
IVVDEFGAVV GIVTLEDVLE EIVGELVGGW ETDHVDVVAP DAAVARGWTT VAHLNETLGL
DLPIDGGTET VAGLVTRQLG RVPAEGDRVE IGDVTLAVTG ATATRVTRVR VEHPGIGTEG
ESGSDISGSP DATGDDTE