Gene Hlac_0111 details

Gene Information

Locus tag: Hlac_0111
Symbol:
ID: 7401631
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 116382
End bp: 117887
Gene Length: 1506 bp
Protein Length: 501 aa
Translation table: 11
GC content: 69%
IMG OID: 643707174
Product: replication factor A
Protein accession: YP_002564787
Protein GI: 222478550
COG category: [L] Replication, recombination and repair
COG ID: [COG1599] Single-stranded DNA-binding replication protein A (RPA), large (70 kD) subunit and related ssDNA-binding proteins
TIGRFAM ID:
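
The length fields above can be cross-checked from the coordinates. The sketch below is illustrative only and is not part of the record. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon (the gene sequence in this record ends in TAA). The function names are hypothetical.

```python
# Values copied from the record above; coordinates are assumed to be
# 1-based and inclusive.
START_BP = 116382
END_BP = 117887


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1506 501
```

Both values agree with the Gene Length and Protein Length fields.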


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.462396
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGAGTGA TAGAGGACGT CTACGAGGAT CTCGACACCG ATGTCGAGTT CGAGGAATTC 
GAGGCAGCCG TCGAGGACAA AGTGGAGCAG ATGGGCGGAC TCGCCGACGA GGAGACCGCG
GCCATGCTCA TCGCCCACGA GCTGCGCGAC GAGGAGGCCG ACACGATCGC CGACATCGAG
CCCGGCATGA ACGACGTGAA GTTCCTCGGG AAGGTGACCT CCATCGGCGA CATCCGCACG
TTCGAGCGCG ACGACGAGGA CGCCGAGGAG GGCCGCGTCT GCAACGTCGA CGTGGCGGAC
GCCTCCGGCT CCGTGCGCGT CGCACTGTGG GACGATATGG CAGCCGCGGC CGAAGAGCAG
TTGGAGGTCG GACAGGTGCT CCGCGTCATG GGCCGCCCGA AAGAGGGGTA CAGCGGACTC
GAAGTGAGCG CGGACAAGGT CGAGCCGGAC GAGGACGCCG AGGTCGACGT GCAGGTGCTC
GACACCTATC GGGTGGAGGA CCTCTCGCTC GGTGCCTCCG ACGTGGACCT CGTCGGGCAG
GTGCTCGACA CGGACTCGAT CCGCACGTTC GACCGCGACG ACGGCTCTGA GGGGCGGGTC
GCCAACCTCA CCGTCGGCGA CGAGACGGGT CGCGTGCGCG TAACACTTTG GGACGACAAG
GCCGACCTCG TCGAGGAGTT CGAGGCCGGC GAGGTCGTCG AGGTCGGCGA CGGCTACGTC
CGCGAGCGCG ACGGCGACTT GGAGCTCCAC GTCGGCGACC GCGGCACCGT CGAGCGCGTC
GACGAGGACG TGGAGTACGT CCCGGAGACC ACGGACATCG CCGAACTGGA GATCGGCGAG
ACCGTCGACA TCGGCGGGGG CGTCATCGAG ACGGATCCGA AGCGCACGTT CGACCGCGAC
GACGGCTCGG AAGGGCAGGT CCGGAACGTC CGCATCAAGG ACGACACCGG CGAGATCCGC
GTCGCGCTGT GGGGCGACAA GGCCGACCGT GAGATCGAGC TGGCCGACCG CGTCGTCTTC
ACCGACGTGG AGGTTCAGGA CGGCTGGCAG GACGACCTCG AGGCCTCCGC GAACTGGCGT
TCGACGGTCA CCGTCCTCGA CGAAGGGAGC GATGCGGTGG GCGGCGCCTC AGGGTCCGGT
AGCGCCGGGA GCAGCGACGA CGGCGACGCG CACGGGACAA ACAATACCGG GCTCGGCGCC
TTCGCTGGAG ACGAACAGAA GGCGGCCGCC GAGGCCGTCG CCGGCGAGAC CGGCGGAGAC
GGTGGGAACA GCGGGGTCGG CGGCGGGAAC GGTGGATCCG GCGGCGCCGC GACCGCGACA
GCGGAGCAGT CGGCCGAAGA GATCGAGTTC ACCGGGACAG TCGTCCAGAC CGGCGATCCT
GTCGTGTTGG ACGACGGCCA GCGGACGAAG AGCGTCGAGA CCGACGCGAG CCTCCGGCTC
GGCGAGGAGG TCACGGTGCG CGGGACAGAA CGAGACGGAA CGATCGACGC CGACGACGTG
TTCTAA
 
Protein sequence
MGVIEDVYED LDTDVEFEEF EAAVEDKVEQ MGGLADEETA AMLIAHELRD EEADTIADIE 
PGMNDVKFLG KVTSIGDIRT FERDDEDAEE GRVCNVDVAD ASGSVRVALW DDMAAAAEEQ
LEVGQVLRVM GRPKEGYSGL EVSADKVEPD EDAEVDVQVL DTYRVEDLSL GASDVDLVGQ
VLDTDSIRTF DRDDGSEGRV ANLTVGDETG RVRVTLWDDK ADLVEEFEAG EVVEVGDGYV
RERDGDLELH VGDRGTVERV DEDVEYVPET TDIAELEIGE TVDIGGGVIE TDPKRTFDRD
DGSEGQVRNV RIKDDTGEIR VALWGDKADR EIELADRVVF TDVEVQDGWQ DDLEASANWR
STVTVLDEGS DAVGGASGSG SAGSSDDGDA HGTNNTGLGA FAGDEQKAAA EAVAGETGGD
GGNSGVGGGN GGSGGAATAT AEQSAEEIEF TGTVVQTGDP VVLDDGQRTK SVETDASLRL
GEEVTVRGTE RDGTIDADDV F
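
The sequences above are printed in space-separated blocks of ten. The following minimal sketch, which is not part of the record, shows one way to strip that layout and compute a GC fraction for comparison with the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGGGAGTGA TAGAGGACGT CTACGAGGAT CTCGACACCG ATGTCGAGTT CGAGGAATTC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 3))
```

Passing the full gene sequence through the same two functions gives a value that can be compared with the reported GC content.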