Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_0623 |
Symbol | |
ID | 7401758 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 640651 |
End bp | 641925 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643707689 |
Product | replication factor A |
Protein accession | YP_002565295 |
Protein GI | 222479058 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1599] Single-stranded DNA-binding replication protein A (RPA), large (70 kD) subunit and related ssDNA-binding proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.431292 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGTCA ACAGCCATGC CGAGGAGCTC GCCTCCGACC TCGGTGTCGA CAAAGAGGAG GTCAAAGCCG ACCTGCAGAA CTTACTGGAG TACAGCGTCC CGATCGACGA GGCCAAACAG AGCGTCCGCC GGAAGCACGG CGGTGGGGGT GGTGGCTCGA CGCCGACGCC AGATTCGGTC GACGTGGACG AGATCACCAC CGACCACGAC GGCGTGACGG TGACCGTCCG CGTGCTCACG CAGGGAACCC GCACGATCCG GTATCAGGGC GACGATCTCA CGATCCGTGA GGGGGAGATC GCCGACGAGA CCGGCGTCAT CTCCTACACC GCGTGGCAGG ACTTCGGGTT CGAGCCGGGC GACTCGCTGA CGATGGGTAA CGCCGGCGTC CGAGAATGGG AGGGCGAGCC GGAGCTCAAC CTCAATGACT CCACCACCGT TGCCATCGCC GACGAGACGG TCGAGGTCGA CCAGGAGGTC GGCGGCGACC GGAGCCTCGT CGACATCGCC GCGGGCGACC GCGGCCGCAA CGTCGAGGTC CGCGTGCTGG AGGTCGACGA GAAGACCATC TCCGGGCGCG ACGGCGAAAC GGAGATCCTG GAGGGCGTCG TGGGCGACGC CACCGCGAAG CTCCCCTTCA CCGACTGGCA GCCCCGCTCC GAACTGGAGC CGGGCGCCGA CCTCCGGATC GAGGACGTGT ATGTCCGAGA ATTCCGCGGC GTGCCGTCGA TCAACCTCAC CGAGTTCTCG ACGGTCACTC CCCTCCCCGA CCCCGTCGAA GTTGCGGAGG ACGCACCCCG CCTTTCGGTC GCCGACGCGG TCGCCTCCGG CGGGATGTTC GACGTGGAGA TCGTCGGCAA CGTCCTCGAA ATCCGGGACG GCTCCGGGCT CATCGAGCGC TGTCCGGAGT GCGGCCGCGT GGTCCAGAAC GGCCAGTGCC GGAGCCACGG CGACGTGGAC GGTGAGGACG ACCTCCGCGT GAAAGCGATC GTCGACGACG GCACCGACAC CGTCACCGTC GTCCTCGACG ACGAGCTCAC CGCCGAGGTG TACGGCGGGG GGCTCGACGA CGCCCTCGAC GCCGCGAAAG ACGCGATGGA CAAGGAGGTC GTCGCCGAGG CGATCGCCGA TTCACTGGTC GGACACGCCT ACCGCGTCCG CGGGAACCTT TCGGTCGACG ACTACGGCGC CAACCTCGAC GCCAGCGAGT TCGCGCTCGC GGATGACGAC CCCGCCGATG CCGCCCGTGC CGCGCTCGCG GAGGTGGGCG AATGA
|
Protein sequence | MDVNSHAEEL ASDLGVDKEE VKADLQNLLE YSVPIDEAKQ SVRRKHGGGG GGSTPTPDSV DVDEITTDHD GVTVTVRVLT QGTRTIRYQG DDLTIREGEI ADETGVISYT AWQDFGFEPG DSLTMGNAGV REWEGEPELN LNDSTTVAIA DETVEVDQEV GGDRSLVDIA AGDRGRNVEV RVLEVDEKTI SGRDGETEIL EGVVGDATAK LPFTDWQPRS ELEPGADLRI EDVYVREFRG VPSINLTEFS TVTPLPDPVE VAEDAPRLSV ADAVASGGMF DVEIVGNVLE IRDGSGLIER CPECGRVVQN GQCRSHGDVD GEDDLRVKAI VDDGTDTVTV VLDDELTAEV YGGGLDDALD AAKDAMDKEV VAEAIADSLV GHAYRVRGNL SVDDYGANLD ASEFALADDD PADAARAALA EVGE
|
| |