Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_1891 |
Symbol | |
ID | 8384182 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 1898341 |
End bp | 1900647 |
Gene Length | 2307 bp |
Protein Length | 768 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644972959 |
Product | DNA polymerase I |
Protein accession | YP_003130793 |
Protein GI | 257052960 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0417] DNA polymerase elongation subunit (family B) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCTTCA CGATCGACTT CCTCGGCGAC GGCGACCCGC TCGTGTGGTC CCTCGACGGC ACCGTCGATG AACCGGCGTG GTCGGCGTCG CGCGCGGCCG CGTATCGCCC GACGGTGTAC GCGGTCGCCG CGCGTGGGAT GACCGAACGC GATCCCGACC GCGAGGCCTG CATCGCCGAT TTGAACGATC TCCGGGCGGA TCTCGACATG CATCCGGCGG TTGCAGACCT TCGTTTCGAG TGGAGGTCGC CGGGGTTTCG CTTCGCCGAC CAGCCCGTCC TCCGGATCGA CGTCGACCGG GTCGATGCGG TCCGGGAAGT CGCTCGCTTC GTCGAGAATC GCGGCCCACC CGGTCGGGTT CCGTACCGGG CGTTCGACGT GGACTTCTCG CCGGAGTTCC GGTACTGTCT CGAAACCGGG ATCGACCCGA CGCCGGGGCG GCCCCCGCGC GTGCTTCGGC TGGATCTGCC CCGGACTGCC GCGGCCGAAG GTGACCTCAC CGAACTGGCG ATCGGCGCGC GAACGACGGC CCCGACAGCT TCGTCGGCCA CGGACGCGAC CTCGACGGCC ACCGCCGAGC CCGCACTGCC GGGGACGCAG CCGGCCGGCG ACACCGTCGA GGAGGTGCTG GTGACCCTTC GACGGCGGCT GGCGGTCGAG GATCCGGACG TGCTTCAGGT CGAACGGAGT GACATCCTCC CGTTGCTCGA CGAGGCGGCC ACCGAGCACG GCGTCGACCC CGGCCTCCAG CGGGTTCCGG ACGGTACGCC GCGGGCCGAG ATCCCGGCCG TCCAGCAACT CGCGGGGGCG TCAACGTTCG AATCGTACGG GCGGCGGATG CACTCTCCGG CGCGGTACAA CGTTCCGGGA CGGGTCGTGA TCGATCGCTC GAACACGTTC TTCCTCGGCG AGACGAACCT CGCGGGGGCG CTCGATCTCG TCGCCCGGTC GGGCAAGCCG CTGCAGGAAC TGTCCTGGGC GTCGATCGGG AACGTGCTCA CGGCGATCCA GATCCGGGCC GTCCGAAAGC GGGATGTCCT CGTCCAGTGG CGGGCCTGGC GACCGGAGCG GTTCAAGACC GCCCGGACGC TACACGACGC CGATCGGGGC GGAACCACGC TCTCACCGAT CGTCGGCGTT CACGACGCTG TCCACGAACT CGACTTCGCG TCGATGTACC CCAACATCAT CTGCGAACAC AATCTCTCGC CCGAGACGGT CCGGTGTGGG TGTCACGATG GCGAGGACGT GCCCGAGTTG GGCTATTCCG TCTGTGATCG CGAGGGCTAT CTCCCCGACG TCCTCAAGCC GATCATCGAC GATCGTGCCG AGAAGAAACG CCGGCTGGCC GAGGACGACC TCTCGGCCGC CGAGCGGCGC GCACTCTCCG GGCAAGTGGA CGCCCTGAAG TGGATCCTCG TCTCGTGTTT CGGGTATCAG GGGTTCAGCA ACGCCAAGTT CGGCCGGATC GAGGTCCACG AGGCGATCAA CGCCCACGCA CGCGACGTCC TCCTCTCGGC CAAAGAACGC CTGGAAGCCG GCGGCTGGTC GGTCCTGCAC GGCATCGTCG ATTCGATCTG GGTGACGCCC CGCGAGGGGG CGACCCAGGA GCCACTCGAG GAGATCGCCG CAAAGATCAC CGACGACGTC GGGATCGCCC TGGAGCACGA GGGGGACTTC GACTGGGTGG CGTTCTGTCC CCGACGCGAC GGCGAGGGCG GGGCGCTGAC GAAGTACTTC GGCAGTCGGC GCGATCCGGA CGGGGACGAC CCCTTCAAGG TTCGCGGGCT CGCCTGCCGG CAGCGCTCGA CGCCGCCGTG GGTCGCGGGG CTTCAGCGGT CGCTCATCGA GACGTTCGAC GGGACGCGCG ATCCGAGCGC GGTGATCGAC ACGCTCGCGG CCCACCTCGC AACTCTTGCG GCCGGCGATG TCCCGACGAC GGACCTGCTC GTCCGGAATC GCGCCTCGAA GGACGTCGAC GCCTACACCC ACCGGACCCG CACCGTCGCC GCGCTGGAGC GGGCGGATTC GATCGGGCTT GACTCCGCGC CGGGCCAGGA CATCGTCTAC GTCGTCCGCG ACGACGACAG AGAGGGGATG GACCGAGTGC GGCTGGCGTC GGAGATCGAC GACGAGAGCG ACTACGACGC GGGCTACTAC CGCGAGGCGG CGATCAGGGC CGCGGTCGGC GTCCTCGGCC CGCTGGACTG GACGGACGCG GACGTCCGGG ACGCACTCGC CGGCGAGCGG GACGCGACGC TGTCGACGTT CGACGGGATG GAGCTGGATC CCGACCGCCG GATCTAA
|
Protein sequence | MVFTIDFLGD GDPLVWSLDG TVDEPAWSAS RAAAYRPTVY AVAARGMTER DPDREACIAD LNDLRADLDM HPAVADLRFE WRSPGFRFAD QPVLRIDVDR VDAVREVARF VENRGPPGRV PYRAFDVDFS PEFRYCLETG IDPTPGRPPR VLRLDLPRTA AAEGDLTELA IGARTTAPTA SSATDATSTA TAEPALPGTQ PAGDTVEEVL VTLRRRLAVE DPDVLQVERS DILPLLDEAA TEHGVDPGLQ RVPDGTPRAE IPAVQQLAGA STFESYGRRM HSPARYNVPG RVVIDRSNTF FLGETNLAGA LDLVARSGKP LQELSWASIG NVLTAIQIRA VRKRDVLVQW RAWRPERFKT ARTLHDADRG GTTLSPIVGV HDAVHELDFA SMYPNIICEH NLSPETVRCG CHDGEDVPEL GYSVCDREGY LPDVLKPIID DRAEKKRRLA EDDLSAAERR ALSGQVDALK WILVSCFGYQ GFSNAKFGRI EVHEAINAHA RDVLLSAKER LEAGGWSVLH GIVDSIWVTP REGATQEPLE EIAAKITDDV GIALEHEGDF DWVAFCPRRD GEGGALTKYF GSRRDPDGDD PFKVRGLACR QRSTPPWVAG LQRSLIETFD GTRDPSAVID TLAAHLATLA AGDVPTTDLL VRNRASKDVD AYTHRTRTVA ALERADSIGL DSAPGQDIVY VVRDDDREGM DRVRLASEID DESDYDAGYY REAAIRAAVG VLGPLDWTDA DVRDALAGER DATLSTFDGM ELDPDRRI
|
| |