Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_1921 |
Symbol | |
ID | 8384212 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | - |
Start bp | 1943710 |
End bp | 1946589 |
Gene Length | 2880 bp |
Protein Length | 959 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644972989 |
Product | UvrD/REP helicase |
Protein accession | YP_003130823 |
Protein GI | 257052990 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1074] ATP-dependent exoDNAse (exonuclease V) beta subunit (contains helicase and exonuclease domains) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGAGC CCACGCCGAA CGCCCGGCAG CGCGAATTGA TTGAGGCGAT CGAGGGGATT CACGTCGTCG ACGCCGGCGC GGGGACGGGC AAGACGTTCG CGATCACGCG TCGGTATGCG AACCTACTGA GTGAGGGGTA CGAGCCTGAC GACGTGCTGC TGGTCACGTT CACGAACAAC GCCGCCACCG AGATGAAAGA GCGCGTCGTC GCGCGGTGTG ACTACTCGAT GTCTGCGCTC CGGGACGCAC CGATCAGCAC GTTCCACAGC TTCTGTCACG ACCTGCTCCT CGAATACGGG GCCGACGCGC CCTCGTACCT CGGCATCGAC GACCAGATCA CCCAATCGAC GCAGCTGCTC GAAAACGAGG TCATCGAGGC CGATCGCTTT CGGACCTTCC TCTCGCAGTT CGTCGACGCC CATCCCGAAC ACGAGGCCGT GTTCCGGGTG CTCAACGATC CGACCGCCCT GCTCGAGTTG ATCAGGGAAC TCGCCGCGAA GGGCGTCTTC CCCACTGTTG ACGGGTGGTA TCGCGATGGT GCGGCCGCTC TGGACGGCGA TTTCGGGGCC TTCGAGGAGT TGTTTGCCGA GGCGAACGCA CCAAACGAGG GAGCGAACGG AGCCACACAG TCGAACCTTC GCACCTCCCT GAGCGGGTTC GAGCGCGATC GCTGCTTCCT CCCGGACGCC CCCAGCGAGG ACGAGTTGCG AGAGGGCTAT CCCTCGATCG ACGACCGGTG GGCGGAGGAA GCGTTCGTGG AGGACCGCGA GGCGCTGACG GAGTTCATCC ACGACGTGTA CGTCGAGTAT ATCCAGTTCG CGGTCCGGCG GAACTACCTC AACTTCAGCT TCCTCCAGCT ATTCGCTTTC GTCCAATTGT GTGAGGATCA CGCGCTCCGG GAGTCGATCG CCTTCGAGCA GGTGATGGTC GACGAGTTCC AGGACACCAG CGAGATCCAG TTCAAGCTCA CACTCTTGCT CGCCGGGACC GACAACCTCT GTGTCGTTGG CGACTGGAAA CAGTCGATCT ACGGGTTTCA GTACGCCGCC GTCGAGAACA TTCGCTCGTT CGAACGACGA CTCCGGGCGT ACAAACGCGA ACTCAACGGC GAGCACGAAC GCGTCGCCTT CCCCATCGAG GAGGTAACCA CGATCCCATT GCGGCGGAAC TACCGCTCGA CGCAGTCGAT TCTGGACCTC TCGCGGCACG CCCTGACGGT GCCCGCGACG GGTAGCGAGT CGGTCGACCG AACCGTCGAG GATATCGACG GCCTCGAGGC GGCGACCGAT CGGGACCACT CGACGATCGC GGCGTTCACG AGTGCGAGCG AACACGAGGC GATCCTCGAT CGGATCGAGA CGATCGTCGG CAACGACGAC TACGCGGTTG ACGAGAACGG CGACCTTCGC ACGCCACGCT ACGACGACAT CGCGGTCCTC ACCCGAACGC GACGCTTCGG GCGGGAGTTA CAGACCACGG CCGACGAGTT CGGCGTCCCG GTCGCCTACG AGGGCGGCGT GAAGCTCTTC GAGACCGATC AGGCGATCCT GCTGCTTGCA TGGCTCCGGA TTCTCGCCGA CGAGGACTCG CGACGTGGCT GGGCGGTCGT CCTCGAACGG GCGGGCTACA CGCTCGCAGA GGTCGAGCAG ATTCTCGACG AGCACACGTA TCCCGACGCG ATGGACGCGT TTCGCGACCG ACTCGCGGCG ATGGAGTCGA TCGGTGCGAT CGCCCGGCAG GTCTTCGAGC GCTACGGATT TGAGGACGCC TACGCGAGTT CGCTCGTCGC CCTCCTGCAG GATGTCTCGG ACGGGACGAC GCGGAATCTC GGCGGGATGA TCCGCTTCAT CGAGCGGAGT CTGGACGCTG AGGCGACCCA CGAGATCGAC GACAACCCGG GCGGGGACTC GATCACCGTC CAGACGATCC ACGCGGCCAA GGGGCTGGAA CACCCGATCG TGATCGTCGC GAATATCAAC CGGTACAGTT TCCCGCCAGC GGGCGGCGGC GACGATCGGA TTCGCTTCGA GGATCCGATC GGCCTGCGCC AGACGAAACT GTCTTCGACA GCCCACGGTC AGCCCCACCT CTACGACAAC TGGCGGTATC GCGTGCTGTC GGCGTGTCTG GGTCGTGACT ACGACGAGGA GCGCCGTCTG CTCTACGTCG CGATGACGCG AGCGCAGGAT CACCTGCTCT TTTCGGCGGG CGCGGAGCCG AGTCCGCTGT TCGAGAATTT GCCGCTGGAA CCCGAATCCG TCGAGCCGGA CCTTGAAGAG TCCAGGATCG ATCGGACCGA ACAGACGCGA TTGCAGGTGT CGATCCCGGA GCCCGACGTG CCGGCTGGCC AGTCACCACA CGCACTGATG GACGATCGCG TGTTCGAGGA TCGCGACGAC GGTCGTGGGA TCGAGTTCGG GAATCGAGTT CACGAGTTCG CCGAACAGTA TGCCGACGGC GAGGCTGTCG AGCCTGCGAG TGATGACGAA CGGCGAGTTC GAGACTTCAT CGATGAGCTG GAGGGAGAGC TGCACGTCGA AGAGGACGCG TACCTGCCGG TGTCGGTCGG CGAGCAGGAA GTGACTATCT CCGGCGTGAT CGACTTGCTT CACGTGACTG GCGATCGTGT CGAGATCGTC GACTACAAGA CCGACCGTAC CCGGGACGCC GAAACCGAGT ATCGCAAACA GCTAAGCGTC TACTATCATG TCGTCCGACA GCTGTATCCC GATCGAGAGA TTGAACCGAG TCTCTTTTAT TCTGGTAAAG GTACTGTCGT AGCCATCGAT CCACTCTCGA TGGAATCTCT CTCGGATCTG GTTAAAATCG AACGAGAATC AAATAGGAGC GATCAGGGAG AGAATCCTAC TATTCGATAG
|
Protein sequence | MTEPTPNARQ RELIEAIEGI HVVDAGAGTG KTFAITRRYA NLLSEGYEPD DVLLVTFTNN AATEMKERVV ARCDYSMSAL RDAPISTFHS FCHDLLLEYG ADAPSYLGID DQITQSTQLL ENEVIEADRF RTFLSQFVDA HPEHEAVFRV LNDPTALLEL IRELAAKGVF PTVDGWYRDG AAALDGDFGA FEELFAEANA PNEGANGATQ SNLRTSLSGF ERDRCFLPDA PSEDELREGY PSIDDRWAEE AFVEDREALT EFIHDVYVEY IQFAVRRNYL NFSFLQLFAF VQLCEDHALR ESIAFEQVMV DEFQDTSEIQ FKLTLLLAGT DNLCVVGDWK QSIYGFQYAA VENIRSFERR LRAYKRELNG EHERVAFPIE EVTTIPLRRN YRSTQSILDL SRHALTVPAT GSESVDRTVE DIDGLEAATD RDHSTIAAFT SASEHEAILD RIETIVGNDD YAVDENGDLR TPRYDDIAVL TRTRRFGREL QTTADEFGVP VAYEGGVKLF ETDQAILLLA WLRILADEDS RRGWAVVLER AGYTLAEVEQ ILDEHTYPDA MDAFRDRLAA MESIGAIARQ VFERYGFEDA YASSLVALLQ DVSDGTTRNL GGMIRFIERS LDAEATHEID DNPGGDSITV QTIHAAKGLE HPIVIVANIN RYSFPPAGGG DDRIRFEDPI GLRQTKLSST AHGQPHLYDN WRYRVLSACL GRDYDEERRL LYVAMTRAQD HLLFSAGAEP SPLFENLPLE PESVEPDLEE SRIDRTEQTR LQVSIPEPDV PAGQSPHALM DDRVFEDRDD GRGIEFGNRV HEFAEQYADG EAVEPASDDE RRVRDFIDEL EGELHVEEDA YLPVSVGEQE VTISGVIDLL HVTGDRVEIV DYKTDRTRDA ETEYRKQLSV YYHVVRQLYP DREIEPSLFY SGKGTVVAID PLSMESLSDL VKIERESNRS DQGENPTIR
|
| |