Gene Information |
Locus tag | Pars_0985 |
Symbol | |
ID | 5055466 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 876962 |
End bp | 879070 |
Gene Length | 2109 bp |
Protein Length | 702 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640468541 |
Product | SMC domain-containing protein |
Protein accession | YP_001153217 |
Protein GI | 145591215 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0419] ATPase involved in DNA repair |
TIGRFAM ID | [TIGR03185] DNA sulfur modification protein DndD |
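
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and an unspliced CDS that ends in a single stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the Start bp and End bp fields of this record.
length_bp = gene_length(876962, 879070)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2109 702
```

Under these assumptions the computed values agree with the Gene Length (2109 bp) and Protein Length (702 aa) fields.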
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0000497741 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATTAGGC GGGTGGAGCT CATTAATTTT AAGGCGCATG CCAAGGCGGC TTTTAGATTT GGCGAGGGGG TCAACTTCAT CTATGGGCCA AACGGCTCAG GCAAGACGTC GCTGATGGAG GCCATATCAG TTGCTCTCTT CGGCTCTACT TGGGTAAGGA AGGTGGGCTC GAAGTGGTCG GACTACCTGC GCCGGGGATC CACGGCGGGC GAGGTTAGGC TATATCTCAG CTACCAGGGA GGCGAGGTGG TGATTGCCAG ACGTTTTGGG GAAAGCGGCA CCTCTCCTTC AGGCACCTAC ATGGCCGTAA ACGGCTCTAT TATCGCTAGG GGCGATGCGG ACGTCACAGC CGCCGTGGCG ACGAAGTTGG GGATAGGCGT GGAGGAGTTT CGACATCTTC TTTACATACG CCAGGGCGAG TTGAGGAGAA TACTACAGGA GGCTGAGTAT CTAGACAGGA TTTTGAGACT CGACGAGTTT GACAAGGTAG ACGAGCTGTA CAGAGACGTC TACAACGAGC TTAGGGCCAG GAGGGAGAGG ATTGGGGGGA GAGCCGAGGA GCTTGAAAAA CGAATCCAAC CTCTCCGATC AAGGCTGGAA GACCTCCGCA GAAGGCTGGG GGAGGTGGAG GCTAAGCTAA GGGAACTTGA GCCCTATCAA AACCGGCTAC CCGAGGCGGA GAGAAGGTAC CTGGAGCTGA GGGATAGGCA CAACGTCTTG TTGGCAGAAA GAGAGCAACT GGAGAGGAGA CTCGAGGAGC TGGCCCACGC CGCCCTAGAG GCTGAGAAAG ATGTGGAACA GCTAGAAGAG GAGCTGGAGA AGATCCGCAA GGCTGCTCAA GAGCTCAAAA CTCTCCCCAC ATTCGGCGAC GTCGAGAAGG AGTATTTTGA ACTACGACAA GTAGTGTCCG TAGTTGAGAA GATACCGCTG GAGGTAAAGA GCTACGATCC CTCTAGGCTT GAGGAGGCGA GGCAGAGACT TGAAGACTCC TCGAGGAGGC TTGCCCAGGT TAAGTCGCGG CTTGAGCTGT TGAGGGACGT GGTGAGGCTC GCCAGCAGAG CCGAAGGCGG CGTATGTCCA GTCTGCGGCT CACCCCTCCG CCCCGAGGCG GTGAGAAGAC ATGAACTAGA GATTATCGGC CTAGAAAAGG AGGAAAAAAG ACTAGTCTCA TTAATCGAGG AGCTAAGAGA CGAGATTAAG AGACTTGAAT CTCTAGACCG CGCCTATCAG ACCTACAGAG AATACCTCAA CGTGGACCTT CTCTCTGCCA AGAAGAGACT TACTGAGCTG GAAAAGATGT ACAAGGCCAA AGTGGAGACA GAGCGGCGCC GCGCCTATCT GGCCGCCCTA GTAGAAAGAG AGGCCGAAGT GCTGGGCAAG TTAGAAAAAG CAAAGCGGAG GCTGGCAGAG GCTGAGGTAG CCATAGGCGA GGTGGGCGAG CGGCTCAAGA AGGTAGAAGA GGAGCTAGAA AAAACCCAGG CCCTACTTAA AGAAGCCGAG GCTGAGTACC TACTCATTAG AGATAGGCAC AGGGAGTACA CGGCGTTGCA GACCTTGGCA AAGGAACTAA GAGAGCAACT ACAATCAACC GAGCTCGAGC TACAAGAGGC CGTAGCTGAG CTGGAAAAAG CTAGGGAAGA CCTCTCTAAG CTTGACAAGG CCCTTGCCGT AGCTAAGAAC GTGCGGGGGA CCTTAGCCGA GTTAAAGCCC GCGGCCCGGC AGATATTCCT ACGGGCAATA AACGAGGAGC TCAACCACGT CTTCCTGAAA CTAAGACACA AGGACGCCTT TAAGTCTGCG 
CAACTAGTTG AAGCTAACGG AAGATACGTG GCCAGGATAT CCACCCCAAA CGGCTACATT GACCACGGCC TACTCTCCCT TGGCGAGCAG AACCTCCTAG CTCTCTCCCT CCGCGTCGCC CTCGCAAGAG CTCTCCTCGG AGGAGCCCCA TTCATGATGC TAGACGAGCC AACAGAACAC CTAGACGAAG AACACAGAAA AAGGATAGTA GAACTAGTAA GAGACCTAAC CTCTGTGGTG CCCACAGTAG TCGTCACCTC CCACCTAGGC GAATTCGAAG AAGTCGCCGA CAACATTATC CATCTATAA
|
Protein sequence | MIRRVELINF KAHAKAAFRF GEGVNFIYGP NGSGKTSLME AISVALFGST WVRKVGSKWS DYLRRGSTAG EVRLYLSYQG GEVVIARRFG ESGTSPSGTY MAVNGSIIAR GDADVTAAVA TKLGIGVEEF RHLLYIRQGE LRRILQEAEY LDRILRLDEF DKVDELYRDV YNELRARRER IGGRAEELEK RIQPLRSRLE DLRRRLGEVE AKLRELEPYQ NRLPEAERRY LELRDRHNVL LAEREQLERR LEELAHAALE AEKDVEQLEE ELEKIRKAAQ ELKTLPTFGD VEKEYFELRQ VVSVVEKIPL EVKSYDPSRL EEARQRLEDS SRRLAQVKSR LELLRDVVRL ASRAEGGVCP VCGSPLRPEA VRRHELEIIG LEKEEKRLVS LIEELRDEIK RLESLDRAYQ TYREYLNVDL LSAKKRLTEL EKMYKAKVET ERRRAYLAAL VEREAEVLGK LEKAKRRLAE AEVAIGEVGE RLKKVEEELE KTQALLKEAE AEYLLIRDRH REYTALQTLA KELREQLQST ELELQEAVAE LEKAREDLSK LDKALAVAKN VRGTLAELKP AARQIFLRAI NEELNHVFLK LRHKDAFKSA QLVEANGRYV ARISTPNGYI DHGLLSLGEQ NLLALSLRVA LARALLGGAP FMMLDEPTEH LDEEHRKRIV ELVRDLTSVV PTVVVTSHLG EFEEVADNII HL
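
As a spot check that the gene and protein sequences above correspond, the first 30 bases can be translated by hand or with a short script. This is a minimal sketch using only the Python standard library; it builds the codon table from the standard amino-acid assignments (which translation table 11 shares for non-start codons), and the helper name `translate` is hypothetical.

```python
from itertools import product

# Codon table in TCAG order; the amino-acid string is the standard
# assignment, shared by translation table 11 for internal codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; whitespace in the input is ignored."""
    seq = "".join(dna.split()).upper()
    codons = (seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3))
    return "".join(CODON_TABLE[codon] for codon in codons)


# First three 10-base groups of the Gene sequence field.
prefix = "ATGATTAGGC GGGTGGAGCT CATTAATTTT"
print(translate(prefix))  # MIRRVELINF
```

The result matches the first ten residues of the Protein sequence field. Note that this sketch translates every codon literally, so an alternative start codon such as GTG or TTG would need to be mapped to M separately.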