Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_0816 |
Symbol | |
ID | 5055309 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 726140 |
End bp | 727498 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640468377 |
Product | hypothetical protein |
Protein accession | YP_001153054 |
Protein GI | 145591052 |
COG category | [R] General function prediction only |
COG ID | [COG0433] Predicted ATPase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.723695 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.916322 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAGGAGC TTGGAATCGT CGTCAGCGGG TCCACCATCG GCTCCATCCC CGTGCAACTC TACCGCTCGG CGGAGCGCTA CGCAGTGGAG GAGCAGCTGG TAGGGATAGT GGACAGGGAG AACCCCGGGG AGGTGGTGGT GGGCTTCCTC CGCCGCGTGA CGAAGCTGGA GCCGGTGATC AGGGACAGGG TGAGGACGCC CTACGTGGAC CGGCCCGAGA TGGTGGACTA CGGCATTCTC CTGCCCTACA CCTCGGCCAT CGTCAAGCCC TACGTGGCGC TCCGCGACGG GAGGCTTGCC GAGGTGTCCA ACGTCGTGAC GCCGGGGTCC AAGGTCTACC TTCTGGATCC CTCCCTTCTG GAGGGGGCCT TCGCCGGGAG CTTCATCTAC GTGGGGGAGC ACAAGTACTC GGGGTGGAGG CTCCCCCTGG ACCCCGCCTA CGTGACGCAC CACGTGGGGG TCTTCGGTGC CACCGGCATG GGGAAGTCGA GGCTAGTCAG GGCGCTTATC AACGAGCTCG CCGCTAGGGG CCGGAAAGTC ATTGTCTTTG ACCACACGGG GGTGGACTAC GCCCCGTACT ATGGCGCTGT TGTCAAGAGC ACGGAGATTA GAATACCGCC CAACGTCCTC GCCTCGGTGA TTGCCAAGGT GGCCCAGCTC CAGTGGCAGA CCTACGGCGA ATATCTGGAA ATCGCCACGA TGACCTACGA GGGGAGGTGG TCTAGGGAGG GGTTTATTGC CCATTTGAGG AGGGTGATGA AGAGGCTAAA CGCCCGGGAC TCCACAATCG AGAAGGCGGA GTTGTTCTTG AAGCAGTTGG CCACCGCCGA GTTCTTCGAG GAGCTGAACA ACAGAGTGAA GACAGCCGAG GATATCCTCT CGGCGGAGGC TTCTCCTCTC GTAGTCGACT TGAGCTACGA CACAGACCTC TCGGTGAAGC AAGCCATTGT GGCCTCGGTG ATAGAGGCGG CGTGGTCAAA GGTGAAGAGG GACAAGGCCC CGGCGAACAT AATCTTTGTC GTGGACGAGG CCCAGAACTA CGCGCCGCAG ACCTGGACCA TCTCAAAAGA CGCCATAGAG ACAACAGTGA GGGAGGGGAG GAAGTGGGGC CTCTCAATGG TCTTGGCCAG CCAGAGGATT GTCGGCGACA TCGACCCGTC CATCAGGGCC AACCTCGGCA CCGTGTTCTT CTCCAGGCTA ACGGCGCCTA CCGACCTAAG GGAGATCTCC TCTTACTTAG ACCTAGCCGA TATTAACGAA AGCGTCTTAG CCCAGCTTCA GCCGAGGGAG TTCTTCGTGG CGGGGCTCAT GAACCCGTTG AGGAAGCCCG TCCTCATAAG AACTAGGGAG GTGGCGTAA
|
Protein sequence | MKELGIVVSG STIGSIPVQL YRSAERYAVE EQLVGIVDRE NPGEVVVGFL RRVTKLEPVI RDRVRTPYVD RPEMVDYGIL LPYTSAIVKP YVALRDGRLA EVSNVVTPGS KVYLLDPSLL EGAFAGSFIY VGEHKYSGWR LPLDPAYVTH HVGVFGATGM GKSRLVRALI NELAARGRKV IVFDHTGVDY APYYGAVVKS TEIRIPPNVL ASVIAKVAQL QWQTYGEYLE IATMTYEGRW SREGFIAHLR RVMKRLNARD STIEKAELFL KQLATAEFFE ELNNRVKTAE DILSAEASPL VVDLSYDTDL SVKQAIVASV IEAAWSKVKR DKAPANIIFV VDEAQNYAPQ TWTISKDAIE TTVREGRKWG LSMVLASQRI VGDIDPSIRA NLGTVFFSRL TAPTDLREIS SYLDLADINE SVLAQLQPRE FFVAGLMNPL RKPVLIRTRE VA
|
| |