Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1581 |
Symbol | |
ID | 5056172 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 1430605 |
End bp | 1431759 |
Gene Length | 1155 bp |
Protein Length | 384 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640469122 |
Product | hypothetical protein |
Protein accession | YP_001153787 |
Protein GI | 145591785 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG4591] ABC-type transport system, involved in lipoprotein release, permease component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0000122912 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAACGCT TATGCGTGGA TCACATCGTC ATGCTCGCCG CCGGCGACTT CAAGATGAGA ATTGCCAGGT ACTTGTTGAC AGCCCTGGCT ATAAGCGTCG GTGTCGCCCT AGTGGTGGCG TTGGCGACTG TGAGCGACGC CGCAAGGAGC TATGTGGAGC AGACCCTGTA CAAGGTATAC CCTGCCGATG TGATGATGTA CTCCGAGTCT ATAAACATTC CCCAGCGACT ACTCGACGTT TTGAGAAAAT CTCCATCAAT CGAGTCCGCC GAGGGGATCA TAATAACGAC GGGTCTGTAC AACGGGAAGG TTGTGTCAAT AGTGGGAATC CCCCTCAAGG ACGTAGATTA CTTTGCCATA GATCTCATCT CGGGACGCTT GCCGGTCTCC GGCGGCGAGG CCGTTGTGGA GGAGTCCGTG GAGGCGAGGC CGGGCGACGA GATCACCATC AAGGTCTACT CGGGCGCTCT TGGGGGCGAG AGGACTTTAA GAGTTAGAGT GGTGGGCGTG ATGAGGAGCT TCTTAAAAGG GTTCATAGGC GCCTTTCGGC TAAACCTAGT GGTGGTTCCG CTGGACTGGT TGCAACAAAA CCTAGACACA GGGCCTTTTG TAAACACAGT GTTGATCACC GCCAGGAACA AGGCCGAGGT GAAGGCACTT TACGCCACGC TTAAAGAGAC TTTTAAAGAC GCCCAGGTCT TTTCGCAAGA AAACCTGCTA GAGACAGTTA ACCAAGTCTT CAACGCCCTC AACGCCGTGT TTTCCGTAAT TAGCGGCGCG GCCTTGGCCA CGGCGGCAAT TACAACCTTC GCAGTTATGT CGATTACGAC CAGAGAGAGA CTTAGGGAAT TTGGCCTTCT TAAAGCAATT GGCATATCGT CTCGCGACAT AACGCTGTCG GTTGCCGCCG AGGTTATAGC AATAGCTCTG GCGGCTGGGG CCGTCGGCGT GGTGGCGGGC TTCTACGGCG CAAGTTTTGT GAAACAAATT CTTGTGGGTA TGGGGATAAA CTTCGATATG CCGATTACGT TTAGGCCTAT CTATGCTACC ATTGGGCTGG CCACAGCCAT CGTCGTAGCC GCCGTAGGCG CCCTTGTTCC GATGTACAGA GTAGCCAAGT TAAGACCGCT GGAAATTCTA CGGCTATGGC AGTAG
|
Protein sequence | MQRLCVDHIV MLAAGDFKMR IARYLLTALA ISVGVALVVA LATVSDAARS YVEQTLYKVY PADVMMYSES INIPQRLLDV LRKSPSIESA EGIIITTGLY NGKVVSIVGI PLKDVDYFAI DLISGRLPVS GGEAVVEESV EARPGDEITI KVYSGALGGE RTLRVRVVGV MRSFLKGFIG AFRLNLVVVP LDWLQQNLDT GPFVNTVLIT ARNKAEVKAL YATLKETFKD AQVFSQENLL ETVNQVFNAL NAVFSVISGA ALATAAITTF AVMSITTRER LREFGLLKAI GISSRDITLS VAAEVIAIAL AAGAVGVVAG FYGASFVKQI LVGMGINFDM PITFRPIYAT IGLATAIVVA AVGALVPMYR VAKLRPLEIL RLWQ
|
| |