Gene Information |
Locus tag | Pars_0207 |
Symbol | |
ID | 5054131 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 187527 |
End bp | 188648 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640467786 |
Product | uncharacterized linocin/CFP29-like protein |
Protein accession | YP_001152474 |
Protein GI | 145590472 |
COG category | [S] Function unknown |
COG ID | [COG1659] Uncharacterized protein, linocin/CFP29 homolog |
TIGRFAM ID | |
|
|
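The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 187527
end_bp = 188648
gene_length_bp = 1122
protein_length_aa = 373

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: a non-spliced CDS ends in one stop codon that is not translated.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print(span_bp, span_bp % 3, expected_protein_aa)  # 1122 0 373
```

Under these assumptions the reported gene length (1122 bp) and protein length (373 aa) agree with the coordinates.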
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.502019 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTATTCT CGAAGAACCC GGTAGACATT ACTAGGGACA GAAAGTTGTC CTCTGGAGAG ATCGCCGACT CGCTTAGGCT TGCCATTATG GCCGAGCTAG ATGCCATAAG CCTATATCTC CAATTGGCCA GGCTAATCGA CGACGAGAGA GTAAGGAAGG TCTTTGAGGA CATAGCTAAG GAGGAGAAGA CGCACTTCGG CGAATTTCTC GCACTGCTTA AACACATCGA CCCCGAGCAG GTGGAGCAGT TGAAGGCAGG CTTGATAGAA GTCGGCGAGC TGACAGGCAT AAAGGCGCCG ATGAATGATC CCGACAATAA GCGGGTTGGC GAGTCTAACG CGCGCAGTGA TCCGCCGCCT GACGTGGTCT CTTCGTCGGG CCTAAGCCCC GAAGAGCTCA GGTACTTGCA GAATAGGGTG AGGGAGGTCT CCGGCAAGGT GAGGAGGTTT AGGAGGTATC TTGCGACATA TGAGGCGGGC CCCGGGGTCG ACGCAGTGCC GCTGGAGGAG GCGGCCCCCG GCCCCACAAT AGCCGCTAAT AGGTCGGTGG TGCCGCTTAA GGAGCTGAGC GTGAAGTTCT CCATATCGCA GAGGCAGATC GAGTACGCAA GAGCTAGGGG CGAGGCGGTC TACTCGACGT CGGCGGATAG GGCAGCGGTT AGGCTAGCGT ATGAGGAAGA CGCAACAATA CTCGGCGATA TTCTGGGCAA CCCGAAGGTC AAGACAATGG GCATCACGTC GTGGGACGCG CCGGGGTCTG CCGTAGCTGA GGTCTCAAAC GCCGTAAATC TGCTTTACAG CAACTACGTG CCCGAGCCCT ATGTGTTATT TGTAAGCCCC GGCAGATTCA CGAAACTCTT GACAGTTGTC GAGAAGACTG GCGTCATGGA GCTGACGAGA GTTAAGTCCC TAGTTCAAGA CGTCGTCGTG GTGCCCCAAC TGAGAGACGA CACGGCACTG TTGCTGTCAA CCCACCAATC AATCATCGAC GTAGCCGTAG GCGTAGACAC GGCGCTGGTA TACCTCGGCC CCGAAGACGG TACACACGGG TTCAACTTGT GGGAAACCTT GGCGGTGAGG ATCAAGGATC CCAACGGCGT AGTAGTACTA AAACAAACGT AG
|
Protein sequence | MVFSKNPVDI TRDRKLSSGE IADSLRLAIM AELDAISLYL QLARLIDDER VRKVFEDIAK EEKTHFGEFL ALLKHIDPEQ VEQLKAGLIE VGELTGIKAP MNDPDNKRVG ESNARSDPPP DVVSSSGLSP EELRYLQNRV REVSGKVRRF RRYLATYEAG PGVDAVPLEE AAPGPTIAAN RSVVPLKELS VKFSISQRQI EYARARGEAV YSTSADRAAV RLAYEEDATI LGDILGNPKV KTMGITSWDA PGSAVAEVSN AVNLLYSNYV PEPYVLFVSP GRFTKLLTVV EKTGVMELTR VKSLVQDVVV VPQLRDDTAL LLSTHQSIID VAVGVDTALV YLGPEDGTHG FNLWETLAVR IKDPNGVVVL KQT
|
| |
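As a consistency check between the two sequences above, the first and last blocks of the gene sequence can be translated and compared with the ends of the protein sequence. This is a minimal sketch, not the database's own pipeline: the `translate` helper and the codon table construction are illustrative, and it relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code (the two differ in permitted start codons).

```python
from itertools import product

# Codon table built in TCAG order; amino acid assignments as in the standard code,
# which translation table 11 shares (assumption stated in the text above).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 12 bases, copied from the Gene sequence row.
first_blocks = "ATGGTATTCT CGAAGAACCC GGTAGACATT"
last_blocks = "AAACAAACGT AG"

print(translate(first_blocks))  # MVFSKNPVDI
print(translate(last_blocks))   # KQT
```

The two results match the first ten residues (MVFSKNPVDI) and the last three residues (KQT) of the protein sequence row. The last 12 bases are in frame because the full gene length, 1122 bp, is a multiple of 3.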