Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_3165 |
Symbol | |
ID | 4444225 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 3554170 |
End bp | 3555876 |
Gene Length | 1707 bp |
Protein Length | 568 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 639690991 |
Product | Allergen V5/Tpx-1 family protein |
Protein accession | YP_832643 |
Protein GI | 116671710 |
COG category | [S] Function unknown |
COG ID | [COG2340] Uncharacterized protein with SCP/PR1 domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.845212 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGGAAC AATTGCATAT CTTTCGCGGC GCTTTGTACC GCTCTCGTGA GCTCGTTTCA AGAGGAGCCA AATCAGCGCT TGGCGCGCTG ATCTGCCTGT CATTGTTGTC AGCATCGTTG CTGGCGGGAA TGCCGGCAGC GCAAGCTGCG GAGCTGGTGG GTGGCGAGAC AATCGACCGG ACCAATCAGG CCAAAATCGT CGAAGTTTTC AATGGCATCA ACGACTTCCG GGCCTCCCAA GGGCTCAATC CGGTGAATTT CAATGCCACT GTCTCGGAGA TGGCCGAAGA CTGGTCCGAC CATATGGCGG CTTCCGGTAA TTTTGTCCAC AACCCGAATT TCTACACCGA TGCCAGGGTC ACGGGCCGCT TCGCCGGCGC GGCCGAAATC ATCGCAGCCC GTTCGGATGA CTGGGCGCAG GGACTGGTGG AGCAGTGGAT CGACTCACCC GGACACAACG CCGTCATGAG CGATCCCAAG CTCACCACCG TGGGCGTAGG GATCACCTAT TTGGAGGGCA AACGCAGCGG CGAGCTGACG TTGTATGGCA CGGTGAATTT CTTCACGTTC TGGAATCCGC CGGTTGGGAT GTACACCACC GCCCAGGACT TCTTCGACGG CAAGCCCTCC ATCGACACGG CACAGATAAT CACCGTCGAT ACTGAAGATC CCGTCTTTGA CGACGGGCTC AATAAAGTGA CCATCCCTGA TGCCCAGGGA GTGGACTACT TTGTGAACCA GGTTCCGACG CCGCCCGGAA CCTACGACGC CGTGGTGGGC CGCATGCAGG TCACTGCGAC CGCCAAGGCG GGCTACCGCG TCTTCGGGAC ACTGCGTTGG GAGCATGAGT TCCTGGCACC GCCCACGGAA ATCGTCCCGA TCGCTCCGAC GTTTGATGGC GTAACCGGCC GGTTCACTGT TCCTTACATG GAAGGGGTCC AGTACTGGGT CAACGATTCC CTCTGGGGCT CCTCAACCTT CAGCAGCGAG TGGGACACAA CGGTCTATAT CCGGGCCGAG GCCCTCAAGG GCTACAAGCT CAGCGGCACC ACGTCCTGGC AGTACTACTT CGCCAAGCCG GACCAGCCGG CTGATCCAGG CCCGGAGACG CCCGCAGAGC CGTCCGGCAG CTTTACCGAT GTCCCTGCAG GAACGCAGTT CGCCACCGAG ATCAACTGGC TCGCCAGCCG TGGCATCAGC ACCGGCTGGG TAGAAGGCGA CGGCTCTGCA ACGTACCGGC CGCTGACTCC GGTCAACCGG GACGCCATGG CCGCCTTCAT GTACCGCCTG CTGGGCGAGC CGCCCTTCAA CGCCCCGGCC GCGTCTCCGT TCGCGGACAT GTCCAGCAGC ACCAAGTTCT ACAAGGAGAT CACCTGGCTC GCGGACAAGC GCATTTCCAC TGGCTGGGAA GTCAACGGGG CGCGGACGTA CCGCCCTGTG ACGCCCGTCA ACCGGGACGC CATGGCCGCC TTCCTTTACC GGCTCGCCGG CCAGCCCAGC TTCACGCCCC CGCTCGCCTC GCCGTTCATG GATGTGGATA CGAGCAACCA GTTCTACAAG GAAATCACGT GGCTGGCCGC GCAGGGCATC TCCTCGGGCT GGAACGAAGG CAACGGCAAA GCCAGCTATC GGCCCTGGTC TGCCGTCAAC CGCGACGCCA TGGCCGCCTT CATGTACCGC TGGAACACAA AATTCGGCCG GCCATGA
|
Protein sequence | MGEQLHIFRG ALYRSRELVS RGAKSALGAL ICLSLLSASL LAGMPAAQAA ELVGGETIDR TNQAKIVEVF NGINDFRASQ GLNPVNFNAT VSEMAEDWSD HMAASGNFVH NPNFYTDARV TGRFAGAAEI IAARSDDWAQ GLVEQWIDSP GHNAVMSDPK LTTVGVGITY LEGKRSGELT LYGTVNFFTF WNPPVGMYTT AQDFFDGKPS IDTAQIITVD TEDPVFDDGL NKVTIPDAQG VDYFVNQVPT PPGTYDAVVG RMQVTATAKA GYRVFGTLRW EHEFLAPPTE IVPIAPTFDG VTGRFTVPYM EGVQYWVNDS LWGSSTFSSE WDTTVYIRAE ALKGYKLSGT TSWQYYFAKP DQPADPGPET PAEPSGSFTD VPAGTQFATE INWLASRGIS TGWVEGDGSA TYRPLTPVNR DAMAAFMYRL LGEPPFNAPA ASPFADMSSS TKFYKEITWL ADKRISTGWE VNGARTYRPV TPVNRDAMAA FLYRLAGQPS FTPPLASPFM DVDTSNQFYK EITWLAAQGI SSGWNEGNGK ASYRPWSAVN RDAMAAFMYR WNTKFGRP
|
| |