Gene Information |
Locus tag | Arth_3878 |
Symbol | |
ID | 4446835 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 4361996 |
End bp | 4363762 |
Gene Length | 1767 bp |
Protein Length | 588 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639691702 |
Product | Allergen V5/Tpx-1 family protein |
Protein accession | YP_833353 |
Protein GI | 116672420 |
COG category | [S] Function unknown |
COG ID | [COG2340] Uncharacterized protein with SCP/PR1 domains |
TIGRFAM ID | |
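
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that contributes no residue; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 4361996
end_bp = 4363762

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is the stop and encodes no residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1767 0 588
```

Under these assumptions the result agrees with the listed Gene Length (1767 bp) and Protein Length (588 aa).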
| |
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAGGAAGA TTTCGGCACG GACACTGGCA GCCATGATCT GCGCACTGGC ACTCATCACG CCTGCGGCCG GCCAGCTGGG GGGACCTTCC TCAAGCCCAA AACCCGATGT GTCCGCCGTG GCGGACCAGC AGGCGCAAAG CGTCGCCCTC GGCCCGCCAT CGTTCGTGTA CGACGCCGGC TTGGGCACCG CGGATGACAC ACCAGGGCAG CGCCTCGCCC CCGCCGTGGA CCCGGCGGTC AGGGAAGGGG ACCCAGGCGG CACAACTGCT GCTTTGGTCA ATACCTCAGG AACGGCCGGC GCCGGTGGCG GCGCCGGCAC ACTCTCCACC AACACGCCGC CGCTGCCAGC AGGAGACTCC CCTCCCGAGC CGGCGTCCGT CCCACCGTTG TCCGCCACCG ACGCCGACCT GGCGGCCCTC ACCCAGGCCG GGCTGCAGAC CCGGCCTGCA CCCACGGCGG GCACCGAAGC CCTCCGAACC GAATCACTGA CGACGCAGGC CCTCGTCAGG GACGACGCGT CGTCCCAAAT CCTGGCGGTG TTCAAAGCCA TCAACAGCTA CCGGGCTTCG TTCGGACTGC CCGCCGTGAA GTACCACGCC ACAGTGGCCG CCATGGCCCA GGAGTGGTCC GACAGCATCG CGGCCCGGGA AGTGATCGAG CACCGCTCGA GCTTCTGGAC CGATTCGCGG GCGCTCAGCC CCACCAATGG CGCCGGCGAG GTCATTGCCG TCCGCTGGGA CCGCGACGCC GCCCAGCTTG TTGAGTGGTG GAAGGGCTCG CCCGCCCACA ATGCAATCCT GAAGGACCCG CGGTTCAATG TGATGGGGAT CGGGATCACC TTCACGGACG GCAACTGGCA GACCACGCCC AACCGCTACA CCATGTGGGG CGTGGTGGAC TTCTTCGGAT ACGGCACGCT GCCCGCGGGA ACCACCAGCA GCCCGGGCGG CAGCACTGAA ATGCCCGTAC AGCCCGCCAG CGTGTGCGAT CCGCTGGTGC GGCACATGCC GCCGTCGGCG GACCTTGCGG CGGCCGCGAT CAAGGGTCCC GGCGACCTCG TGTCGGTGAA CTCATCCGGG GAACTCATCA ACCGCCCGTC CCTGGGAAAC CGGCAATACG GCGCACAACA GGTCGTCGGG ACCGGATTCG GCTCCGCCAA GGAACTCTTT GTCACCGACT GGGACCGGGA CGGAGTCTTT GACATCCTGG TCCAGTGGAC TGATGGCAGG GTTACGCTGC ACGCCGGCTC GGTGGGCGGC GGATTCCTTC CAGGCGTGAC ACTGGGCCAG TCCGGGTGGG CGGGAATGAC CCTGGCGGTC GGGGGCTGGT GTGCCAACAA CCGCCTCCCG CAACTGGTGG CGCTGGACAC CTCCGGGAAC CTCTGGCTGT ACCCCAACCG GGGCAAAGCG GACCTTGTGC AGCGGACTCT GATGGCGTCC GGCGTTTCAG CCAACCGGCT GGCCATGGCG GATTACGACG GCGACGGCTT CCAGGACCTG TTGGCCCGGC AGTCGGACGG TTATGTCCGG CTCTTCCGCG GCTCGGGCGC GCCGGCACCG CGCGCCGAAA CCCGGGCTGT GGTGGCCAGC GGATGGTCGG ACGTTACAGC CATCCGTCCG CTGCGGGATG TCACGGGCCT GAACTCAACG GGACTGGCCC TCCGACGAGC CGGCGACGTG GTGCAGTATT GGGACCTCAG CACGGGCGCC TTGACGTCGC CGTCGTCCAT CCCCGGAACG TGGGCGGGAC AGCGCCTCGC GCAATAG
|
Protein sequence | MRKISARTLA AMICALALIT PAAGQLGGPS SSPKPDVSAV ADQQAQSVAL GPPSFVYDAG LGTADDTPGQ RLAPAVDPAV REGDPGGTTA ALVNTSGTAG AGGGAGTLST NTPPLPAGDS PPEPASVPPL SATDADLAAL TQAGLQTRPA PTAGTEALRT ESLTTQALVR DDASSQILAV FKAINSYRAS FGLPAVKYHA TVAAMAQEWS DSIAAREVIE HRSSFWTDSR ALSPTNGAGE VIAVRWDRDA AQLVEWWKGS PAHNAILKDP RFNVMGIGIT FTDGNWQTTP NRYTMWGVVD FFGYGTLPAG TTSSPGGSTE MPVQPASVCD PLVRHMPPSA DLAAAAIKGP GDLVSVNSSG ELINRPSLGN RQYGAQQVVG TGFGSAKELF VTDWDRDGVF DILVQWTDGR VTLHAGSVGG GFLPGVTLGQ SGWAGMTLAV GGWCANNRLP QLVALDTSGN LWLYPNRGKA DLVQRTLMAS GVSANRLAMA DYDGDGFQDL LARQSDGYVR LFRGSGAPAP RAETRAVVAS GWSDVTAIRP LRDVTGLNST GLALRRAGDV VQYWDLSTGA LTSPSSIPGT WAGQRLAQ
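
The gene sequence begins with TTG while the protein begins with M, which is consistent with translation table 11 allowing alternative start codons. The following is a minimal sketch of that translation, not the database's own pipeline: `translate_cds` and `ALT_STARTS` are hypothetical names, and the start set is restricted to the three most common bacterial start codons as a simplifying assumption (table 11 permits a few more).

```python
from itertools import product

# Amino acid assignments in T, C, A, G codon order; table 11 shares these
# with the standard code and differs in which codons may initiate translation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only the most common bacterial start codons are handled here.
ALT_STARTS = {"ATG", "GTG", "TTG"}


def translate_cds(dna):
    """Translate a CDS, reading a recognized start codon as M and stopping at the first stop."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    usable = len(seq) - len(seq) % 3    # ignore any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in ALT_STARTS:
            protein.append("M")         # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":                   # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate_cds("TTGAGGAAGA TTTCGGCACG GACACTGGCA"))  # prints: MRKISARTLA
```

Passing the full gene sequence to the same function should yield the listed protein sequence once the grouping spaces are stripped from both.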