Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_1199 |
Symbol | |
ID | 4446295 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 1301558 |
End bp | 1302847 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639689006 |
Product | ErfK/YbiS/YcfS/YnhG family protein |
Protein accession | YP_830693 |
Protein GI | 116669760 |
COG category | [S] Function unknown |
COG ID | [COG1376] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.134924 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAACTCCA AGGTGGGTGG CCTAGGCTGG AGCCATGATG ACTGCAATGG TCCCCGGGCC ATGACGGCAC GGGAACACCG GACCGCCACC AGGAAGATGC TCCTGGTGGG AGCGCTCTGC CTGCTGGCGG CGGGCGGGGG AGTATTCGCC GCCGTCGCGC CCGGTTCCGG CCGCGCCGCC TTTGAATCCG AGGCCGGCTC CGCTGAACGG TCGGCGCCGG GACTGGCTGC ACCGGTGGTG GTGCCGGTGC AGCTTGCGGC TTCCCCGGCA GACGGCGCAC AGCAGGTGAA TCCGGCAGCG CCGGTCTCGC TCCGCGTCAC TAACGGAACC ATTGAACGCG TGGCCCTGAC GGACGGCAAC GGGGAAGCCG TCGAGGGGAG CCTCAGCCCG GATGGAACCG GCTGGTCAGC CGCCGGCCCC CTGCAGTTCA ACAGCAGCTA CACCTACAAC TTCGTGGTGA CCGACGCTGC AGGCCGCAAG ACCAACGACA CACGCACATT CACCACGGTG ACAACGGCCA ACGAAGCCGA TGCCGCTATC TACCCGCTGG ACGGCATGAA GGTTGGCGTA GCGCAGCCGC TGCAGATCAC GTTCAGCGAA CCCGTGCTGA ACAAGGATGC AGTGGAGAAA GCCATCAAGG TCTCGTCCAC GTCCGGCCAG GCGGGCGCAT TCCACTGGTT CAGCGACAGC ATGGTGCGGT ACCGGGCCGA GAACTTCTGG GCTGCCAACA GCACGGTGAC CATGGACATG CAGCTGTTCG GCGTCGATTT CGGCAACGGG CAGATCGGCA ACTTCGACAA GAAGGTCACT GTCCACGTCG GCGACAAGAA GGTCGCTGTA GCCGACGCGA CGGCGCACAC CTTTTCGGTG AGCGTTAATG ACAAACCGGC AGGCACATGG CCCGCCACCA TGGGTGACAC CCGGTTCCCG TCGGCCCGGG GCTTTCTTGT GTTCATGGAG AAGTACCGCG TGGAGCACAT GTCTGCCTCC AGCATCGGGC TGAAGCCTGA CGACCCGGCA TACTACGGCG AGCTGGACGT TAACTACGCC ACGCGCCTCA CGCCCAGCGG AGAGTTCATC CACCAGGCCA CGGACTCCGC GATGCCCTAC GTCGGTGTGG CCAACCTGTC GCACGGTTGC ATCGGGCTCG GCCCGGACGG AGCCAAATGG GTCTTCGACA ACATGACCAC CGGGGATGTC GTCCAGGTGG TCAACACCGA GGGCGAGAAT GCCAACTTTG ACGACGGATT CGGCGACTGG AACATCCCCT GGGCCCAGTA CGCCAATTGA
|
Protein sequence | MNSKVGGLGW SHDDCNGPRA MTAREHRTAT RKMLLVGALC LLAAGGGVFA AVAPGSGRAA FESEAGSAER SAPGLAAPVV VPVQLAASPA DGAQQVNPAA PVSLRVTNGT IERVALTDGN GEAVEGSLSP DGTGWSAAGP LQFNSSYTYN FVVTDAAGRK TNDTRTFTTV TTANEADAAI YPLDGMKVGV AQPLQITFSE PVLNKDAVEK AIKVSSTSGQ AGAFHWFSDS MVRYRAENFW AANSTVTMDM QLFGVDFGNG QIGNFDKKVT VHVGDKKVAV ADATAHTFSV SVNDKPAGTW PATMGDTRFP SARGFLVFME KYRVEHMSAS SIGLKPDDPA YYGELDVNYA TRLTPSGEFI HQATDSAMPY VGVANLSHGC IGLGPDGAKW VFDNMTTGDV VQVVNTEGEN ANFDDGFGDW NIPWAQYAN
|
| |