Gene Information |
Locus tag | Arth_3574 |
Symbol | |
ID | 4443885 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 4015517 |
End bp | 4016797 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639691398 |
Product | type I phosphodiesterase/nucleotide pyrophosphatase |
Protein accession | YP_833049 |
Protein GI | 116672116 |
COG category | [R] General function prediction only |
COG ID | [COG1524] Uncharacterized proteins of the AP superfamily |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
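The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record itself. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming its last codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


# Values copied from the Start bp and End bp rows of this record.
length = gene_length_bp(4015517, 4016797)
print(length, protein_length_aa(length))
```

Under these assumptions the result agrees with the Gene Length (1281 bp) and Protein Length (426 aa) rows.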
| |
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCCGTT CAACACCCCC ACGCCCGACG CCGGGACGCC GGCAGTTCCT GCAGCTCGCC GCTGCCGGCG GTGCCGCCGT CGTTCTTTCC GCCGCACCAG GAACGGCGTG GGCCGCACCG ACCGCGGGAC GGACCCGGTG CTACGTGCTC GTGGTGGACG GCTGCCGCCC GGACGAGATC ACCCCCGCGC TGACGCCGCG GCTGGCTGAC CTGCGTGCAG CCGGCACCAA CTTCCCGGCG GCACGGTCCC TCCCGGTCAT GGAGACCATT CCCAACCACG TGATGATGAT GAGCGGCGTG CGACCGGACC GCTCTGGCGT TCCGGCCAAC GCGATCTACG ATAGGGCCGA GGGTGTGGTC CGCGACCTCG ACCGGTCCAC GGACCTGCAC TTCCCGACCA TTCTGGACCG CTTGCAGGAG CGCGGGCTCA CCACCGGCTC GGTGCTGAGC AAGAAGTACC TCTATGGCAT CTTCGGAGCC AGGGCAAGCT ACCGGTGGGA ACCGCAGCCG GTGCTTCCGG TAACCGGCCA TGCTCCCGAT GCCGCCACGA TGGACGCCCT GCTGGCGATG GCAGGCGGGC CGGATCCGGA CTTCGTGTTC ACAAACTTGG GCGACATTGA CCGCGTGGGC CACTCCGACC TTTCCGGCAC CACGCTGCGA GCCGCCCGGG AATCCGCACT GGCGGACACG GACCTGCAGG TGGGCCGCTT CATCGACCAT CTCAAAGGCA CGGGCAAGTG GGAGTCCAGT GTGGTGATGG TGCTCGCCGA CCACTCCATG GACTGGTCCA TCCCCACGAA CGTGGTTTCC GTCGACCTGG TCCTGCAGTC CCGTCCGGAG TTGCAGCACA ACGTCAGGAT CGCCCAGAAC GGCGGGGCTG ACCTGCTCTA CTGGACCGGT CCTGATGCAG AGCGTGCGGC CGGTATGGCT GCCGTCGAAC AGTTAGTCAG CGCCCATGAG GGAGTGCTGT CCGTCCATAA ACCGGTGGAC CTGCGGCTGG GGACCGAGGC CGGAGACCTC GTAGCCTACT GCCGCGCCGG CTGGCGTTTC TCCGACCCGT ATGTGGCTTC CAACCCGATC CCGGGAAACC ACGGACACCC CGCCACCGAA CCCATCCCCT TCTTCATCTC CGGCGGCAGC CCGCTGGTGG CACCCGGGAC GGTGTCCTCG GAGCATGCAA GGACTATCGA TGTTGCACCG ACCATCGGCA CCATTTACGG GCTCAAAGCC CCGGACGGCG GGTATGACGG AACTTCGCGG TCCGGCTCCC TGCGGCTCTG A
Protein sequence | MSRSTPPRPT PGRRQFLQLA AAGGAAVVLS AAPGTAWAAP TAGRTRCYVL VVDGCRPDEI TPALTPRLAD LRAAGTNFPA ARSLPVMETI PNHVMMMSGV RPDRSGVPAN AIYDRAEGVV RDLDRSTDLH FPTILDRLQE RGLTTGSVLS KKYLYGIFGA RASYRWEPQP VLPVTGHAPD AATMDALLAM AGGPDPDFVF TNLGDIDRVG HSDLSGTTLR AARESALADT DLQVGRFIDH LKGTGKWESS VVMVLADHSM DWSIPTNVVS VDLVLQSRPE LQHNVRIAQN GGADLLYWTG PDAERAAGMA AVEQLVSAHE GVLSVHKPVD LRLGTEAGDL VAYCRAGWRF SDPYVASNPI PGNHGHPATE PIPFFISGGS PLVAPGTVSS EHARTIDVAP TIGTIYGLKA PDGGYDGTSR SGSLRL
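The gene sequence begins with GTG while the protein sequence begins with M, which is consistent with translation table 11, where alternative start codons are read as methionine at the initiation position. The sketch below shows one way to reproduce the translation and a GC fraction from the space-separated sequence blocks. It is not the database's own method: the helper names are hypothetical, the codon table is the standard set of amino-acid assignments (which table 11 shares), and `START_CODONS` lists only a common subset of the initiation codons that table 11 permits.

```python
# Standard amino-acid assignments, in NCBI base order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: a common subset of table 11 start codons, not the full list.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading a recognised start codon as M and stopping at a stop codon."""
    dna = clean(seq)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residues.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence in this record.
print(translate_cds("GTGAGCCGTT CAACACCCCC ACGCCCGACG"))  # MSRSTPPRPT
```

Passing the full gene sequence to `translate_cds` and `gc_fraction` would allow a comparison against the Protein sequence and GC content rows.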