Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_0677 |
Symbol | |
ID | 4446810 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 727174 |
End bp | 731751 |
Gene Length | 4578 bp |
Protein Length | 1525 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639688478 |
Product | 5'-nucleotidase domain-containing protein |
Protein accession | YP_830176 |
Protein GI | 116669243 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases [COG2374] Predicted extracellular nuclease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.608053 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACGCA CACCATGGAA AATTGCGCTG GGCACGGCGT TGTCAGCGGG GCTCATCGCA GCTCCCCTGG CTTCCGTCCC GGCGTTTGCA GTGGAGGTCT CACCGGCCGC TGCAGGCACC TCTCCCGTCG TCATCAACGA GGCTTACCTC AGCGGCGGAA GCGCGGGAGC AGCCTACAAG AACAAGTTCG TAGAGCTGTA CAACACGTCT GACACGCCGG TAACCCTGGA CGGCTGGTCC CTGCAGTACC GCGCCTCCGG GGGCACCACC GCACCGTCCG CCACGGCTCC CCTCGCCGGC ACCATCCCGG CCAAGGGGTA CTACCTGCTC AAGGGCGGCA GCAACGGCAC CGTCGGACTG GATCTGCCCG CCGCCGATGT GACCGCCACA GGCTTCAACC CGGCAGGCGC GGGCGGCACG ATCGTCCTGG CCAAGCAGTC CACAACCCTG AACCCGCTGG CCACCGGCTC GGTAATCGAG CCTGCCAACG TTGCGGACCT GCTGGGCTAC GGGACATCCA ACACATTCGA GACCAAGGCC GCTGCGGCCC CGGGAAGCAA CACCGACGTC AAGAGCCTGA ACCGCAGCGG CGGCCTGGAC AGCAACAACA ACTCCGCCGA TTTCACGCTT AACGCCGCCA TTACCCCGAC ACCCGCCGGT GGCAGCGCGG ATCCGGTGGA TCCCGATCCC GTGGATCCCC CGACGGCACC CGCCACCAGG ACAATCGCCG AAATCCAGGG ATCCGGCACG GCCAGCCAGT TCGTCGGCAC CTCCGTCACC ACCCGAGGCA AGGTCACCGC GGCGTACCCC ACCGGCGGCT TCGCCGGCTT CTACCTCCAG ACCCCCGGAA CCGGCGGCGA CCTGACGCCG GCCAACCACA CGGCGTCGGA CGCCATCTTC GTCTACTCGC CCGCAACGGT AGGCTCCGTC GCGATCGGTG ACTACGTCGA AATCACCGGG GCCGTTGCCG AGTTCTACGG GATGACCCAG GTGAACGTGG CGGATGCTGC CGGCCTGAAG AAGCTCAACG AGGCCGCCCC GGAAGTGAAG TCCACCGGCT TCGCCCTCCC GGCGGACGAA GCCTTCCGCG AGTCCCTCGA AGGCATGCTG CTGACACCGC AGGGACCTGT GACGGTCACC GACAACTACT CCCTGAACCA GTACGGCGAA ATCGGCCTTG CCGGCGGGAC GACCCCGCTC GAGCAGCCCA CCGCCGTCGC CCCTTACGGC TCTGCCGAAT ACACGGCAAC CGTGGCGGCC AACGCCGCCC GCGGCATCAA GCTCGACGAC GGTGCCACTA CCAATTTCCT GAAAGATGCC ACCACCAAGG CCCAGGTCCT GCCGTACCTG ACCGCCGCGG AGCCCGTCCG CGTCGGCTCC CCCGTGACGT TCCAGACCGA CGTGGTGCTC AGCTACGCCA ACAACTCCTG GAAGTTCCAG CCGCTGACCC ACCTGACGCC GGAAAACGCC AGCATCATCC AGCCCGCAAC CTTCGGCGCA ACCCGTGCCG AAGCACCCGC CGCCGTCGGA GGCACCCTGA AGATTGCCTC CTTCAACGTG CTCAACTACT TCCCCACCAC CGGTGACATG CTGGCCGGCT GCACCTTCTA CACCGACCGC GACGGCAACC CCATCACCGT CCGCGGCGGC TGCGACGCCC GCGGTGCAGC CAACGCCGAG AACCTCAAGC GCCAGCAGGA CAAGATCGTG GCGGCCATCG GCAAGTCCGG CGCCGACGTC GTCTCCCTGA TGGAGGTTGA GAATTCGGCT CAGTTCGGCA AGGACCGCGA CGATGCCCTG GCCAAGCTGG TGGAAGCCCT GAACATTCCC ACCCCGGGAA TCTGGGACTA CGTCCGCACG CCCGCCAACG CTCCGCCGCT GGCTGACGAG GACATGATCC GCACAGCGTT CATCTACAAG AAGGCAGCTG CGGAACCGGT GGGCGAATCC GTCATCCACA ACGACACCGT GGCTTTCGCC AGTGCCCGCA AGCCGCTCGC CCAGGTGTTC AAGCCGGTTG GCGCGTCCGA TGACAAGAAG TTCATCGCCA TCGCCAACCA CTTCAAGTCC AAGGGCTCGG CCGCAACTCC TGAAGACACC GACAAGGGCC AGGGCGCCTC GAACCTTGCC CGCACCGAGC AGGCCAAGTC GCTCCTCGCA TTCTCGAACG ACCTCCAGGC CTCAAAGGGC ACGGACAAGG TCTTCCTGAT GGGCGACTTC AACGCCTACG CCAAGGAAGA CCCCATCAAC GTCCTCACGG CCGCCGGCTA CATCAACCAG GACGAAAAAG CCCGGAATGC CGACGGGTCA GCCAAGCACT CCTACCTGTT CGGCGGCCTG GTGGGTTCCC TGGACCACGT CCTCGCCACG CCGGGTGCGG ACTCGGTGGT CACCGGTGCC GACATCTGGA ACATCAACTC CGTGGAGTCC GTGGCGCTGG AGTACAGCCG GTATAACAGC AACGTGACCA ACTACTATGC GCCGGATCAG TTCCGGGCCA GCGACCACGA TCCCGTGGTG GTGGGCCTTG ATCTGCCGGC CGTACCTGTC CTGCCGCCGA GCGTTGACCT GAACTTCCTG GGCATCAACG ATTTCCACGG CCGCATCGAC TCCAACACGG TCCTGTTCGC CGCCACCATC GAAAAGCTCC GGGAGGCGGC CGCCCCCGGC GCCACGGCCT TCCTGTCTGC AGGCGACAAC ATCGGGGCCT CGTTGTTCGC GTCCGCCGTC GCCAAGGACC AGCCCACCAT CGACGTGCTG AACTCCCTGG AACTGCGCAC GTCCGCCGTG GGCAACCACG AGTTCGACGG CGGCTGGGCG GACCTCCGCG ACCGCGTCAT CGCCGGCGGG ACGAATGCCA GTTTCCCGTA CCTCGGTGCC AACGTGTACA AGAAGGGCAC CACCGAGCCG GCCCTGCCTG AATACACGGT GCTGGAGTTG AACGGCGTCA AGGTTGCGGT GATCGGCACT GTCACCCAGG AAGTGCCGTC GCTGGTCACC CCGGCAGGCA TCACCGACCT TGAATTCGGC GATCCGGTGG ACGCGATCAA CCGCGTTGCC GCGAAGATCA CCGCCGAAAA GCTCGCCGAC GTCATCATCG TGGAAGATCA CGACGGCGCC GGATCCGGCA CCCCGGACGG CTCCACTTTG GAGCAGGAAG TCGCTGCCGG CGGTCCTTTC GCCAGGCTGG TCAACGAAAC CTCTCCCGAG GTGGACGCCA TCTTCACCGG TCACACCCAC AAGCAGTACG CCTGGGACGC TCCGGTGCTC GATGCCAACG GACAGCCGAC CGGCAAAACA CGCCCCATCG TGCAGACCGG CAGTTACGGC GAGTTCATCG GCCAGATCCA GCTGACAGTC GACACTGCCA CCATGCAGGT ATCCGGCTAC AAGGCCGGCA ACGTCAAGCG CACCGTTCCC ACCACCACCG AGACGGCCGC TGACCTCGTG GCCAGGTACC CGCGGGTGGC GGCAGTCAAG ACGGTTGTCG ACAAGGCGTT GGCAGACGCC GCGGTGATCG GCAACCAGCC GGTCGGAAAG GTCACTGCCG ATATAACCAC CGCCTTCACG CCCGCCACGG CAACCAGTCC GGCGTCCCGT GATGACCGCG CGAACGAATC CACCCTTGGC AACCTCGTGG CCGATTCACT CGTGGACGCC CTCAAGGCAC CGGACCTCGG CACCGCCGAA ATCGGCGTCG TGAACCCCGG CGGACTGCGC AACGAGCTGT ATTACGCGCC GGACGGGACC ATCACCTACG CGGAGGCGAA CGCCGTACTC CCGTTCGTGA ACAACCTCTG GACCACGTCC CTGACCGGGG CACAGTTCAA GACGCTCCTG GAACAGCAGT GGCAGACCAA CCCGGACGGC ACAGTCCCGA GCCGCGCCTA CCAGCAGCTG GGGTTGTCCA AGAACGTCAA CTACACCTAC GACGCCGGAC GCGCCGCAGG GGACCGCATT ACGTCCATCC GGGTCAACGG CTCACTCATC GAGCCGGCCA AGTCCTACCG GATCGGCACG TTCAGCTTCC TGGCAACCGG CGGCGACAAC TTCCGGATCT TCAAGGAAGG TACCGGCACC AAGGACTCGG GCCTTGTGGA CCGGGATGCC TGGATCAAGT ACCTGCAGGA ACACAACCCG GTATCGCCGG ACTTTGCCCG CCGCACCGTG GCTGTCGTGA ACACCACGGC CGCCGAGGTC AAGGGCGGGG ACTCCATCAC GCTGGCGGTT TCCAAGCTGG ACCTCACCTC CCTGGGCAGC CCGGTGAACA CCTCGCTGGC GGCCTCTTTC ACGGACCCTG CGGGCACGGT TACCCAGCTC GGCACCGTCC CGGTGTCCGG CGGAGCCGCG GCAGTGGACG TGAAAGTCCC GGCCGGCGCG GCTGCCGGCA CCGGCACCCT GGTGCTCACC GCTGCCGAGT CCGGGACCGT GGTCAAGGCC GCAGTTCAGA TTGCCGACAG CGGCCCGGTG CCGCCGGTCT GCACAGCACC TGTACCGCCC ACCAAGTGGT ACGACTTTGC AGGCTGGATC AAGTACGGCC TGGCCTGGAT CCAGTACCAG AAGTGCCTGA AAGGCTAG
|
Protein sequence | MKRTPWKIAL GTALSAGLIA APLASVPAFA VEVSPAAAGT SPVVINEAYL SGGSAGAAYK NKFVELYNTS DTPVTLDGWS LQYRASGGTT APSATAPLAG TIPAKGYYLL KGGSNGTVGL DLPAADVTAT GFNPAGAGGT IVLAKQSTTL NPLATGSVIE PANVADLLGY GTSNTFETKA AAAPGSNTDV KSLNRSGGLD SNNNSADFTL NAAITPTPAG GSADPVDPDP VDPPTAPATR TIAEIQGSGT ASQFVGTSVT TRGKVTAAYP TGGFAGFYLQ TPGTGGDLTP ANHTASDAIF VYSPATVGSV AIGDYVEITG AVAEFYGMTQ VNVADAAGLK KLNEAAPEVK STGFALPADE AFRESLEGML LTPQGPVTVT DNYSLNQYGE IGLAGGTTPL EQPTAVAPYG SAEYTATVAA NAARGIKLDD GATTNFLKDA TTKAQVLPYL TAAEPVRVGS PVTFQTDVVL SYANNSWKFQ PLTHLTPENA SIIQPATFGA TRAEAPAAVG GTLKIASFNV LNYFPTTGDM LAGCTFYTDR DGNPITVRGG CDARGAANAE NLKRQQDKIV AAIGKSGADV VSLMEVENSA QFGKDRDDAL AKLVEALNIP TPGIWDYVRT PANAPPLADE DMIRTAFIYK KAAAEPVGES VIHNDTVAFA SARKPLAQVF KPVGASDDKK FIAIANHFKS KGSAATPEDT DKGQGASNLA RTEQAKSLLA FSNDLQASKG TDKVFLMGDF NAYAKEDPIN VLTAAGYINQ DEKARNADGS AKHSYLFGGL VGSLDHVLAT PGADSVVTGA DIWNINSVES VALEYSRYNS NVTNYYAPDQ FRASDHDPVV VGLDLPAVPV LPPSVDLNFL GINDFHGRID SNTVLFAATI EKLREAAAPG ATAFLSAGDN IGASLFASAV AKDQPTIDVL NSLELRTSAV GNHEFDGGWA DLRDRVIAGG TNASFPYLGA NVYKKGTTEP ALPEYTVLEL NGVKVAVIGT VTQEVPSLVT PAGITDLEFG DPVDAINRVA AKITAEKLAD VIIVEDHDGA GSGTPDGSTL EQEVAAGGPF ARLVNETSPE VDAIFTGHTH KQYAWDAPVL DANGQPTGKT RPIVQTGSYG EFIGQIQLTV DTATMQVSGY KAGNVKRTVP TTTETAADLV ARYPRVAAVK TVVDKALADA AVIGNQPVGK VTADITTAFT PATATSPASR DDRANESTLG NLVADSLVDA LKAPDLGTAE IGVVNPGGLR NELYYAPDGT ITYAEANAVL PFVNNLWTTS LTGAQFKTLL EQQWQTNPDG TVPSRAYQQL GLSKNVNYTY DAGRAAGDRI TSIRVNGSLI EPAKSYRIGT FSFLATGGDN FRIFKEGTGT KDSGLVDRDA WIKYLQEHNP VSPDFARRTV AVVNTTAAEV KGGDSITLAV SKLDLTSLGS PVNTSLAASF TDPAGTVTQL GTVPVSGGAA AVDVKVPAGA AAGTGTLVLT AAESGTVVKA AVQIADSGPV PPVCTAPVPP TKWYDFAGWI KYGLAWIQYQ KCLKG
|
| |