Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Achl_3384 |
Symbol | |
ID | 7294865 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter chlorophenolicus A6 |
Kingdom | Bacteria |
Replicon accession | NC_011886 |
Strand | - |
Start bp | 3750276 |
End bp | 3753212 |
Gene Length | 2937 bp |
Protein Length | 978 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643591791 |
Product | conserved repeat domain protein |
Protein accession | YP_002489430 |
Protein GI | 220914121 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4409] Neuraminidase (sialidase) |
TIGRFAM ID | [TIGR01451] conserved repeat domain |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 92 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGCAGG CTGCGGGCAT AGGGGCCCTC GCCGTTGCCC TGCTCGCCGG AACCGGCCTG CCCGCCCAGG CAGCGCCCAT CCCGTCCACC AGCCCCACTG CTCCTCCCGG CGCTTTCCAG GAAACCAACC TGGCCGCGGA CCGGACCGCC AACAACTTCT TCTACCGCAT ACCCGCCCTG TCCTACCTGG GCAACGGCGT GGTCCTCGCC GCCTGGGACG GCAGGCCCGG CAGCGCAGCC GATGCCCCCA ATCCCAACTC CATCGTGCAG CGCCGCAGCA CCGACGGCGG CCGGACCTGG GGGCCGGTGC AGGTGATCGC CGCCGGGCAC GTGGGCGATG CCGCCGCTCC GAAGTACGGC TACAGCGATC CTTCCTACGT TTACGACGCC GAGGCCGGGA AAGTCTTCGC ATTCTTTGTG TACTCCAAGG ACCAAGGGTT CGGCGGCAGC CAGTTCGGCA ACGATGACGC GGACAGGAAC GTCATTTCCT CAGCCGTGAT CGAATCGTCC GACGGCGGCA CCACCTGGAG CCAGCCCCGC CTCATCACCG GCGTCACCAA GCCGGGCACC AGCAAGACCA ACCCCGTGGC CGGCGACGTC CGCTCCAATT TCGCGTCCTC CGGCGAAGGC ATCCAGCTCA AGTACGGTCC CTACAAGGGA CGTCTGATCC AGCAGTACGC CGGGGACATC CGCCAGGCGG ACGGCAGCAA CAGGATCCAG GCCTACAGCG TCTACTCCGA CGACCACGGC GCCACGTGGC ACAAGGGCGC CAACGTGGGC GACCGGATGG ACGAAAACAA GACGGTGGAA CTCTCCGACG GGCGCGTGCT GCTCAACTCG CGGGACAACG CCAACCAGGG CTACCGCAAG GTGGCGGTGT CCACCGACGG CGGCGCCACC TACGGCCCGG TCACGCAGGA CACCGAACTG CCGGACCCCG CCAACAACGG GGCCATCGCA CGCATGTTCC CCAACGCCGC GCAGGGCACG GCCGATGCGA AGAAGCTGAT CTTCACCAAC GCCAACTCCA AGACGGGGCG CGAGAACGTC TCGGCCCGCG TGTCCTGCGA CGACGGCGCC ACCTGGCCCG GGGTCCGCAC CATCCGCTCC GGCTTCTCCG CCTATTCCAC CGTGACCCGG CTGGATGAGG GCAGGTTGGG CGTCCTGTAC GAGGCCAATT ACACGGACAA CATGCCGTTC GCCGCCTTCG ACGACGCGTG GCTGAACTAC GCCTGCGCGC CGCTGTCCGT TCCCGCCGTA ACTACCGCGC CCGGCGCCAC CAAGCAGGTG CCCGTGACGG TCACCAACCA GGAGGCTGCC ACGCTCTCCG GCGCCACCGT CACCGTCTAC ACCCCCAGCG GCTGGTCCGC CACCACGGTG CCTGTTCCCG ATGTCGCACC CGGCGCCTCC GCCACCGTGA ATGTTGACCT CACCGCGCCG GCGAACGCCA GCGGCCCACA GAACCTCAAC GCAGCGTTCA CGACGGCGGA CGGCCGGGTT TCGCAGGCCG CCTTCACGGC CACGGTCCCC GTGGCACCGC AGGTGGGCCT CACCATCACC GGAGCTGCGC CGGACCGTGA CGTCGCCGCA AGTCCCTACA AGGCGGGCGA CGTTGTCAGC TACACGTTCA ACGTGAAGAG CACGGCCAAC GTCACCGCAA ACTCGGTGCC GCTGTCCGGT ACGTTCGAGG CCGGCTTCCT TCCACCGGCC CCCGGGGCAC CAAGCAGCCC TAACTGCCGG TACAACAACC TGGCTGCCGG TGCCAGCTAC ACCTGCACCA CGGCCAGGCA CACGGTGACG GCCGAGGACA TGGCCCGCGG CTACTTCGTG CCGCAGGCCA CCTTCAGCAT CACGGCCAGC GCGACGCCGT CGCTCACCAG GACGGTGGCG TTTACCGGAG CGGCAGTAGC CCTGCGCGAC GGCCTGGTTT CTGCCACCAT CACGGGCAGC CGTAACGACG CCGGCCGGGA CCTCGCCGCG CAGCCGTACA CCGCGGGGGA GGCCTTCCCC TACAAGTTCG ACGTCACCAA CACCAGCCCG TTGGTGGAGA AGGTGGTGCC CACAGCGGGG AACTTCAGCC CGTTCGTGCC GGACGGCCCC GGCAACTGCC GGTACAACGT CCTGCCCGCC GGACAGTCCT ACACGTGCGC CACACCGCGG CACACCGTTA CCGCTGAGGA GGCAGCCCAG GGCTTCTTCA TCCCGGAAAC CAACTGGGAC GTCAGCGCTG CTGGCCAGAC CATCCGGACC TACGGCGTGA ACGGCGGCGA GGTGGACCTG AAGGTGCGCG ACGCCAGCCT GGACGGCACC ATCGCAGCTG AATGGACCGA CAAAGACGGC GACCGGTACG CCTCCGCAGG CGACACCGTG ACGTACACCT ACACCGTGGG CAATGCAGGC AACGTTCCGC TGACCGGGGT GGCTGCACCG GCCGCGGGCA TCAGCGAACC GTCCCTGGCC GCCGGCAGCA GTGTCACCGC TACCCGGGAT TACGTGCTGA CGGCGGCGGA CATCACCGCC GGGAAGCTGG ACGCAGTAAG TTTCAGCGCC ACCGGGGACA ACGGGACCAG GCGTGCCACC GCTTCCGTGA GCGGCGGCGG GATTCAGCTT GAGCTCCAGC CCGCCCAGCC GGAGTCCGAA CCCGCGCTGA CGGTCCAGGA CTTCGACGGC CAGACGCCGC CGTTCGACCT GAAGACGCAG GACAAGTACC GGAACGGCCA GAAGGTGGTC CTCGAAGGGC TCGATTATGG GCAGTGGTAC TACGTCTATC TCAACAAGCG CAGCCAACGG ATCGGCTGGT TGTTCCCCAC CACCGCCAAC ACAGTGGAGT TCATCCTCCC CGCCGGGATC CAGAACGGCC GTGATGATGT GGTGGTCCTG GACAAGGACG GAAAGCAGGT CTCGTTCGAC CGGCTGCAGG TGACCCCGAA GGGTTAG
|
Protein sequence | MGQAAGIGAL AVALLAGTGL PAQAAPIPST SPTAPPGAFQ ETNLAADRTA NNFFYRIPAL SYLGNGVVLA AWDGRPGSAA DAPNPNSIVQ RRSTDGGRTW GPVQVIAAGH VGDAAAPKYG YSDPSYVYDA EAGKVFAFFV YSKDQGFGGS QFGNDDADRN VISSAVIESS DGGTTWSQPR LITGVTKPGT SKTNPVAGDV RSNFASSGEG IQLKYGPYKG RLIQQYAGDI RQADGSNRIQ AYSVYSDDHG ATWHKGANVG DRMDENKTVE LSDGRVLLNS RDNANQGYRK VAVSTDGGAT YGPVTQDTEL PDPANNGAIA RMFPNAAQGT ADAKKLIFTN ANSKTGRENV SARVSCDDGA TWPGVRTIRS GFSAYSTVTR LDEGRLGVLY EANYTDNMPF AAFDDAWLNY ACAPLSVPAV TTAPGATKQV PVTVTNQEAA TLSGATVTVY TPSGWSATTV PVPDVAPGAS ATVNVDLTAP ANASGPQNLN AAFTTADGRV SQAAFTATVP VAPQVGLTIT GAAPDRDVAA SPYKAGDVVS YTFNVKSTAN VTANSVPLSG TFEAGFLPPA PGAPSSPNCR YNNLAAGASY TCTTARHTVT AEDMARGYFV PQATFSITAS ATPSLTRTVA FTGAAVALRD GLVSATITGS RNDAGRDLAA QPYTAGEAFP YKFDVTNTSP LVEKVVPTAG NFSPFVPDGP GNCRYNVLPA GQSYTCATPR HTVTAEEAAQ GFFIPETNWD VSAAGQTIRT YGVNGGEVDL KVRDASLDGT IAAEWTDKDG DRYASAGDTV TYTYTVGNAG NVPLTGVAAP AAGISEPSLA AGSSVTATRD YVLTAADITA GKLDAVSFSA TGDNGTRRAT ASVSGGGIQL ELQPAQPESE PALTVQDFDG QTPPFDLKTQ DKYRNGQKVV LEGLDYGQWY YVYLNKRSQR IGWLFPTTAN TVEFILPAGI QNGRDDVVVL DKDGKQVSFD RLQVTPKG
|
| |