Gene Achl_3384 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAchl_3384 
Symbol 
ID7294865 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter chlorophenolicus A6 
KingdomBacteria 
Replicon accessionNC_011886 
Strand
Start bp3750276 
End bp3753212 
Gene Length2937 bp 
Protein Length978 aa 
Translation table11 
GC content68% 
IMG OID643591791 
Productconserved repeat domain protein 
Protein accessionYP_002489430 
Protein GI220914121 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4409] Neuraminidase (sialidase) 
TIGRFAM ID[TIGR01451] conserved repeat domain 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones92 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGCAGG CTGCGGGCAT AGGGGCCCTC GCCGTTGCCC TGCTCGCCGG AACCGGCCTG 
CCCGCCCAGG CAGCGCCCAT CCCGTCCACC AGCCCCACTG CTCCTCCCGG CGCTTTCCAG
GAAACCAACC TGGCCGCGGA CCGGACCGCC AACAACTTCT TCTACCGCAT ACCCGCCCTG
TCCTACCTGG GCAACGGCGT GGTCCTCGCC GCCTGGGACG GCAGGCCCGG CAGCGCAGCC
GATGCCCCCA ATCCCAACTC CATCGTGCAG CGCCGCAGCA CCGACGGCGG CCGGACCTGG
GGGCCGGTGC AGGTGATCGC CGCCGGGCAC GTGGGCGATG CCGCCGCTCC GAAGTACGGC
TACAGCGATC CTTCCTACGT TTACGACGCC GAGGCCGGGA AAGTCTTCGC ATTCTTTGTG
TACTCCAAGG ACCAAGGGTT CGGCGGCAGC CAGTTCGGCA ACGATGACGC GGACAGGAAC
GTCATTTCCT CAGCCGTGAT CGAATCGTCC GACGGCGGCA CCACCTGGAG CCAGCCCCGC
CTCATCACCG GCGTCACCAA GCCGGGCACC AGCAAGACCA ACCCCGTGGC CGGCGACGTC
CGCTCCAATT TCGCGTCCTC CGGCGAAGGC ATCCAGCTCA AGTACGGTCC CTACAAGGGA
CGTCTGATCC AGCAGTACGC CGGGGACATC CGCCAGGCGG ACGGCAGCAA CAGGATCCAG
GCCTACAGCG TCTACTCCGA CGACCACGGC GCCACGTGGC ACAAGGGCGC CAACGTGGGC
GACCGGATGG ACGAAAACAA GACGGTGGAA CTCTCCGACG GGCGCGTGCT GCTCAACTCG
CGGGACAACG CCAACCAGGG CTACCGCAAG GTGGCGGTGT CCACCGACGG CGGCGCCACC
TACGGCCCGG TCACGCAGGA CACCGAACTG CCGGACCCCG CCAACAACGG GGCCATCGCA
CGCATGTTCC CCAACGCCGC GCAGGGCACG GCCGATGCGA AGAAGCTGAT CTTCACCAAC
GCCAACTCCA AGACGGGGCG CGAGAACGTC TCGGCCCGCG TGTCCTGCGA CGACGGCGCC
ACCTGGCCCG GGGTCCGCAC CATCCGCTCC GGCTTCTCCG CCTATTCCAC CGTGACCCGG
CTGGATGAGG GCAGGTTGGG CGTCCTGTAC GAGGCCAATT ACACGGACAA CATGCCGTTC
GCCGCCTTCG ACGACGCGTG GCTGAACTAC GCCTGCGCGC CGCTGTCCGT TCCCGCCGTA
ACTACCGCGC CCGGCGCCAC CAAGCAGGTG CCCGTGACGG TCACCAACCA GGAGGCTGCC
ACGCTCTCCG GCGCCACCGT CACCGTCTAC ACCCCCAGCG GCTGGTCCGC CACCACGGTG
CCTGTTCCCG ATGTCGCACC CGGCGCCTCC GCCACCGTGA ATGTTGACCT CACCGCGCCG
GCGAACGCCA GCGGCCCACA GAACCTCAAC GCAGCGTTCA CGACGGCGGA CGGCCGGGTT
TCGCAGGCCG CCTTCACGGC CACGGTCCCC GTGGCACCGC AGGTGGGCCT CACCATCACC
GGAGCTGCGC CGGACCGTGA CGTCGCCGCA AGTCCCTACA AGGCGGGCGA CGTTGTCAGC
TACACGTTCA ACGTGAAGAG CACGGCCAAC GTCACCGCAA ACTCGGTGCC GCTGTCCGGT
ACGTTCGAGG CCGGCTTCCT TCCACCGGCC CCCGGGGCAC CAAGCAGCCC TAACTGCCGG
TACAACAACC TGGCTGCCGG TGCCAGCTAC ACCTGCACCA CGGCCAGGCA CACGGTGACG
GCCGAGGACA TGGCCCGCGG CTACTTCGTG CCGCAGGCCA CCTTCAGCAT CACGGCCAGC
GCGACGCCGT CGCTCACCAG GACGGTGGCG TTTACCGGAG CGGCAGTAGC CCTGCGCGAC
GGCCTGGTTT CTGCCACCAT CACGGGCAGC CGTAACGACG CCGGCCGGGA CCTCGCCGCG
CAGCCGTACA CCGCGGGGGA GGCCTTCCCC TACAAGTTCG ACGTCACCAA CACCAGCCCG
TTGGTGGAGA AGGTGGTGCC CACAGCGGGG AACTTCAGCC CGTTCGTGCC GGACGGCCCC
GGCAACTGCC GGTACAACGT CCTGCCCGCC GGACAGTCCT ACACGTGCGC CACACCGCGG
CACACCGTTA CCGCTGAGGA GGCAGCCCAG GGCTTCTTCA TCCCGGAAAC CAACTGGGAC
GTCAGCGCTG CTGGCCAGAC CATCCGGACC TACGGCGTGA ACGGCGGCGA GGTGGACCTG
AAGGTGCGCG ACGCCAGCCT GGACGGCACC ATCGCAGCTG AATGGACCGA CAAAGACGGC
GACCGGTACG CCTCCGCAGG CGACACCGTG ACGTACACCT ACACCGTGGG CAATGCAGGC
AACGTTCCGC TGACCGGGGT GGCTGCACCG GCCGCGGGCA TCAGCGAACC GTCCCTGGCC
GCCGGCAGCA GTGTCACCGC TACCCGGGAT TACGTGCTGA CGGCGGCGGA CATCACCGCC
GGGAAGCTGG ACGCAGTAAG TTTCAGCGCC ACCGGGGACA ACGGGACCAG GCGTGCCACC
GCTTCCGTGA GCGGCGGCGG GATTCAGCTT GAGCTCCAGC CCGCCCAGCC GGAGTCCGAA
CCCGCGCTGA CGGTCCAGGA CTTCGACGGC CAGACGCCGC CGTTCGACCT GAAGACGCAG
GACAAGTACC GGAACGGCCA GAAGGTGGTC CTCGAAGGGC TCGATTATGG GCAGTGGTAC
TACGTCTATC TCAACAAGCG CAGCCAACGG ATCGGCTGGT TGTTCCCCAC CACCGCCAAC
ACAGTGGAGT TCATCCTCCC CGCCGGGATC CAGAACGGCC GTGATGATGT GGTGGTCCTG
GACAAGGACG GAAAGCAGGT CTCGTTCGAC CGGCTGCAGG TGACCCCGAA GGGTTAG
 
Protein sequence
MGQAAGIGAL AVALLAGTGL PAQAAPIPST SPTAPPGAFQ ETNLAADRTA NNFFYRIPAL 
SYLGNGVVLA AWDGRPGSAA DAPNPNSIVQ RRSTDGGRTW GPVQVIAAGH VGDAAAPKYG
YSDPSYVYDA EAGKVFAFFV YSKDQGFGGS QFGNDDADRN VISSAVIESS DGGTTWSQPR
LITGVTKPGT SKTNPVAGDV RSNFASSGEG IQLKYGPYKG RLIQQYAGDI RQADGSNRIQ
AYSVYSDDHG ATWHKGANVG DRMDENKTVE LSDGRVLLNS RDNANQGYRK VAVSTDGGAT
YGPVTQDTEL PDPANNGAIA RMFPNAAQGT ADAKKLIFTN ANSKTGRENV SARVSCDDGA
TWPGVRTIRS GFSAYSTVTR LDEGRLGVLY EANYTDNMPF AAFDDAWLNY ACAPLSVPAV
TTAPGATKQV PVTVTNQEAA TLSGATVTVY TPSGWSATTV PVPDVAPGAS ATVNVDLTAP
ANASGPQNLN AAFTTADGRV SQAAFTATVP VAPQVGLTIT GAAPDRDVAA SPYKAGDVVS
YTFNVKSTAN VTANSVPLSG TFEAGFLPPA PGAPSSPNCR YNNLAAGASY TCTTARHTVT
AEDMARGYFV PQATFSITAS ATPSLTRTVA FTGAAVALRD GLVSATITGS RNDAGRDLAA
QPYTAGEAFP YKFDVTNTSP LVEKVVPTAG NFSPFVPDGP GNCRYNVLPA GQSYTCATPR
HTVTAEEAAQ GFFIPETNWD VSAAGQTIRT YGVNGGEVDL KVRDASLDGT IAAEWTDKDG
DRYASAGDTV TYTYTVGNAG NVPLTGVAAP AAGISEPSLA AGSSVTATRD YVLTAADITA
GKLDAVSFSA TGDNGTRRAT ASVSGGGIQL ELQPAQPESE PALTVQDFDG QTPPFDLKTQ
DKYRNGQKVV LEGLDYGQWY YVYLNKRSQR IGWLFPTTAN TVEFILPAGI QNGRDDVVVL
DKDGKQVSFD RLQVTPKG