Gene Information |
Locus tag | Achl_4396 |
Symbol | |
ID | 7279728 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter chlorophenolicus A6 |
Kingdom | Bacteria |
Replicon accession | NC_011879 |
Strand | + |
Start bp | 331747 |
End bp | 334899 |
Gene Length | 3153 bp |
Protein Length | 1050 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643580350 |
Product | hypothetical protein |
Protein accession | YP_002478164 |
Protein GI | 219883000 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
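The length fields above are consistent with the coordinates. A minimal sketch of that arithmetic follows, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
# Coordinates copied from the record above.
# Assumption: positions are 1-based and inclusive.
START_BP = 331747
END_BP = 334899


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature spanning start..end, both ends included."""
    return end - start + 1


def protein_length(length_bp: int) -> int:
    """Residue count for a CDS of length_bp.

    Assumption: the final codon is a stop codon and is not counted.
    """
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))  # prints "3153 1050"
```

Both values agree with the Gene Length and Protein Length fields in the record.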
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 87 |
Fosmid unclonability p-value | 0.360797 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAGAGA TCTCAAACCA AACCGATCAG GACACCGGCG TGCAGGCAGA CGGGGGCGCG ACGCAGCCCT CCGGTCCGGC CGCAGCCGAC ACCAATGACC TCACCAGCCT CAACGTGCTG GATCCCAACT TCACCGGCCT GCCACCGTGG GAGGCGTCGG ACTACGCCTC CAGCATCACC GGCTCCGGTG CCTTCAAGCT GATGGACTCC GGCGTCGCGC CGCTGGTCGC GGCCGCCCGC GGCTACACCC GCATCGACAA GACCAACTAC ACCGAGATGG CCCGCAAGAT GCAGGTGGTC GGCAACTCCG GCCAGGGCAA GCGGTTCAAG CGGACCCTCT CGGCGCCCGG CAAAGACGGC ATGGTCATGC CGTGGTACTC CCTGGCCGGC ATCATGAAGG CCAAGCGCGA GGGTGAGCGC CCCGTTCCCC ACACCTACCA GGTCCGCCCC GAGTTCCCGG AGAACAACGA GGCCGGAAAG CCGCTGAAAT ACGAGTTCGT TTCCAACGTC GGCACCCCGC TGGACCTGCA CCCGGCCATC CCGACCGACT GGATCGACAC CACCCCGGTT GTCATGTTCG CCGAAGGGAT GCTCAAGGGC GACTCTGCCC TGTCCGCCTA CCTGGTCCAC AACGGCATCA CCTACTCGGA GCTCCGCTCT GAAGGGGTCG AGAACCCGGT CGCGAAGCTG CGCGAGCTCC TCGAACGCAT CCCCCACGAT GAGCGCGTCC TGATCGTTTC CATCGGCGGC GTCTACAACG CCTCCGGCAA CGCGGTGGAC TGGCGCGAAA TCACCCTGAA GGACCGCATC GGCTGGATCG CGTTCGACGC CGACATTGCG ATCAACCCCG CAGTGCACGC CGCCGCGAAC AAGCTCTCGA CACAGCTGTT CGAGAAGTCC AAGATGGCCG AGGTCCTGTA CCTGAACCCT CAGGTCACCA CCGGTGATGA CGGGTCCACC ACCAAGGACG GTGTGGATGA CTTCCTGGCC AAGCGGGGCA ACTGGGCCGC CCTGATCAAC CAGCTCGACG ACGTCATGCC TGATGCTCCG GCCAAAAACG CCGAAGACAA GGCCGGCAAC TGGCGTGTCG GCAAGACAGG GCAGTTCGTC GAGGAGTGCG TGGCGCTGAA CACCGGCGCC GGCGGCACCA TCGGTGAATA CCGGTGGGAG AAGCGCGTCG AAATCGGCGG CCGCGTCGTG GCCCTGGAAG CCCGCCGCCA GCCCACCGAC CAGGAACTGA AGACCGGCAT CTTCAACCCG AACGTCAAGT ACGAGGACAT CGACGACGCC CAGGTCGAAA TCGAAGTGTC CTGGCACCTG AACGGCCGCG ACTTCACCGC CACCGTCACC GGCCCCGAAT CCATCCTGGG CTACCGGCCC GAGGAATGGG ACCGGAAGAA AGCCTCCGTG CCCCGCGAAC TGCTGCTGCA CCCGGAATGG CCCCCGCGCG GAGCCAAGGG CGAATCCTGG CTGTCGGCGA TCAAGGCCAA CCGGCCCGAG GATGTGGCGT TCAAGACCCG CTGGATGCAG ATGGGCTGGG TCCCCGTCCA GGACGGCGAC CCGGTATTCC TCATCGGCGA CCAGATTGTC GGGGACGTCG AAGCCAACAC CTCCGTCCTG CCCGGGATCA GCTCCCGGGA GATCGACGTC ATCCAGAAAT ACGGCGTCGG AGAGAACATC AGCGGCGACT TCGATGACGA TGACTTCCGT GCTGAAGCCA GGAAGAACTT CCGGGCCATC ATGGACACCT ACATCCACTC CAAGGCCTGG ACTGACCTGT CCACCGCCAC GGTGGTCCTG 
GCCGGCGCGC TGCGCCCGGT CATCCCGATC CGCCCCAAGG CGACCATCTA CGTGTGGGGA CCCAAGGGCA AGGGCAAGTC CTGGACTGCC GAGTGCATGA TGCACTTCTG GGCCCGGCAC CGCGGTGACT GGCACGAAGA GCTGCCGGGT TCGGCCAAGG ACACGGCCGC GGCCATGGAG AACGCCGTCG CGCGCACCCC CATCTGGGTG ATCGATGACC TGGCACCCTC CGCGGTCCGC CGCCAGGCGG AGCAGGAAGA CGCCAAACTG GCCGACATCA CCCGCGCCAT CTTCAACAAC GCCACGAAGC TGCGCATGAA CGCTGACATG ACGGTGAAGA AGTCCAACAA GCCGATGACC CAGCTGATCA TGACCGCCGA GAACGAACTC AGCACCCCCT CGGCGAAGGA ACGGCTCATC CCGCTCTACG TCGGCAAGGG CAAGCTGAAC CCGGAGAAGG CGCCCACGGA CGCCCTGAAC GACATTGCCC GGGACAAGGG CACGCCGGCC CGCTTCACGG CCCAGGTCAT CAAGTACATC CGCTACGCTG CAGCGACCCA CCCGGGCGGC TGGGCCGCGT ACGTGACCTC CATGGAGGCT GCCCGCGCCC GCCTGAAGGA ACAGATCAGC GGCCTGATGA AGAAGATGGG TGCGGCCTCC GGGTCCCTGG AACGCACCTC GACCCTCGCA GCGGACGTCA TGCTGACGTT CTACCTGCTG GAGAACCTGG CCCGTGAACT GGACATGGGT GCCGACTTTG CCAAGCAGTT CCGCGTCAAC GCCGAGATGT CCATGTCCGT GATCCAGCTG GTCACCAACG CCCACGCCGA AAACCAGCAG GCCGCCCCCG GCATCTCCCT GGTCAAGGCA CTGGCAGCCC TGCTCGCCTC CGGCCACGCC CACGTGATCT CAGGGGACGA CCCGGCACGG CCCCCGATCG AAGGCACTGA GCAGAACGAA GCCATGGTCA ACAGCCGGCT CGGCTGGGTG GTCGGCGGCG GCGATGGCAG CCTGCGGCCG GCAGGAACGA ACATCGGCAC CGTCGTTACC GTGATGGACA ACAACCGGGA GCCGCAGAAG GTCGTGCTGT TCTCCTCGGA GACGGCCTTC AGCGCCGCGC AGAAGGCATA CCCCTCCCTG ATCCAGTTCG GGCAGGGGCC GACGGCAGCC TGGTCCAGCG TGTGGGATGA GAACCTGAAC CCGCCCTACA TCACCCGGAT CAAGAACGGC CGCGGAACGC TGCTGGCGCC CTGGCGGAAG GGCAAGATCA CCGGCATCCC CATCGCTGTC TCGCGACTGA TCGACGGCGG CCTGTCCATC GAAGACCTGC CCGAGGCAAG CAACGAGGAC TGA
|
Protein sequence | MEEISNQTDQ DTGVQADGGA TQPSGPAAAD TNDLTSLNVL DPNFTGLPPW EASDYASSIT GSGAFKLMDS GVAPLVAAAR GYTRIDKTNY TEMARKMQVV GNSGQGKRFK RTLSAPGKDG MVMPWYSLAG IMKAKREGER PVPHTYQVRP EFPENNEAGK PLKYEFVSNV GTPLDLHPAI PTDWIDTTPV VMFAEGMLKG DSALSAYLVH NGITYSELRS EGVENPVAKL RELLERIPHD ERVLIVSIGG VYNASGNAVD WREITLKDRI GWIAFDADIA INPAVHAAAN KLSTQLFEKS KMAEVLYLNP QVTTGDDGST TKDGVDDFLA KRGNWAALIN QLDDVMPDAP AKNAEDKAGN WRVGKTGQFV EECVALNTGA GGTIGEYRWE KRVEIGGRVV ALEARRQPTD QELKTGIFNP NVKYEDIDDA QVEIEVSWHL NGRDFTATVT GPESILGYRP EEWDRKKASV PRELLLHPEW PPRGAKGESW LSAIKANRPE DVAFKTRWMQ MGWVPVQDGD PVFLIGDQIV GDVEANTSVL PGISSREIDV IQKYGVGENI SGDFDDDDFR AEARKNFRAI MDTYIHSKAW TDLSTATVVL AGALRPVIPI RPKATIYVWG PKGKGKSWTA ECMMHFWARH RGDWHEELPG SAKDTAAAME NAVARTPIWV IDDLAPSAVR RQAEQEDAKL ADITRAIFNN ATKLRMNADM TVKKSNKPMT QLIMTAENEL STPSAKERLI PLYVGKGKLN PEKAPTDALN DIARDKGTPA RFTAQVIKYI RYAAATHPGG WAAYVTSMEA ARARLKEQIS GLMKKMGAAS GSLERTSTLA ADVMLTFYLL ENLARELDMG ADFAKQFRVN AEMSMSVIQL VTNAHAENQQ AAPGISLVKA LAALLASGHA HVISGDDPAR PPIEGTEQNE AMVNSRLGWV VGGGDGSLRP AGTNIGTVVT VMDNNREPQK VVLFSSETAF SAAQKAYPSL IQFGQGPTAA WSSVWDENLN PPYITRIKNG RGTLLAPWRK GKITGIPIAV SRLIDGGLSI EDLPEASNED
|
| |
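The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that spacing, compute GC content, and translate the gene sequence so the result can be compared with the protein sequence. It is an illustration under stated assumptions rather than the method used to build the record: the helper names are hypothetical, the codon table is the standard amino acid assignment (which translation table 11 shares for sense codons), and alternative start codons are not handled because this gene begins with ATG.

```python
from itertools import product

# Standard codon-to-amino-acid assignment in TCAG order.
# Assumption: table 11 differs from this only in its permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence shown above.
    first_blocks = "ATGGAAGAGA TCTCAAACCA AACCGATCAG"
    print(translate(first_blocks))  # prints "MEEISNQTDQ"
```

The ten residues printed match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same check against the complete protein and the GC content field.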