Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Achl_2471 |
Symbol | |
ID | 7293946 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter chlorophenolicus A6 |
Kingdom | Bacteria |
Replicon accession | NC_011886 |
Strand | - |
Start bp | 2779845 |
End bp | 2782802 |
Gene Length | 2958 bp |
Protein Length | 985 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643590880 |
Product | protein of unknown function UPF0182 |
Protein accession | YP_002488525 |
Protein GI | 220913216 |
COG category | [S] Function unknown |
COG ID | [COG1615] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0000000003677 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCCACCG GCAGACCCCC TTCCCGACGC GGCGCGCTGA CGCCCACCTT GATCGTTGTT GCCCTCGTCG TCGTGGGTTT CATCTTCTTC GCAAACGTCT GGACCGATGT CCTTTGGTAC CGGCAACTGG GGTTCATCGA GGTCTTCCTC ACCGAGAACC TCGCCAAGAT CGCCATCTTC GCTGCCGGGT TCGCAGTGAT GTTCGCCGGC GTGTTCTTCG CCATCCGCAT TGCCTACCGC TCCCGGCCCG TCTATGCGCC TGATTCCGAT ATCCGGGACA ATCTCAACCG CTACCAGGCA CAGCTGGAAC CGGTACGCCG CGTGGTGATG ATCGGCCTGC CCATCCTGTT CGGGCTCTTC GCCGGAAGCG CTGCCGCGAG CCAGTGGCAG AAGGTATTGC TCTTCTTCAA CCAGGTGGGC TTCAACAAGC AGGACCCCCA GTTCAACATG GACATCAGCT TCTATGTGAT GGTCCTGCCG TTCCTGGGAT TCGTCACCGG CTTCCTGATC AGCGTGGTTG TCATTGCCGG CATCGCCGGC ATCCTGACCC ACTACCTGTA CGGCAGCATC CGGCTCATGG AGCGCGGCGT CTACACCGCC CGCGCCGCCC AGATCCACAT TGCCGTCACC GGTGCCGCGT TCCTGATCCT GCTGGGAGTC AACTTCTGGC TTGACCGGTA TTCCTCGGTC CAGAGCAACG GCGGACGCTG GGCCGGTGCC CTCTACACGG ACGTTAACGC CGTCATCCCC ACGAAGGCCA TCCTCGCCGT CGCTGCCGTA CTGGTTGCTG CGCTGTTCAT CGTGGCAGCC GTCATTGGCC GATGGCGCCT GCCGGTCATC GGAACGGCGA TGCTGATCAT CACTTCCATC CTTGCCGGTG GGGTGTACCC CTGGGTGATC CAGCAGTTCC AGGTCCGTCC CTCGGAAGAG ACGCTGGAAA AGGACTACAT CCAGCGGAAC ATCGACAACA CCCGCAGCGC TTACGGGCTC GATGGAATCC AGGAAACGCG CTACGACGCT ACCAACACCG CAACATCAGG TGCCCTGGCC CCGGACGCGC AGACCACCGC GAACATCCGG CTCCTGGACC CCAACCTTAT TTCTGACGCC TTCGCGCAGC TGGAACAGTA CCGCCCGTAC TACCAGTTCC CCGAGGCCCT TAACGTTGAC CGGTACGAGG TGGACGGCAA GATCCAGGAC ACCGTGATCG CTGTCCGGGA ACTGAACCCA ACCAACGTGG CAGCCAACCA GCAGGGCTGG CTGAACCAGC ACGTGGTGTA CACGCACGGC TACGGCGTGG TGGCGGCCAA GGGCAACAAG TTCACCGTGG ACGGCAAGCC TGAGTTCCTG CAGTCCGGCA TCCCTTCCAA TGGCGTCCTG GGTAACGACA GCTCCTACCA GCCCCGCATC TACTTCGGTG AATCTTCGCC CGAGTACTCG ATCGTCGGCG CTCCGGAAGG CACGGCACCA CGTGAGCAGG ACCGTCCGTC CGGCCGTGAA GGCGAAGGCG AAACCCAGTA CACGTTCACC GGTAACGGTG GACCGAACGT GGGCTCGTTC TTCAACAAGG TCCTGTACGC CATCAAGTTC CAGTCCTCGG ACCTGCTGCT TTCGGACGGC GTGAACTCCG AGTCCCAGGT CCTGTACGAC CGCAACCCCC GTGAACGCAT CCAGAAAGTT GCGCCGTACC TCACCGTTGA CGGCAACGCC TACCCGGCGG TGGTTGATGG CCGCGTGAAG TGGATCGTGG ACGGCTACAC CACCAGCCAG TACTACCCCT ACTCGCAGCA AAAGCAGCTG TCCGAGGCCA CTGCAGACAC GCAGACCACC TCCGGCCGTG CCGTGGCGCT GCCGAACAGT ACCGTGAACT ACATCCGGAA CTCCGTCAAG GCAACGGTGG ATGCCTACGA CGGGTCCGTC ACGCTCTACG CCTGGGATGA CCAGGACCCC GTACTGAAGG CCTGGAACAA AGTGTTCCCC ACGTCGCTGA AGCCCTATTC AGAAATGTCG GGTGCGGTGA TGAGCCACGT CCGTTACCCG GAGGACCTGT TCAAGGTCCA GCGTGAACTG CTGGGCCAGT ACCACGTAAC GGATCCGCGC AGCTTCTACA AGAACAACGA CGCCTGGAGC GTCCCGGCCG ATCCCACTGT GGACACCGAC GTCAAGCAGC CGCCGTTCTA CATGTCGCTG CAGATGCCGG ACCAGGACAA GCCGGCATTC CAGCTCACGT CGTCCTTCAT TCCGCAGATC GTCAACAACA ACGCCCGCAA CGTCCTCTAC GGCTTCCTGG CAGCGGACTC CGACGCCGGC AACCAGGCGG GCGTCAAGGG TGAGGGCTAC GGCAAGCTGC GCCTGCTCAA TATCCCGCCG GAAACCCAGG TCCCCGGCCC GGGCCAGGCG CAGAACAAGT TCAACTCCGA TCCCACCGTG TCCCAGGCCC TGAACCTGTT GCGGCAGGGC GCCTCGGACG TGTTGAACGG CAACCTGCTG ACGCTGCCTG TGGGCGGCGG CATCCTGTAC GTGCAGCCGG TCTACCTGAA ATCGACCGGT GAGACGTCCT ACCCCACCCT GCAGCGCGTC CTGGTGGCAT TCGGTGACAA GGTGGGCTTC GCACCCACCC TGGATGAAGC ACTCAAGCAG CTCTTCGGCG GTAATTCCGG CGCAGCAGCG GGCGATTCGG ACAACAACGG GCAGACGCCA GCCGGGCCTG CCAGTCCGTC GGAACCGGGC GCGGACGCCA AGGCTGAACT GAAGGCAGCC CTGGACGAGG CAAATGCTGC CATCCAGGCC GGCCAGTCGG CTCTGTCAAC CGGTGACTTC GCCGCTTACG GTGAGGCGCA GAAGCGGATC ACCGCGGCAC TGAAGAAGGC CGTGGACGCC GAGGCCAAGA TTCCCGGCAC CACCCCGGAA GCAAGCCCGT CACCAACGGC AACGCCGTCA CCGTCGCCCA GCAACTAG
|
Protein sequence | MSTGRPPSRR GALTPTLIVV ALVVVGFIFF ANVWTDVLWY RQLGFIEVFL TENLAKIAIF AAGFAVMFAG VFFAIRIAYR SRPVYAPDSD IRDNLNRYQA QLEPVRRVVM IGLPILFGLF AGSAAASQWQ KVLLFFNQVG FNKQDPQFNM DISFYVMVLP FLGFVTGFLI SVVVIAGIAG ILTHYLYGSI RLMERGVYTA RAAQIHIAVT GAAFLILLGV NFWLDRYSSV QSNGGRWAGA LYTDVNAVIP TKAILAVAAV LVAALFIVAA VIGRWRLPVI GTAMLIITSI LAGGVYPWVI QQFQVRPSEE TLEKDYIQRN IDNTRSAYGL DGIQETRYDA TNTATSGALA PDAQTTANIR LLDPNLISDA FAQLEQYRPY YQFPEALNVD RYEVDGKIQD TVIAVRELNP TNVAANQQGW LNQHVVYTHG YGVVAAKGNK FTVDGKPEFL QSGIPSNGVL GNDSSYQPRI YFGESSPEYS IVGAPEGTAP REQDRPSGRE GEGETQYTFT GNGGPNVGSF FNKVLYAIKF QSSDLLLSDG VNSESQVLYD RNPRERIQKV APYLTVDGNA YPAVVDGRVK WIVDGYTTSQ YYPYSQQKQL SEATADTQTT SGRAVALPNS TVNYIRNSVK ATVDAYDGSV TLYAWDDQDP VLKAWNKVFP TSLKPYSEMS GAVMSHVRYP EDLFKVQREL LGQYHVTDPR SFYKNNDAWS VPADPTVDTD VKQPPFYMSL QMPDQDKPAF QLTSSFIPQI VNNNARNVLY GFLAADSDAG NQAGVKGEGY GKLRLLNIPP ETQVPGPGQA QNKFNSDPTV SQALNLLRQG ASDVLNGNLL TLPVGGGILY VQPVYLKSTG ETSYPTLQRV LVAFGDKVGF APTLDEALKQ LFGGNSGAAA GDSDNNGQTP AGPASPSEPG ADAKAELKAA LDEANAAIQA GQSALSTGDF AAYGEAQKRI TAALKKAVDA EAKIPGTTPE ASPSPTATPS PSPSN
|
| |