Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Achl_0830 |
Symbol | |
ID | 7292266 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter chlorophenolicus A6 |
Kingdom | Bacteria |
Replicon accession | NC_011886 |
Strand | + |
Start bp | 900789 |
End bp | 903665 |
Gene Length | 2877 bp |
Protein Length | 958 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643589230 |
Product | protein of unknown function DUF214 |
Protein accession | YP_002486914 |
Protein GI | 220911605 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3127] Predicted ABC-type transport system involved in lysophospholipase L1 biosynthesis, permease component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 0.800045 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCCACC GGCACCCGCC GCTGGAGTTG GCGCCGGCCG CCGGTGGCCG CCGAGGCAGT TTCCGGGTGG CCCTGAGGAT GGCCCGGCGG GACATTGCCC GGCACAGGGG CCGGTCCCTG CTCATCGTGC TGCTGATCAT GCTGCCGGTG GCAGGGATGA CCGCGGCAGG CACCCTGTAC CAAAGTTCAC TGCGGACGCC GCAGGAAATC GTGCGGTACG AACTGGGCAA AACCCAGGCA CGCTTCGTCG CTTTGCCCAC GCCCAATGGG GACTCCGTCC AGGACCCGCT GAATGACTCC GTAGTGGCCT CGAGCACCGG AGAGCTGGAT GGTGATTTCG TTCCCACGGA CCCCAAGGAC CTGCTTCCGC CCGGCTACGG GATACTGCCG CTGCGGCAAT TGCAGCTGAC CACCTCCGTG GGCGCGGCCA AGGTCACGCT GAACGGCGTG CAGGTGGACG CGCTGAACGA GGCTTTCGAG GGGAAATTCA CCCTCCTGGA TGGCCGGGCA CCGGCAGGCG GTGCAGAGGT TTTGGCCTCA CGCGGGCTGC TCAAGCGGTT CAACGTGGAC TTGGGCGAGG AACTTACGAC GTCGGCGGGG ACCTTTGTCC CGGTGGGCAC CATCCGCGAC GCGGACGCCT CGGACAACAA CAGCACGCTG TACCTGCAGT TCGCGCAGGT GCCCGCAGGC CTTGTCCAGG GGCCCGCTTC CGCACAGGCT GCCTCCTACT ATGTGACAGG ACCGGAACCG GTCACCTGGC AGCAGGTCAG GGAGGCCAAC AGCAAGGGCG TGGGAGTGCT GTCCCGCAGC GTGGTGCTGG ATCCGCCGCC GCCGGGCGAG CTGACCGTGC CGGGGGCCAG GGTGGCCGGA GTTCCCTCGG AGACGGTCGC CACCTACGCC ACCTTCGCTG TGGTGGGTGC CCTTGCCCTG CTGGAAGTCG GCCTCCTGGC CGGGGCAGCG TTCGCCGTCG GGGCCAAGCA CCAGGTGCGC GAACTCGCTC TTCTTGCGGC TTCCGGAGCC GAAGCACCCA CCATCCGTGC CGTGGTCACC GCCAGCGGCC TGTGGCTCGG CGGCCTGGCC GTCACGGCCG GCGCCGCTGC CGGCTTAGCA GCTGCGGCCG GAGTGGTCTG GTGGGTCCGT TCCACCGGTT CCGCCCGCCT GGCGGGAGTC CACCCGGATC TGCTATTGAC CACCGTAGCC ATGCTGATGG GCTTGGCGGC CTGCATCCTT GCCGCCCTGG CGCCGGCCAA CCTCGTTGCC CGGCAGGCAC TGCTGGGTGC GCTCAAGTCC GGACGCGCAC CGGCAGCAAG CGGACGCCGG AGCACCCTCG CCGGGGTGGC TGTGCTCTTG GCGGCCGCCG GACTGCTAGC CGCAGGCTCG GCCCTGGGCA GCTCGGCCAA GGACCCCGAC CAGCGGGCAC AACAGGCCGC CGCAGTTTCC GCGATGCTGG CCGGGGGAGC GGTCCTGGCG GTGGTGGCCC TAGTGCTCCT GACCGGATGG CTGGCAGCGA CCCTGACATC CCGCACCCGG GCCATGCCGC TGCCGCTGCG CATGGCGGCA CGGGACTCCG CCAGGAACCG GAGCCGGACG GTCCCGGCGG TGGCGGCCGT ACTGGCCGCC GCGACCCTGG CAAGTGCGGC CCTGGTGCTG ACGGCCAGCC AACTGGCAGG GCAGCGCCAG TCGCATTCCT GGAGCGCACT GGAGAACCAG TCGTACCTGC CGCTTAACCT CGCGCAGCCA CCGCTGGCCG ACGGCACCGC CGCTCCGGCC GTGACTGTCG ACCCGGAGCG GCTCTCGGCT GCCGTCTCGG GTGCCCTGGA CAGCGTCTCC TGGACCAGGA CAGTCACTAC CCCCGCGCCG GTGGAAAACT GTGGTTTCGG GGAAGGCTCA GGTCCCGCCG GGCCCGTGGC GACTGAAACG TCCAACTGTC TCCTCTACGC CCTTGCCAGG CCAAGCGGCC AGGAATGTCC GGTCACGCCC CAACGGCGGG TCGTGGATCC GGACGACTGG CGGTGTCGCG GCTCCATGGC CTCGGACCAG CCAACGGACC GGTCCATCCT GGTGGGCGGC GCCGACGATA TCCGTGCCAT GTTCGGGAGC GAAGCCGGGG CTGCCATCGT TGCCGCCCTG GACGCCGGGG CCATGGTGGT GACCAACCCG GCATTTGTCC GCGACGGAAA AGTCGAGCTG CAGGGCCTGG ACGTGCGCAC CCAGCAGCCA TCGCCGTCGG ACGGGTCCAT CGTCCACGAA GCTGTCACCA GCACCGTCTT GGACGCCGTC ATTCTGGAGC CCGCGGTGGC CGTGCCCTAC TACGGCATCG TCTCGCCGGC CACGGCCGGG CGCCTGCGCC TTGATCCGAA GGCCGTCGGG CTGGCCATTC AGCTCAACCG GCCTCCTTCG GCGGCTGAGG TTGACGCGGT GACCGGTGCG GTGGCCGGAG TCGTCGGACA CTACGGCCTG GGATTCTGGG TGGAACCGGG GGTTCCCAAA GACCAGGCCT GGATGGGCTG GCTGATCGTG GCTATCAGCG CTTTGATCAC CTTCAGCGCG GCCGGGATCA CCACTGGCCT TGCCCTCGCC GATGCCCGGA CGGACCATGC CACGCTGGCT GGAGTGGGGG CGGCGCCGCG GCTGCGGAAG ACTCTGGCGG CGTCGCAGGC GCTCTTCACC TCAGGGCTTG GAGCTGTGCT GGGTGCCGTG GCGGGCACGG TGCCCGCCGT GCTCATTGTC GCTTCCACGG AGATGCGTTC CTCGCTGGAG GTACCCTGGC TTCCCCTTGC GGCGATGGTG ATGGCCATCC CGCTGACAGC GTCAGCCCTG GCCTGGGCCT TCACCAGGTC TGCGCTGCCG ATGACCCGAA GGGCCCTCGG CGGCTGA
|
Protein sequence | MPHRHPPLEL APAAGGRRGS FRVALRMARR DIARHRGRSL LIVLLIMLPV AGMTAAGTLY QSSLRTPQEI VRYELGKTQA RFVALPTPNG DSVQDPLNDS VVASSTGELD GDFVPTDPKD LLPPGYGILP LRQLQLTTSV GAAKVTLNGV QVDALNEAFE GKFTLLDGRA PAGGAEVLAS RGLLKRFNVD LGEELTTSAG TFVPVGTIRD ADASDNNSTL YLQFAQVPAG LVQGPASAQA ASYYVTGPEP VTWQQVREAN SKGVGVLSRS VVLDPPPPGE LTVPGARVAG VPSETVATYA TFAVVGALAL LEVGLLAGAA FAVGAKHQVR ELALLAASGA EAPTIRAVVT ASGLWLGGLA VTAGAAAGLA AAAGVVWWVR STGSARLAGV HPDLLLTTVA MLMGLAACIL AALAPANLVA RQALLGALKS GRAPAASGRR STLAGVAVLL AAAGLLAAGS ALGSSAKDPD QRAQQAAAVS AMLAGGAVLA VVALVLLTGW LAATLTSRTR AMPLPLRMAA RDSARNRSRT VPAVAAVLAA ATLASAALVL TASQLAGQRQ SHSWSALENQ SYLPLNLAQP PLADGTAAPA VTVDPERLSA AVSGALDSVS WTRTVTTPAP VENCGFGEGS GPAGPVATET SNCLLYALAR PSGQECPVTP QRRVVDPDDW RCRGSMASDQ PTDRSILVGG ADDIRAMFGS EAGAAIVAAL DAGAMVVTNP AFVRDGKVEL QGLDVRTQQP SPSDGSIVHE AVTSTVLDAV ILEPAVAVPY YGIVSPATAG RLRLDPKAVG LAIQLNRPPS AAEVDAVTGA VAGVVGHYGL GFWVEPGVPK DQAWMGWLIV AISALITFSA AGITTGLALA DARTDHATLA GVGAAPRLRK TLAASQALFT SGLGAVLGAV AGTVPAVLIV ASTEMRSSLE VPWLPLAAMV MAIPLTASAL AWAFTRSALP MTRRALGG
|
| |