Gene Information |
Locus tag | Achl_1967 |
Symbol | |
ID | 7293428 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter chlorophenolicus A6 |
Kingdom | Bacteria |
Replicon accession | NC_011886 |
Strand | - |
Start bp | 2217701 |
End bp | 2219032 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643590371 |
Product | CBS domain containing protein |
Protein accession | YP_002488030 |
Protein GI | 220912721 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
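The coordinate and length fields above are related by simple arithmetic. The following minimal sketch checks that relationship, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of the record.

```python
# Values copied from the record above.
START_BP = 2217701
END_BP = 2219032


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of amino acids, assuming one terminal stop codon is not translated."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1332 443
```

Both values agree with the Gene Length (1332 bp) and Protein Length (443 aa) fields listed in the record.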
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.00000080224 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGACGCCCC TGCTCCTGGT GGCGATGGCC CTGGTTTTCC TGGCCATCGC CGCCGTTCTG ACTGCGGCCG AGGCCGCCTT CAATTTCCTC CCGCGCCACG ATGCGGAAGA AGCTCTGCTC CGGAGCCGCG GAACGGCTAT GCGCAGCATC CTCCGCGAGC CCGTTGCCCA CATCCGGGCC CTGAGGTTCT GGCGTGTCTG GTTTGAAATG GCCTCCGCCG TGGCCGTCTC GGTGCTGCTT TACAGTCTCC TGGACAACAT CTGGCTGGCC GGCCTTGCAG CAACCGGCAT CATGGCGCTG CTGGGGTTCG TGATCGTGGG TGTCTCCCCC CGCCAGCTGG GCAGACTCCA CTCGTCCGCG GTAGTGCGCT TCACTGCGCC GATGATCCGT TTCCTGACGT GGGTACTGGG GCCCATCCCC GGATGGCTCG TCGCCGTGGG TAGCGCTGCC GCGCCAGGCG CCCCCGGCGG GGATGAAGCC TTCTTCAGCG AACAGGAATT CCGTGAACTG GTGGACCGCG CCAGCGAATC CGACATGATC GAGGACACCG AAGCTGAGAT GATCCAGTCG GTGTTCGACT TCGGGGACAC GCTGGTCCGT GCCGTCATGG TGCCCAGGAC GGACATCGTT GGCATCGACT CCGGTTCGAG CCTGCACGAG GCCATGTCCC TGTTCCTCCG GTCCGGCTAC TCCCGGGTCC CGGTTATCGG CGAGGACACC GACCACATCC TGGGAATCGT CTACCTCAAG GACGTGGCCG CCGTCGTGCA TGAACTGGAC GCCGGCGTCG AACCGCCCAC GGTCGACTCC ATGGCACGCG AGGTCCGCTA CGTGCCGGAA TCAAAGCCGG TGAGCGACCT GCTGCGGGAA CTGCAGAAGG AGTCCACACA TGTGGCCATC GTCATCGACG AGTACGGCGG CACGGCCGGC CTGGTCACGC TTGAGGACCT GATCGAGGAA ATCGTGGGCG AGATCGTTGA CGAATACGAC ACCGAAAGCG CCGAGGCCGT AGAGCTCGGC GACGGTTCCT ACCGGGTCAG CTCCAGGATG AGCATCGATG ACCTCGGAGA GCTTTTTGAC ATCGAGCTTG ACGACGACGA AGTGGACACC GTGGGCGGAC TGCTCGCCAA AGCGCTGGGA CGTGTTCCCA TCGTAGGCAG CAGGGTGGAA GTCCAGGGAG TATCGCTGCG CGCCGACCGG CTCGAGGGCC GCCGCAACCG CGTCAGCCAT ATCATTGCGG CACCCGTGCC AACAGTAGAC ACTGACCTTG AAGACCTCTT CCATGAGGCG GAAGCAACCC AGCAGGGAGT TCCACGTGAG CAAGAAAAGT AA
Protein sequence | MTPLLLVAMA LVFLAIAAVL TAAEAAFNFL PRHDAEEALL RSRGTAMRSI LREPVAHIRA LRFWRVWFEM ASAVAVSVLL YSLLDNIWLA GLAATGIMAL LGFVIVGVSP RQLGRLHSSA VVRFTAPMIR FLTWVLGPIP GWLVAVGSAA APGAPGGDEA FFSEQEFREL VDRASESDMI EDTEAEMIQS VFDFGDTLVR AVMVPRTDIV GIDSGSSLHE AMSLFLRSGY SRVPVIGEDT DHILGIVYLK DVAAVVHELD AGVEPPTVDS MAREVRYVPE SKPVSDLLRE LQKESTHVAI VIDEYGGTAG LVTLEDLIEE IVGEIVDEYD TESAEAVELG DGSYRVSSRM SIDDLGELFD IELDDDEVDT VGGLLAKALG RVPIVGSRVE VQGVSLRADR LEGRRNRVSH IIAAPVPTVD TDLEDLFHEA EATQQGVPRE QEK
|
| |
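The sequences above are printed in space-separated blocks of ten characters, so they need light cleaning before any analysis. The sketch below shows one way to do that, to compute a GC percentage, and to check that a CDS starts and ends with codons permitted by translation table 11. The helper names (`clean`, `gc_percent`, `has_valid_ends`) are hypothetical, and the start and stop codon sets are written out by hand from NCBI translation table 11 rather than taken from this record.

```python
# Start and stop codons for NCBI translation table 11 (bacterial, archaeal, plant plastid).
TABLE11_STARTS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}
TABLE11_STOPS = {"TAA", "TAG", "TGA"}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def has_valid_ends(seq: str) -> bool:
    """True if the length is a multiple of 3 and the first and last codons are a
    table 11 start codon and stop codon, respectively."""
    s = clean(seq)
    return len(s) % 3 == 0 and s[:3] in TABLE11_STARTS and s[-3:] in TABLE11_STOPS


# First and last blocks of the gene sequence, copied from the record above.
head = clean("GTGACGCCCC TGCTCCTGGT")
tail = clean("CAAGAAAAGT AA")
print(head[:3], tail[-3:])       # GTG TAA
print(gc_percent("GTGACGCCCC"))  # 80.0 (first block only, not the whole gene)
```

The gene begins with GTG, an alternative start codon under table 11, which is consistent with the protein sequence beginning with M. Passing the full Gene sequence field to `gc_percent` would give a value to compare against the GC content field.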