Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_2230 |
Symbol | |
ID | 4445291 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 2509047 |
End bp | 2510390 |
Gene Length | 1344 bp |
Protein Length | 447 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639690039 |
Product | CBS domain-containing protein |
Protein accession | YP_831710 |
Protein GI | 116670777 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00904717 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACCCCAC TGATCCTTGT TGGCATGGCG CTGGTCTTCC TCAGTTTCGC AGCCCTGCTG ACAGCGGCTG AGTCCGCGTT TAATTTCCTT CCCCGCCACG ACGCCGAAGA GGCCGTGCTT CGAAGTGACG GCCAGGCCCT GAAGCGCATC CTGGACCAGC CAGTCGCGCA CATGCGGTCC CTGAGGTTTT GGCGGATCTG GTTCGAAATG GCCTCGGCAG TGGCGGTCGC TGTCCTTCTG CACAGCCTGC TGGACAGTGT ATGGCTCGCC GGCCTTGCCG CCACCGGAAT CATGGCCCTC GTAGGCTTCG TGATCGTGGG GGTGTCACCG CGGCAGCTGG GCCGTGCCCA CGCCGCCGGC GTCGCCAGGT TCAGTGCGCC GATCATCCGT TTCCTATGCT GGGTGCTGGG TCCCATTCCG GGCTGGCTGG TTGCGCTGGG CAGCGTCGTG GCACCGGGCG CACGCGCCGG CGACGAAGCC TCCTTCAGTG AAGAAGAGTT CCGCGAGCTC GTGGACCGTG CCACTGAATC TGACATGATC GAGGACAACG AAGCCGAACT GATCCAGTCC GTGTTCGACT TCGGCGACAC CCTGGTCCGG GCCGTGATGG TGCCCCGCAC GGACATCCTC AGCATCGACG CGGGCTCGAG CCTGCACCGG GCCATGTCCC TCTTCCTGCG GTCCGGCTAC TCCCGGATCC CCGTGATCCG CGACAACACG GACCAGATCC TGGGCATCAT CTACCTCAAG GATGTCGCCG CCGCGCTGCA CGGCCTCGGC CCGGGCGAGG AACCCCCCAT CGTGGATGAC CTTGCCCGCG AAGTCCGCTA CGTGCCGGAG TCGAAGCAGG TCAGTGACCT GCTTCGTGAA CTGCAAAAGG AATCAACGCA TGTGGCCATC GTGATCGACG AGTACGGCGG AACTGCCGGA CTTGTGACGC TTGAGGATCT GATCGAGGAA ATCGTCGGCG AGATTGTGGA TGAATATGAC ACCGAGAGCG CCGAGGCCGT GGCCCTTGGC AACGGCAGCT ACCGGGTGAG TGCCCGGATG GGCATCGACG ACCTCGGCGA GCTGTTTGAT GTGGAACTCG ACGACGACGA AGTGGACACC GTCGGCGGCC TGCTCGCCAA GGCCCTCGGC CGGGTTCCCA TCGTCGGCAG CACCGTAGAG GTGGACGGGA TCTCGCTGCG GGCGGAACGC TTGGAAGGCC GCCGCAACAG GGTCAGCCAC ATCATCGCGG CGCCCGTTGC AAAGGGCGCC GTTCCAGAAC AAACTGACCT TGAAGACCTA CTCGACGAGG CCGAAACAAT GCAACAGGGA GTTCCACGTG AGCAAGCAGA ATAA
|
Protein sequence | MTPLILVGMA LVFLSFAALL TAAESAFNFL PRHDAEEAVL RSDGQALKRI LDQPVAHMRS LRFWRIWFEM ASAVAVAVLL HSLLDSVWLA GLAATGIMAL VGFVIVGVSP RQLGRAHAAG VARFSAPIIR FLCWVLGPIP GWLVALGSVV APGARAGDEA SFSEEEFREL VDRATESDMI EDNEAELIQS VFDFGDTLVR AVMVPRTDIL SIDAGSSLHR AMSLFLRSGY SRIPVIRDNT DQILGIIYLK DVAAALHGLG PGEEPPIVDD LAREVRYVPE SKQVSDLLRE LQKESTHVAI VIDEYGGTAG LVTLEDLIEE IVGEIVDEYD TESAEAVALG NGSYRVSARM GIDDLGELFD VELDDDEVDT VGGLLAKALG RVPIVGSTVE VDGISLRAER LEGRRNRVSH IIAAPVAKGA VPEQTDLEDL LDEAETMQQG VPREQAE
|
| |