Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_2827 |
Symbol | |
ID | 4444623 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 3179774 |
End bp | 3181552 |
Gene Length | 1779 bp |
Protein Length | 592 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 639690649 |
Product | extracellular solute-binding protein |
Protein accession | YP_832306 |
Protein GI | 116671373 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.320124 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCTTAG GTCGTATTTC CAAAGCGATA GGCGTCGCGG CTGCTGCTGC GCTAGCTCTC AGCGCCTGCG CCGGCAACAG CGGCGGGACA ACCCCGTCAA GCGCTACTGC CGGCAAACAG GGTGGTTCAG CCACTGTGGT TGAAGTGAAC GCGTTCAACA CTTTCAACCC CAACACTGCC GACGGCAACA CTGACATCAA CTCGAAGATC AGCTACGCCA CCCACTCGGG TTTCTACTAC ATCGATAACC AGCTGAACGT TGTTCGCAAC GAAAAGTTCG GCAAGATGGA AAAGACTTCC GATGACCCCC TGACGGTGAA GTACACCATC AACGAGGGAG TCAAATGGTC CGACGGCACT CCGGTGACGG CAGCCGACCT CCTGCTGCAG TGGGCTGCTT TCTCGGGTTA CTACGACGAT GCCGACTCGG AAGCGAAGAC CGGCACGTCC TACTTCTCCT ACGCGGGTGA TCCCACCGGA CTCAAGCTCA CTGACTTCCC GGAGCTGGGC GACGGCAACC GTTCCATGAC CATCAAGTAC TCCAAGCCTT TCGCCGACTG GGAAACCATC CTTGGCGGTC CCGGGATCGA CATCGCAGCC CACGTCCTGG CCAAGAAAGC CGGACTGGCC GACGCCAAGG CGCTGGTGGA CTACCTTAAG GAAAAGCCGA AGGGCGATCC GAAGGCCCCG AAGCCGGCCG ATGAGAAGCT GAAGGCCATG GCGGATCTTT GGAACACCGG TTTTGACACC AAGACCCTGC CGTCCGATCC GAGCCTCTTC CTCTCCAACG GCCCGTACAT CGTCAAGAGC GTCACCCAGG ACCAGTCCCT GACCATGGTC CGCAACAAGG ACTACAACTG GGGCCCGGAG GCCAGCCTGG ACGAAATCAC GGTCCGCTAC ATCGGTTCCG CTCCGGCGCA GGTGCAGGCA CTCAAGAACG GCGAAGCGGA CATCATTGCG CCGCAGGCCT CGGCAGATAC CATCGAGCAG CTCAAGGCGC TTGAAAGCCA GGGAGTCACC GTGGAACAGG GCAACCAGCT CTCCTACGAC CACATCGACC TGAACTACTC GGGTCCGCTG GCCGAAAAGA GCGTCCGTGA GGCGTTCATG AAGACCGTTC CGCGCAAGGA CATTGTGGAC AAGATCGTGA AGAAGCTTGA CCCGGAGGCC AAGCCGCTGG ACTCGCAGCT ATTCGTTCCG GCCCAGGCGG CCTACGCTGA TTCGGTCAAG AACAACGGTT CTTCCGCTTA CCAGGACGTG GACATCGACG GTGCCAAGGC GCTCCTCGCC GGCAAGACGC CTGAGATCCG CATCATGTAC AACAAGGACA ACCCCAACCG CGTTGACGCT TTCTCCCTGA TCCGCGAGTC GGCCACCAAG GCTGGCTTCA AGATCGTTGA CGGCGGCCTG GGCAAGTCTG ACTGGGGCAA GGCCCTCGGC GACGGCAGCT ACGACGCCAC CATCTTCGGC TGGATCAACC CCGGTGTCGG CGTTTCCGGC GTCCCGCAGA TCTTCCGCAC CGGCAACGGA TCCAACTTCA ACCAGTTCAG CGATCCGGAA GCCGACAAGC TGATGGACGA GCTGATCGTC ACCACGGACC GCAGCAAGCA GGATGAGCTG AGCAAGGAGA TTGACAAGAA GATCTGGGAA TCCGCTTACG GCCTTCCGCT CTTCCAGTCC GTCGGAGTGG ACGCCTACAG CGACCGGATC ACGGGCGTGA AGTTCATGCC GAACCAGACC GGTGTTTGGT GGAACTTCTG GGAGTGGGCT GAGAAGTAA
|
Protein sequence | MRLGRISKAI GVAAAAALAL SACAGNSGGT TPSSATAGKQ GGSATVVEVN AFNTFNPNTA DGNTDINSKI SYATHSGFYY IDNQLNVVRN EKFGKMEKTS DDPLTVKYTI NEGVKWSDGT PVTAADLLLQ WAAFSGYYDD ADSEAKTGTS YFSYAGDPTG LKLTDFPELG DGNRSMTIKY SKPFADWETI LGGPGIDIAA HVLAKKAGLA DAKALVDYLK EKPKGDPKAP KPADEKLKAM ADLWNTGFDT KTLPSDPSLF LSNGPYIVKS VTQDQSLTMV RNKDYNWGPE ASLDEITVRY IGSAPAQVQA LKNGEADIIA PQASADTIEQ LKALESQGVT VEQGNQLSYD HIDLNYSGPL AEKSVREAFM KTVPRKDIVD KIVKKLDPEA KPLDSQLFVP AQAAYADSVK NNGSSAYQDV DIDGAKALLA GKTPEIRIMY NKDNPNRVDA FSLIRESATK AGFKIVDGGL GKSDWGKALG DGSYDATIFG WINPGVGVSG VPQIFRTGNG SNFNQFSDPE ADKLMDELIV TTDRSKQDEL SKEIDKKIWE SAYGLPLFQS VGVDAYSDRI TGVKFMPNQT GVWWNFWEWA EK
|
| |