Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_2548 |
Symbol | |
ID | 4444874 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 2857465 |
End bp | 2859219 |
Gene Length | 1755 bp |
Protein Length | 584 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639690367 |
Product | extracellular solute-binding protein |
Protein accession | YP_832027 |
Protein GI | 116671094 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0070372 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTCCAGGA TCGTTCAATT AGACTGGGGA CCGGTCCGCT GCCCGCGGTC CTTCCGGCCG CCCAGAGGTG ATCATCGAGT GCCCAGCAGT AAATCGCGCC GCATTGCAGT CAGTATCGGT GCAGGGGTCC TGTCGCTGCT GCTGGCCGCA TGCACAGGAA CGCCTGCTCC GGCACCGGCG TCGTCCTCTG CGGCTCCTGC CGGCACGGCA GCCGCTGGCC CTACCGCGAC GTTCACCTTC GGCACGGCGG CCCAGCCGCT GGGCCTTGAT CCCGCGCTGG CATCCGATGT GGAGTCCTAC CGGATCACCC GCCAGGTCCT GGAGGGCCTT GTGGGAGTGG ACCAGACCAC GGGCCTGCCC ACTCCCCTGC TCGCCACTGA GTGGGCGGAA TCCAACGAAG GCCGCGCCTA CACCTTCAAG CTCCGCGAGG GCGTCACCTT CCAGGACGGT ACACGCTTCG ATGCCGCGGC TGTCTGCGCC AACTTCAACA GATGGTTCAA CTTCCCGGCG AGCCTGCGGA AGCAGGCGCC CGGCACATCG TTCAAGGGCG TGTTCAAGGC CTACGCCGAC CAGGCATCGC TCTCCATCTA CAAGGGCTGC ACCGCCGTCT CTGCCGGGAA CGTCCGGATC GACCTCACCC AGCCGTTCAC CGGGTTCCTC CAGGCGCTGA CGCTCCCGGC CTTCGCCATA TCCTCCCCGA CGGCCATGGC GGCACAGAAG GCGGACAGCC TCAGCCAGAC CCGGGACGGC CAGCCCGTGT CGGCCTATGC CCTGCACCCT GTGGGCACGG GTCCCTTCAG CTTCGCCGCG TGGCAGGACT CCAGCGTCAA GCTGGTCAGC AACAAGGACT ATTGGGGTGA CAGGGGGCAG ATCGGCACCA TCAACTTCGT CACCTACGAT CACCCGCAGT CCAGGCTCCA GGCCCTCCTC GACGGGAAGA TCGACGGCTA TGACGCCGTC ACTGTGGGCA ACTTCGACCA ACTCGTCAAA CGCGGGCAGC AAATCATCCA GCGCGACCCG TTCTCCGTGA TGTACCTGGG CATGAACCAG GAAGTGCCCA TCCTGCAAAA CATCAAAGTG CGCCAGGCCA TCGAGATGGC GGTGGACAAG GAAACGCTGA TCCGCCGGTT CTTCATCGAC AACACTGCCC AGGCAACCCA GTTCGTCCCG CCCAAGCTCA GCGGGTTCAA CAACAACGCC CCCTCACTGG GCCACGACCC GGCCAAGGCC AAGGCGCTTC TGGAGGAAGC CGGGTACAAG GGCGAGGAAC TCAAGTTCTA CTACCCCCTC AATGTCACCA GGCCATACCT TCCCACACCC GAAAAGGTGT ACGCAGAGCT CAGCAGGCAA CTTACCGCTG TGGGCCTGAA CATCAAGCCG GTTCCGGTGG AGTGGTCGGA CGGGTACCTG CAAAAAGTCC AGTCACCAGG GGACCATGCC CTGCACCTGC TCGGCTGGAA CGGTTCCTAC TCGGATCCGG ACAATTTTGT GGGTCCCCTG TTTGGCGAGA AGACCGGTGA ATTCGGCTAC CAGGACCCGC AGGTCTTTTC GAAGATCGCC CGGGCACGCG GCTTGCCGGA GGGCGAGGAG CGGACGCAGC AATACCGCAC CATCAACGCC CAGATCGCCG AATCGGTCCC CGCCGTCCCC ATTGCTTTCC CCATTTCAGC TCTGGCGCTC TCCGACCGGG TGCTGAAGTA CCCTGCCTCG CCGGTATTAA ACGAGGTTTT CACAAAGGTG GAGCTAAAAC CTTGA
|
Protein sequence | MSRIVQLDWG PVRCPRSFRP PRGDHRVPSS KSRRIAVSIG AGVLSLLLAA CTGTPAPAPA SSSAAPAGTA AAGPTATFTF GTAAQPLGLD PALASDVESY RITRQVLEGL VGVDQTTGLP TPLLATEWAE SNEGRAYTFK LREGVTFQDG TRFDAAAVCA NFNRWFNFPA SLRKQAPGTS FKGVFKAYAD QASLSIYKGC TAVSAGNVRI DLTQPFTGFL QALTLPAFAI SSPTAMAAQK ADSLSQTRDG QPVSAYALHP VGTGPFSFAA WQDSSVKLVS NKDYWGDRGQ IGTINFVTYD HPQSRLQALL DGKIDGYDAV TVGNFDQLVK RGQQIIQRDP FSVMYLGMNQ EVPILQNIKV RQAIEMAVDK ETLIRRFFID NTAQATQFVP PKLSGFNNNA PSLGHDPAKA KALLEEAGYK GEELKFYYPL NVTRPYLPTP EKVYAELSRQ LTAVGLNIKP VPVEWSDGYL QKVQSPGDHA LHLLGWNGSY SDPDNFVGPL FGEKTGEFGY QDPQVFSKIA RARGLPEGEE RTQQYRTINA QIAESVPAVP IAFPISALAL SDRVLKYPAS PVLNEVFTKV ELKP
|
| |