Gene Information |
Locus tag | Arth_1036 |
Symbol | |
ID | 4446471 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 1112941 |
End bp | 1114515 |
Gene Length | 1575 bp |
Protein Length | 524 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639688839 |
Product | extracellular solute-binding protein |
Protein accession | YP_830530 |
Protein GI | 116669597 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
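The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive (an assumption that matches the listed Gene Length) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues for an unspliced CDS, assuming one trailing stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


# Values copied from the record for Arth_1036
start_bp, end_bp = 1112941, 1114515

length = gene_length_bp(start_bp, end_bp)
print(length)                     # 1575
print(protein_length_aa(length))  # 524
```

Both results agree with the Gene Length (1575 bp) and Protein Length (524 aa) fields of the record.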
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGTTC TTCCTCAAGG AAGTGAGATT TCCCGCCGCA GGCTGCTGCA GTTCGGCGCC GCGGCCGGGT TTCTGCTGGG CACAGGAAGC CTCGCCGGCT GCGCCGGCCC CACCGGCCTG CCGGGACCCA GCACCCTGAC CCTGGCCCTC AACCGGTCCC TGGTCAGCCT GGACAACAAG CTCAACCAGT TCGACGCCGC CGTCACCGTG CAGCGCGCCG TCCGCCAGGG CCTCACCGCC ATCGGCCCCG AAACCAAGCC CGTCCTGGTC CTCGCGGACC GCTTCGAGAT GACCGGCCCC ACCGAGTGGA CCGTCCGGCT CCGCGAAGGC ATCCGCTACT CGGACAACAG CCCGGTCAAG GTGGAGGACG TTGCCACCGC CCTGAAGATG TACCAGCAGG TGCAGGGCTC CTTCGTGGCC GGCTTCTTCC CCGAATTCCC CGAAGTAGTG CCGGTGGATG ACCGCACCTT CAAGATGGTG TCCAAGAAGC CCGTCCCCAT CCTGGACTCG CTCATGAGCA TGATCCTGAT TACACCCGCC GCGCAGAACA AGCCGGAGGA ACTGCAGGAA GGCGTGGGCA CCGGCCCGTA CGTGGTCACC AAGTTCAACC GCGGCGCCGG CACCTACAGC CTGGAACGCA ACCCGAACTA CTGGGGCCCG GCACCGCAGG TGGACAACGT GGAAGTCCGG TTCCTCCCCG AGGAATCCAG CCGCGTGATC GCGCTGCGCA GCGGCGAGGT GGACATCATC GACTCCATCA CGCCGGACTC GCGCGAGCAG CTGGCCGGAC TTCCCGGCGT CGAACTTGAA GAGGCATCAA GCCTCCGGCT GAACCAGATC TTCTTCAACT TCCGCAAGCC CGCCGGCCAC CCCCTGGCCA ACCCCAAGGT GCGCCAGGCC CTGAGCATGG CGATCGACGG CGAAGCACTG GTGAAGAACG TCCTGGTGGA CTCCGTCACC CAGGCCGAGG GCGTCACGCC CTCCAGCCTC ACCGGCTACC ACAAGACCGG CACCTACACC TACGATCCGG AAAAGGCCAA GGCCACCCTC GCCGAGCTGG GCGTCAAGGA CCTCACCCTG AAGATCATCT GGGAAACCGG CGAGTTCCCG TCCGACACCT CCGTGATGGA GGCCCTGGTG GAAATGTTTG GCAAGATCGG CGTGAAGGCC GAGCTGCAGC AGTTCGAACC CGGCGGCAAC ATCCTGGCCT GGCGCCAGGG CAAGCAGGGG GACTGGGACC TCCTGGGCAA CGGCTACCCC AGCCCCACCG GCCTGGCCAT CACCATGCTG CAGGGCATGT ACTCCGGCAC CCCGGAAAAG GAAAAGACCC GTGACACCTA CCAGGGCTAC GTGATCCCTG AGGTCACCGC CAAGATCCAG GCCGCCTCCG CCGAAGCGGA CCCGGCCCGC CGGACCGAAC TGCTGAACGA CGCCCAGCAG GCAGTCTGGG ACACCTGGCC CTGCGCCTGG GCGTTCGTGC CCAAATCCGT CCTCGCCCAC CGGAAGCGGG TGTCCGGCAT CAACCTGGCG CCCACCAATT CCTACCCGCT CGTTGATGTC CGGCTGGAGG CCTGA
|
Protein sequence | MTVLPQGSEI SRRRLLQFGA AAGFLLGTGS LAGCAGPTGL PGPSTLTLAL NRSLVSLDNK LNQFDAAVTV QRAVRQGLTA IGPETKPVLV LADRFEMTGP TEWTVRLREG IRYSDNSPVK VEDVATALKM YQQVQGSFVA GFFPEFPEVV PVDDRTFKMV SKKPVPILDS LMSMILITPA AQNKPEELQE GVGTGPYVVT KFNRGAGTYS LERNPNYWGP APQVDNVEVR FLPEESSRVI ALRSGEVDII DSITPDSREQ LAGLPGVELE EASSLRLNQI FFNFRKPAGH PLANPKVRQA LSMAIDGEAL VKNVLVDSVT QAEGVTPSSL TGYHKTGTYT YDPEKAKATL AELGVKDLTL KIIWETGEFP SDTSVMEALV EMFGKIGVKA ELQQFEPGGN ILAWRQGKQG DWDLLGNGYP SPTGLAITML QGMYSGTPEK EKTRDTYQGY VIPEVTAKIQ AASAEADPAR RTELLNDAQQ AVWDTWPCAW AFVPKSVLAH RKRVSGINLA PTNSYPLVDV RLEA
|
| |
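The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a field could be cleaned up, checked for GC content, and translated is given below. The helper names are hypothetical, and the codon table is written out by hand rather than taken from a library. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code (it differs in the permitted start codons), which is why the standard assignments are used here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...)
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def strip_groups(field: str) -> str:
    """Remove the whitespace that splits the sequence into blocks of ten."""
    return "".join(field.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record
prefix = strip_groups("ATGACCGTTC TTCCTCAAGG AAGTGAGATT")
print(translate(prefix))  # MTVLPQGSEI
```

The translated prefix matches the first block of the protein sequence in the record. Running `gc_percent` and `translate` on the full gene sequence is one way to cross-check the GC content and Protein sequence fields.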