Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_0188 |
Symbol | |
ID | 4447346 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 192272 |
End bp | 193915 |
Gene Length | 1644 bp |
Protein Length | 547 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639687983 |
Product | extracellular solute-binding protein |
Protein accession | YP_829689 |
Protein GI | 116668756 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATAAGA CGATCAAGAA CCGTCCGCTG GTCAACTCCG CCAGCCGCCG GAGTTTCCTG AAATTGAGTG GTGCGGCGGG CATCGCGGCT GCTTTCGCCA GCTCACTGGC GGCCTGCGGG GGCCCTGCTG CCACCACGGC AGGCGCTTCC GGTTCAACGG CGCCGATCAA CAAGGACCTC ATCATCGAGG CCGGCATCTC CTACGCCCTG TCCACCGGGT TTGATCCGCT GTCCTCCTCC GGCGCCACAC CCCTGGCCGC CAATCTGCAC GTCTACGAAG GCCTCATCGA ACTGCACCCG GCCACCCGCG AGCCGTACAA CGCCCTGGCT GCCGCGGACC CCAAGATGGT CAGCCCCACC ACCTACCAGG TGTCCCTCCG CCAAGGCGCC AAGTTCCACG ACGGCACTCC GGTCACGGCA GACGACGTTG TCTTCTCGTT CACCCGTGTG ATGGATCCGG CAAACAAGTC GCTGTTCTCG CAATTCATCC CCTTCATCAA GGAAGTCAAG GCCGTTGATG CGGCCACGGC CGAGTTCACC CTCAAATACG CTTTCCCGGG CTTCGGCCCG CGGATTTCCG TGGTCAAGGT TGTCCCCAAG GCCCTGGCCA ACTTCCCGCT GGGCTCGGAA CAGCTCAAGG CCTTCGATGC CAAGCCGGTG GGCACCGGCC CGTACAAACT CATCTCGGCA GTCAAGGACG ACAAGATCGT TTTCGAAGCC AACCCTGACT ACAACGGCCC CATGCCGGCC CTCGCAAAGG GCATGACCTG GCTGCTGCTC TCCGACGCCG CAGCGCGCGT CACCGCCATG CAGTCCGGCC GCGTGCAGGC AATCGAGGAC GTCCCCTACC TGGACGTGGA CGGCCTCAAG ACCAAGGCCG CCGTGGAATC CGTCCAGTCC TTCGGCATGC TGTTCCTGAT GTTCAACTGC ACCAAGGGAC CGTTCAGCGA CAAGCGCGTC CGCCAGGCCC TGCACTACGG CCTGGACAAG GACTCCATCA TCAAGAAGGC CCTGTTCGGC AACGCCAAGG CAGCCAGTTC CTACTTCCAG GAGGGGCACC CGGACTACGT CAAGGCGAAG AACGTCTACT CCTACGACGC CAACAAGGCC GCGGACCTGC TCAAGGAAGC CGGGGTCACC AGCCTGGAGT TCGAACTGCT GACAACGGAC ACCGCCTGGG TCAAGGATGT CGCCCCGCTG ATGCTCGAAT CCTGGAACAA GATCCCCGGG GTGAAGGTCA CACTCAAGAA CCTGCAGTCC GGTGCCTTGT ACGCGGACCG CGTGGGCAAG GGCGACTACA GCGTGGTTGC GGCTCCGGGC GACCCCTCGG TCTTCGGAAA CGACGCCGAT CTGCTGCTGA GCTGGTTCTA CGCCGGGGAC ACCTGGATGA AGGGCCGCGC CAACTGGGCC GCCACCCCTG AGCGCGCACA GCTCGTGGAC CTCATGGCCA AAGCCGGCCA GTCCGCCGGG GACGAAGCCA AGAAGCTGAC CGGCGAAATC GTCGACCTGG TTTCCGAGGA AGTGCCGCTG TACCCGATCT TCCACCGCCA GCTCCCCAGC GCGTGGGATT CCACCAAACT CAGCGGCTTC AAGCCGCTGC CCACCACCGG CGTCTCCTTC GTCGGCGTCG GCCGCACGGC CTAG
|
Protein sequence | MDKTIKNRPL VNSASRRSFL KLSGAAGIAA AFASSLAACG GPAATTAGAS GSTAPINKDL IIEAGISYAL STGFDPLSSS GATPLAANLH VYEGLIELHP ATREPYNALA AADPKMVSPT TYQVSLRQGA KFHDGTPVTA DDVVFSFTRV MDPANKSLFS QFIPFIKEVK AVDAATAEFT LKYAFPGFGP RISVVKVVPK ALANFPLGSE QLKAFDAKPV GTGPYKLISA VKDDKIVFEA NPDYNGPMPA LAKGMTWLLL SDAAARVTAM QSGRVQAIED VPYLDVDGLK TKAAVESVQS FGMLFLMFNC TKGPFSDKRV RQALHYGLDK DSIIKKALFG NAKAASSYFQ EGHPDYVKAK NVYSYDANKA ADLLKEAGVT SLEFELLTTD TAWVKDVAPL MLESWNKIPG VKVTLKNLQS GALYADRVGK GDYSVVAAPG DPSVFGNDAD LLLSWFYAGD TWMKGRANWA ATPERAQLVD LMAKAGQSAG DEAKKLTGEI VDLVSEEVPL YPIFHRQLPS AWDSTKLSGF KPLPTTGVSF VGVGRTA
|
| |