Gene Arth_0188 details


Gene Information

Locus tag: Arth_0188
Symbol:
ID: 4447346
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 192272
End bp: 193915
Gene Length: 1644 bp
Protein Length: 547 aa
Translation table: 11
GC content: 64%
IMG OID: 639687983
Product: extracellular solute-binding protein
Protein accession: YP_829689
Protein GI: 116668756
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
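
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS includes its terminal stop codon; both assumptions agree with the listed values, but the record itself does not state them. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 192272
END_BP = 193915


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming it ends in a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1644 547
```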


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATAAGA CGATCAAGAA CCGTCCGCTG GTCAACTCCG CCAGCCGCCG GAGTTTCCTG 
AAATTGAGTG GTGCGGCGGG CATCGCGGCT GCTTTCGCCA GCTCACTGGC GGCCTGCGGG
GGCCCTGCTG CCACCACGGC AGGCGCTTCC GGTTCAACGG CGCCGATCAA CAAGGACCTC
ATCATCGAGG CCGGCATCTC CTACGCCCTG TCCACCGGGT TTGATCCGCT GTCCTCCTCC
GGCGCCACAC CCCTGGCCGC CAATCTGCAC GTCTACGAAG GCCTCATCGA ACTGCACCCG
GCCACCCGCG AGCCGTACAA CGCCCTGGCT GCCGCGGACC CCAAGATGGT CAGCCCCACC
ACCTACCAGG TGTCCCTCCG CCAAGGCGCC AAGTTCCACG ACGGCACTCC GGTCACGGCA
GACGACGTTG TCTTCTCGTT CACCCGTGTG ATGGATCCGG CAAACAAGTC GCTGTTCTCG
CAATTCATCC CCTTCATCAA GGAAGTCAAG GCCGTTGATG CGGCCACGGC CGAGTTCACC
CTCAAATACG CTTTCCCGGG CTTCGGCCCG CGGATTTCCG TGGTCAAGGT TGTCCCCAAG
GCCCTGGCCA ACTTCCCGCT GGGCTCGGAA CAGCTCAAGG CCTTCGATGC CAAGCCGGTG
GGCACCGGCC CGTACAAACT CATCTCGGCA GTCAAGGACG ACAAGATCGT TTTCGAAGCC
AACCCTGACT ACAACGGCCC CATGCCGGCC CTCGCAAAGG GCATGACCTG GCTGCTGCTC
TCCGACGCCG CAGCGCGCGT CACCGCCATG CAGTCCGGCC GCGTGCAGGC AATCGAGGAC
GTCCCCTACC TGGACGTGGA CGGCCTCAAG ACCAAGGCCG CCGTGGAATC CGTCCAGTCC
TTCGGCATGC TGTTCCTGAT GTTCAACTGC ACCAAGGGAC CGTTCAGCGA CAAGCGCGTC
CGCCAGGCCC TGCACTACGG CCTGGACAAG GACTCCATCA TCAAGAAGGC CCTGTTCGGC
AACGCCAAGG CAGCCAGTTC CTACTTCCAG GAGGGGCACC CGGACTACGT CAAGGCGAAG
AACGTCTACT CCTACGACGC CAACAAGGCC GCGGACCTGC TCAAGGAAGC CGGGGTCACC
AGCCTGGAGT TCGAACTGCT GACAACGGAC ACCGCCTGGG TCAAGGATGT CGCCCCGCTG
ATGCTCGAAT CCTGGAACAA GATCCCCGGG GTGAAGGTCA CACTCAAGAA CCTGCAGTCC
GGTGCCTTGT ACGCGGACCG CGTGGGCAAG GGCGACTACA GCGTGGTTGC GGCTCCGGGC
GACCCCTCGG TCTTCGGAAA CGACGCCGAT CTGCTGCTGA GCTGGTTCTA CGCCGGGGAC
ACCTGGATGA AGGGCCGCGC CAACTGGGCC GCCACCCCTG AGCGCGCACA GCTCGTGGAC
CTCATGGCCA AAGCCGGCCA GTCCGCCGGG GACGAAGCCA AGAAGCTGAC CGGCGAAATC
GTCGACCTGG TTTCCGAGGA AGTGCCGCTG TACCCGATCT TCCACCGCCA GCTCCCCAGC
GCGTGGGATT CCACCAAACT CAGCGGCTTC AAGCCGCTGC CCACCACCGG CGTCTCCTTC
GTCGGCGTCG GCCGCACGGC CTAG
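
The gene sequence is listed in space-separated blocks of 10 bases, so it needs to be joined before any analysis. As a minimal sketch, the helpers below (hypothetical names, not part of the source database) strip the whitespace and compute the GC fraction that the GC content field summarizes. Only the first listed line is used here to keep the example short; pasting the full listing into `clean_sequence` works the same way.

```python
def clean_sequence(text: str) -> str:
    """Join a blocked sequence listing into one upper-case string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence listing above.
first_line = "ATGGATAAGA CGATCAAGAA CCGTCCGCTG GTCAACTCCG CCAGCCGCCG GAGTTTCCTG"
seq = clean_sequence(first_line)
print(len(seq), f"{gc_fraction(seq):.0%}")
```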
 
Protein sequence
MDKTIKNRPL VNSASRRSFL KLSGAAGIAA AFASSLAACG GPAATTAGAS GSTAPINKDL 
IIEAGISYAL STGFDPLSSS GATPLAANLH VYEGLIELHP ATREPYNALA AADPKMVSPT
TYQVSLRQGA KFHDGTPVTA DDVVFSFTRV MDPANKSLFS QFIPFIKEVK AVDAATAEFT
LKYAFPGFGP RISVVKVVPK ALANFPLGSE QLKAFDAKPV GTGPYKLISA VKDDKIVFEA
NPDYNGPMPA LAKGMTWLLL SDAAARVTAM QSGRVQAIED VPYLDVDGLK TKAAVESVQS
FGMLFLMFNC TKGPFSDKRV RQALHYGLDK DSIIKKALFG NAKAASSYFQ EGHPDYVKAK
NVYSYDANKA ADLLKEAGVT SLEFELLTTD TAWVKDVAPL MLESWNKIPG VKVTLKNLQS
GALYADRVGK GDYSVVAAPG DPSVFGNDAD LLLSWFYAGD TWMKGRANWA ATPERAQLVD
LMAKAGQSAG DEAKKLTGEI VDLVSEEVPL YPIFHRQLPS AWDSTKLSGF KPLPTTGVSF
VGVGRTA
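
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the usual compact 64-letter encoding (bases ordered T, C, A, G); translation table 11 assigns the same amino acids to codons as the standard table and differs in which codons may act as starts. It does not handle alternative start codons, which is sufficient here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... GGG ('*' marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS (whitespace allowed) up to the first stop codon."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 15 bases of the gene sequence listed above.
print(translate("ATGGATAAGA CGATCAAGAA CCGTCCGCTG"))  # MDKTIKNRPL
print(translate("GGCCGCACGGCCTAG"))  # GRTA
```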