Gene Arth_0426 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_0426 
Symbol 
ID4447086 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp455919 
End bp457529 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content63% 
IMG OID639688225 
Productextracellular solute-binding protein 
Protein accessionYP_829927 
Protein GI116668994 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTACCA GCCGCAAGCT CGCCGTCCTT GGCGCCCTGA TGGCAGGAAC GATGCTGTTC 
ACCGCCTGCT CGGGCAGCTC CAACGGTTCC GCCGCCATCA AAGACTCGTC CGCGGAGTTC
GGCTTCCAGG AGACCGGCTT CCCGATCGTC AAGGACACGC TGACGCTCAA GTTCTCCGGA
ACCAAGTCGG CGCTCGCCCC CGATTACAAC ACCATGTCCC TGGTGCAGCA GTGGGAAAAG
GACACCAACA TCCACATCGA CTGGGAGAAC CTCCCGGAGA CGGTGTTCAA GGAAAAGAAA
AACCTCATCC TGGCCAGCGG CGACCTGCCC GACGCCTTCT TCAACAGCGG GCTCACCGAC
GCGGAAATCG CCACCTACTC GGCCAGCGGA ACACTGATCC CCCTCGAAGA CCTCATTCAG
AAAAATGCCC CCAACCTGTC CAAGCTGCTC GCCGACCGGC CGGACATCAA AGCGGCCATC
ACCTCCTCCG ACGGGCACAT CTACTCCCTC CCCTCCATCG AAGAACTGGG ACTCGTCCAG
TTCCCCAACG AGATGGCGAT CAACACCGCG TGGCTGAACA AGCTGGGCCT CCCGATGCCC
AAGACCGTGG ACGAACTGCA TGATGCCCTG CTCGCCTTCA AGACCAAGGA CGCCTCAGGC
ACCGGTAAAA CCATCCCGCT GAGCTTCATG CCCGGCTCCT GGTGCGGTGA CATCGTTGAC
CTCATCGCCG CCTTGGGCGG AGTCCCGGAC AACATGGACC ACAGGATCGT CCAGGACGGC
AAGGTCATCT ACACCGCCAC CCAGGACGGC TACAAAAAGG CCCTCCAGAC CCTGCATACC
TGGTATCAGG AAGGCCTGAT CGATCCAGAA TCGTTCTCCC AGGATGACAA GGCCTACCTG
GCCAAGGGCA AGGCCAGCAC CGAAAACCTG GGCTCCTTCG TCTGGTGGGA AGTCAAGGAA
ATGGTCGGCG CCGACCGCGC CGGCGACTAC AAACTGCTCC CCGTACTTGA GGGCGTGGAC
GGCAAGCGGC TCGCCAGCCA GTCCAACAAC CAGGAAATCG CCCGCGGCGC CTTCGCTGTG
ACCCGAACCA ACAAATACCC TGCCGCCACC ATCCGCTGGG CAGACAACCT GTACGATCCC
ATCCAGTCCG CCCAGGCCAA CTGGGGCCCC ATCGGTGAAA CCCTGCAGAA GGACCCCGCC
ACCGGGCTGC TGACCCAGAT ACCCGCGGCC GCGGGAACCA GTGAAGGCGA ACGCCGCCAG
AAGGTTGCCC CGGGCGGCCC GAAGGCCAAC ACCGCGGAGA ACTTCGAGAA GGTCGTGGCA
CCCGAGCCGC GCGCGGCCGA GCGGCAGAAG ACCGTCGAGG AGAACTACAA GCCTTTCGCA
GCCAACGACG GCTACCCCCC GGTGGCACTG TCCAACGAGG AAGTGCAGCA GATCAGCACC
ATCGAGACGG ACGTGGCCGC CATCGTCAAG CAGACCACGG CGAAATGGAT CGTCTCCGGC
GGCATCGAGG CGGAGTGGGA CGGCTACGTC TCGCAGCTGA AGAACATCGG CCTGGACAAG
ATGGTGGACG TCTACCAGCA GGCCTACGAC AGGTACCAGA AGAACTCCTG A
 
Protein sequence
MATSRKLAVL GALMAGTMLF TACSGSSNGS AAIKDSSAEF GFQETGFPIV KDTLTLKFSG 
TKSALAPDYN TMSLVQQWEK DTNIHIDWEN LPETVFKEKK NLILASGDLP DAFFNSGLTD
AEIATYSASG TLIPLEDLIQ KNAPNLSKLL ADRPDIKAAI TSSDGHIYSL PSIEELGLVQ
FPNEMAINTA WLNKLGLPMP KTVDELHDAL LAFKTKDASG TGKTIPLSFM PGSWCGDIVD
LIAALGGVPD NMDHRIVQDG KVIYTATQDG YKKALQTLHT WYQEGLIDPE SFSQDDKAYL
AKGKASTENL GSFVWWEVKE MVGADRAGDY KLLPVLEGVD GKRLASQSNN QEIARGAFAV
TRTNKYPAAT IRWADNLYDP IQSAQANWGP IGETLQKDPA TGLLTQIPAA AGTSEGERRQ
KVAPGGPKAN TAENFEKVVA PEPRAAERQK TVEENYKPFA ANDGYPPVAL SNEEVQQIST
IETDVAAIVK QTTAKWIVSG GIEAEWDGYV SQLKNIGLDK MVDVYQQAYD RYQKNS