Gene Arth_3316 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3316 
Symbol 
ID4443958 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3721520 
End bp3723049 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content62% 
IMG OID639691140 
Productextracellular solute-binding protein 
Protein accessionYP_832792 
Protein GI116671859 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.578802 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCGTA GAAAACTCAG CCTGGCTGCC GCCGTCGTGA CCGCCGCCGC ACTTACCCTC 
ACAGCGTGCA GCGGGGGTGA AGCACCGGCC GCGGATTCCT CGAAGATCAG CGTCATGGCG
CCGTTCCTGG AGGCGCAGCC ACCGTCCGCT GACGGCGCCG TGCAGAAGAA GCTCGAGGAG
CTCACCGGCA AGGACGTGAA CATCACGTGG ACGCCGAACG CCTCCTACGA GGACAAGATG
AACATCACCC TGGCATCCTC CGAAATCCCC CAGGTCATGG TGGTGCAAAG CAAGTCGCCG
GGCTTCGTGA AGAACGCCGA GGCGGGCGCA TTCTGGGACC TCACGGACAA AATTGACGAG
TACCCCAACC TGAAGACCAC TTTCCCGGAC GTCGAGAAGA ACGCCAGCGT GAACGGCAAG
GTTTACGGCG TCTTCCGGGC CCGCAACCCC ATCCGCGCAG CAGTGATGTT CCGCCAGGAC
TGGCTGGACA AGCTGGGACT GAAGGCACCC GAAACCACCG AGGACCTCTA CACCGTGGCC
AAGGCCTTCA CCGAGCAGGA CCCGGACGGC AACGGCCAGA ACGACACCTG GGGCATCACC
ATTCCCAAGT GGGGTGCCCT GGGCTCCAAC AGCCCCTACG ACATCATCGA GGAGTGGTAC
GGGGCCGGTA ACCGCTGGAC CGACAAGGAC GGCAAGCTGA TCCCCAGCTT CGAAACCGAG
GAGTTCCTGG AAGCCAACCG GTTCGTCAAG AAGATGGTGG ACGAAAAGCT CATCAACCCG
GACTTCGCCA CCTTCGACGG CACCAAGTGG AACGAGCCAT TCTTCAACGG CAAGGGCGGC
ATTATTGTCG ACGTCGACTC CCGCGTCAGC GTGCTGATCA ACCTGTTCAA GCAGGCCAAC
CCGAACGATT TCCAGAACAA GGTGGGCTTC GTCGGCAACC TCAAGGGACC GGACGGCGAA
CTGCACGCAC ACCCGACGGA CGGCTACTCC GGGTTCCTTG CTATTCCCAA GACCAGCGTC
CGCACCGAGG CTGAACTGAA GAACGTCCTG GAATTCCTGA ACACGATGAA CGGCAAGGAC
GTGGCCGTGC TCCTCAACAA CGGCATCGAA GGCGTGAACT TCACCGTGGA GGACGGCAAG
GCCGCCACCA TCAAGCCCGA AACCCCTGAA GGCAAGGCTG TGGCCACGGA CATCAAGAGC
TACGCGCAGC TGGGCACCAA CGTCACCGGC AACAACTTCT ACCCCGTCAA GCAGGCCTCG
GACTACGAAC AGAAAGTCTT CGACAAGCGC GTTGAGGTCA TGGCCCAGGA CCTGAAGAGT
GCCGTCTACA ACCCGGCCGC GGCCTTCGTG TCGCCTACCT ACGTGGCCAA GGGCGCGCAG
CTGGACAACA TCGTGGCCGA TGCCCGGATC AAGTACCTGG CCGGCCAGAT CGATGAGCAG
GGCCTGAAGG ACGCCATCAA GCTGTGGAAG ACCAGCGGGG GCGACAAGGT CCAGGAAGAG
ATCAACAAGC TCTGGCAGGA CAGCAAGTGA
 
Protein sequence
MTRRKLSLAA AVVTAAALTL TACSGGEAPA ADSSKISVMA PFLEAQPPSA DGAVQKKLEE 
LTGKDVNITW TPNASYEDKM NITLASSEIP QVMVVQSKSP GFVKNAEAGA FWDLTDKIDE
YPNLKTTFPD VEKNASVNGK VYGVFRARNP IRAAVMFRQD WLDKLGLKAP ETTEDLYTVA
KAFTEQDPDG NGQNDTWGIT IPKWGALGSN SPYDIIEEWY GAGNRWTDKD GKLIPSFETE
EFLEANRFVK KMVDEKLINP DFATFDGTKW NEPFFNGKGG IIVDVDSRVS VLINLFKQAN
PNDFQNKVGF VGNLKGPDGE LHAHPTDGYS GFLAIPKTSV RTEAELKNVL EFLNTMNGKD
VAVLLNNGIE GVNFTVEDGK AATIKPETPE GKAVATDIKS YAQLGTNVTG NNFYPVKQAS
DYEQKVFDKR VEVMAQDLKS AVYNPAAAFV SPTYVAKGAQ LDNIVADARI KYLAGQIDEQ
GLKDAIKLWK TSGGDKVQEE INKLWQDSK