Gene Arth_0496 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_0496 
Symbol 
ID4447016 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp526933 
End bp528294 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content61% 
IMG OID639688293 
Productextracellular solute-binding protein 
Protein accessionYP_829995 
Protein GI116669062 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGTTTA GATCGTCTTC CGGGCTTGCG GCAATCGTTA CGGCTGCGGC TTTGGCATTG 
ACCGGCTGCG GCGCGGGCGC AGGGACTACT GGCAGTTCAA CAAACGCCGA CGGAAAGGTG
GACGGCACGG GCAAGACGCT CAACGTCCTG GTCGGCGTCC TCAGCCAGTA CCCGGAGCAG
CAGAAGAAGT GGCAGAGCGA CATCGCGGCC AAGTTCAAGG CAGAAACCGG GGCCGACGTG
AAGTTTGAAA CCTTCGCCTC AGCCAACGAC GAGCTGACGC GCATCCAGAC CTCCGTGGTT
TCAGGGCAGG GCCCGGACAT CTATGGGCTC GGCACCACGT TCACTCCGAC CGCCTTCGCC
ACCAAAGCCT TCGTGACCCT GTCCGACGAC GATTGGAAGA AGGTCGGCGG CAAGGACCGC
TTCAACCCTG CAGCATTGGG CATCTCCGGC CCCGACGAGG GGCACCAGGC CGGCATCCCG
TTCGTAAGCC GCCCCTTCGT GATGGCTTAC AACAAGGAGC TGTTGGCGGC TGCAGGCATT
GAGAAGCCTG CCACCAGCTG GGACGAGCTT GCCGAACAGG CGAAGAAAAT GACCAAGGAC
GGCACGTTCG GCATCGCCAC CGGGTACAAA GACTCCTACG ATCCGTGGAA GTTCATCTGG
GCCATGTCCG TCCAAGCCGG CAATCCGCTG GTGGACGGAA ACAGCCTCAA GATGGATGAT
CCCACCGTCA AGAAGGCTTA CGAGACTTAT TTCGGCTGGT TGACCGATGA CAAAGTTGTG
GACCCTGCCT CCGTCGGGTG GAGCAACAGC AACGCGGTTG CTGCCTTCGC CAGCGGAAAA
GCCGGTTATC TGATGATGAC GACGTCGAGC TCCATCCCAA CGCTGGACAA GTCGGCCGTG
GCAGGCAAGT ACGAATACGC ACTGATGCCC ACTACCGCTC CGGGTGAATC CAGCCCCAAG
AGTGACGGCG CGGAAGCCGC GAGCATCCTC TCCGGGGATA ACCTCGTGGT GGCGGACTAC
TCGAAGGAGA AGGATCTCGC CTTCGCCTAC ATCAAGCTGA TCACCTCGAA AGAGGAACAG
CTGAACTACC AAAAGACCTT CGGCGACCTG CCCGCAAACG CCGAGGCGTT GGCCAGCCTC
ACTGATCCCA AGCTCAAGCC AATCGCGGAT GCCGCCGCCA AGTCCAAAGC CACCCCGTTT
ACAGGTGCTT GGGGCGACAT CCAGCTCGGC TTGCTCAACG TCACTGTTCA GTCGATTCCG
GACCTTTCCA GCGGCAGGCT CGACGAGTCG GCCCTCGAGG CTCGAATCAA GGACGCCCAG
ACCAAGGGGC AGGCGTCCCT TGACCGGGCC GCCAAGGGAT AA
 
Protein sequence
MRFRSSSGLA AIVTAAALAL TGCGAGAGTT GSSTNADGKV DGTGKTLNVL VGVLSQYPEQ 
QKKWQSDIAA KFKAETGADV KFETFASAND ELTRIQTSVV SGQGPDIYGL GTTFTPTAFA
TKAFVTLSDD DWKKVGGKDR FNPAALGISG PDEGHQAGIP FVSRPFVMAY NKELLAAAGI
EKPATSWDEL AEQAKKMTKD GTFGIATGYK DSYDPWKFIW AMSVQAGNPL VDGNSLKMDD
PTVKKAYETY FGWLTDDKVV DPASVGWSNS NAVAAFASGK AGYLMMTTSS SIPTLDKSAV
AGKYEYALMP TTAPGESSPK SDGAEAASIL SGDNLVVADY SKEKDLAFAY IKLITSKEEQ
LNYQKTFGDL PANAEALASL TDPKLKPIAD AAAKSKATPF TGAWGDIQLG LLNVTVQSIP
DLSSGRLDES ALEARIKDAQ TKGQASLDRA AKG