Gene Arth_0482 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_0482 
Symbol 
ID4447037 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp513005 
End bp514336 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content63% 
IMG OID639688279 
Productextracellular solute-binding protein 
Protein accessionYP_829981 
Protein GI116669048 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00268905 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGATTCCA AGATCAGGTT CCGAGCACGG GTTGCCGTGG CCCTGTCCAT CGCTTCAGCG 
GCTGCATTGA CCGGCTGCGG GAGCGGGCCG TCCGCCCCGG CCGCCACGGA CGACGGCCAG
CCCATTGAAG TGTGGGCACG CGCTGGCACC GACGCCGCCA CCACCTACGC CGCCATGTTC
AAGGAATTCA CGGCGAAGAC GGGTGTGAAG GTCAATTTCC AGGGGGTTCC GGACCTGGAC
CAGCAACTGC AGACCCGCGC GGCATCCAAG AAGCTCCCGG ACATCGTCAT CAACGACTCC
GCCGCGCTGG GCAACTACAC GGCGCAGGGC TACCTCCAAA AGATCGACAG GTCCTCAGTG
ACCGGTAGCG ACCGGATTGC CGATTCCCTG TGGAAGGAGA CTGAAGGCCT GGACGGAGCA
AGCTACGGAG TTCCGTTCTC CCGCCAGACC ATGGTCACCA TGATCCGCAA GGACTGGCGC
GAAAAGCTGG GACTTCCCGT CCCCAAGACG CTTGAGGACC TGGCGAAGCT CGCCACGGCC
TTCGCCACCC AGGATCCGGA CGGCAACGGC CAGGCAGACA CATACGGCAT GGTCGTTCCC
GGCTCCACCG AACGCGGATA CCTGGGCTGG TGGGCTTCCT CGTACATCTG GCAGGACGGC
GGCAGCTACC TGAAGGATGA GGGCAACGGA AAGTACTCGT CCAACGCCTC CTCCCCGGAA
ACCCAGGCAG GTGTCAGCTG GGTCAAGCAG CAGTTCTGCA CTCCGGGAAA CACGCAGCCC
GGTGCCCTGA CCGCAGCCAC CAGCGTGGCC TCCCCGTTCT TCCAGACCGG CAAGGCCGGC
ATCATCCTCA CCGGCCCCTA CAACTTCTCA TCGTTCGACA AGACTCCCGG CAAGGACGTC
TACGAGGTCA TCGAAGCGCC CAAGGGCTCC AAGGACAACA CCGTGCTGGC CGAGGGCGAG
AACATCTATG TCACGGCAAG CAACGCCAAG AAGGACCAGA CCAAGCAGGT CATCGACTAC
CTGGTTTCCC AGGACGGCCA GAAGGCCGGC ATGACGGCCG GCAAGCAGCC GATCGTCCGC
GTACCGGTGA ATTCCGACGT CGATGCTGCC GCCGTCTACA ATGATCCGCG CTGGTCAGTG
GTCCAGGAGG CACTGAAGTC GTCGTCCAAG GCCTTCCCGT CCGCCATCAA CTTCGTCCCG
TTCCGCCAGG CCGCGGCTGA AGCGCTGAAC AAGATCGTGG CCGATTGCCC TGCAGACAAC
GTGGCCTCCG GTTTGCAGGC CCTGGACACC GCGATCAATG ACGAACTGAA CAGCCAGAAC
GCCAAGTCAT GA
 
Protein sequence
MDSKIRFRAR VAVALSIASA AALTGCGSGP SAPAATDDGQ PIEVWARAGT DAATTYAAMF 
KEFTAKTGVK VNFQGVPDLD QQLQTRAASK KLPDIVINDS AALGNYTAQG YLQKIDRSSV
TGSDRIADSL WKETEGLDGA SYGVPFSRQT MVTMIRKDWR EKLGLPVPKT LEDLAKLATA
FATQDPDGNG QADTYGMVVP GSTERGYLGW WASSYIWQDG GSYLKDEGNG KYSSNASSPE
TQAGVSWVKQ QFCTPGNTQP GALTAATSVA SPFFQTGKAG IILTGPYNFS SFDKTPGKDV
YEVIEAPKGS KDNTVLAEGE NIYVTASNAK KDQTKQVIDY LVSQDGQKAG MTAGKQPIVR
VPVNSDVDAA AVYNDPRWSV VQEALKSSSK AFPSAINFVP FRQAAAEALN KIVADCPADN
VASGLQALDT AINDELNSQN AKS