Gene Arth_1103 details


Gene Information

Locus tag: Arth_1103
Symbol:
ID: 4446406
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 1195290
End bp: 1196561
Gene Length: 1272 bp
Protein Length: 423 aa
Translation table: 11
GC content: 63%
IMG OID: 639688909
Product: inner-membrane translocator
Protein accession: YP_830597
Protein GI: 116669664
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4214] ABC-type xylose transport system, permease component
TIGRFAM ID:
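
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the record does not say so, but this reproduces the listed 1272 bp) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1195290
END_BP = 1196561


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


print(gene_length(START_BP, END_BP))                  # 1272
print(protein_length(gene_length(START_BP, END_BP)))  # 423
```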


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.660627
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGCGC TCAAGAAGCT ATTTGGCGGA AACACCCGTC AATTCGGCAT GATCTTCGCC 
CTGGTTGCAC TGATCGTCTT TTTCCAGATT TTCACCGAGG GCCGCACGCT CACCCCGGGC
AACGTCATCA ACCTCTTCAA CGGCAACTCC TACATTCTGA TCCTGGCGAT CGGCATGGTG
CTGGTCATCA TTGCCGGCCA CATCGACCTC TCAGTGGGTT CCGTGGCGGC CTTCGTCGGC
GTCACCGTGG CCCTGGCCAT CCGTGACTGG GGCATCCCCT GGTACGCCGG CGTTCTCCTG
GGCCTGGCCC TCGGGGCGCT GATCGGGGCG TGGCAAGGGT TCTGGACCGC CTATGTGGGC
ATTCCCGCCT TCATCGTGAC CCTGGCCGGT ATGCTGCTCT TCCGCGGCTT CAACCAGTTC
GTCGGCAAGT CCAACACCAT CCCGGTCCCC GCCGATTTCC AGTACCTCGG CTCGGGCTAT
CTTCCCGAAG TGGGCCCCAA CACCAACTTC AACAACCTCA CGCTGCTCCT GGGCTTGCTG
GCCGTGGCCT TTGTGATCTT CAGCGAAATC CGTGCCCGCC GGCGGGCCCT TGCCCTCGGC
GCCGAGGTGC CGGAAGCCTG GGTAATGATC CTCAAGCTCG TCCTGATCTG CGGCGCCATC
CTGTACGCCA CGTACCTGTT CGCCACCGGC CGTCCGGGAA CGTCCTTCCC CATTCCGGGC
CTGATCCTGG CCGTCCTGGT CCTCATCTAC GGCTTCATTT CCTCCAAGAC CATCGTCGGC
CGCCACATCT ACGCCGTCGG CGGCAACAGG CACGCTGCCG AACTCTCCGG CGTGCAGTCC
AAGAAGGTCA ACTTCCTGGT GATGATGAAC ATGTCCATCC TGGCCGGCCT GGCAGGCATG
ATCTTCGTGG GCCGCTCCAC CGCTTCCGGA CCGTTCGACG GCGTCGGCTG GGAACTGGAC
GCCATCGCAG CCGTGTTCAT CGGCGGCGCC GCCGTGACCG GCGGCGTGGG TACCGTGATC
GGCTCGATCG TTGGTGGCCT GGTGATGGCC GTGCTGAACA ACGGGCTGCA GCTCCTCGGC
GTCGGCGCCG ACCTCACCCA GATCATCAAG GGCCTGGTCC TCCTGATCGC CGTTGCCTTC
GACGTCTACA ACAAGACCCA GGGCAAGAAG TCCATCATCG GCATGATGAT GAAGAACTTC
GGCCGCGGCA GCACCGAGCT CCAGCCGGAC GAGACGACGG CCACCAAGGA CGTCATCCGC
AAGGAAGCCT GA
 
Protein sequence
MNALKKLFGG NTRQFGMIFA LVALIVFFQI FTEGRTLTPG NVINLFNGNS YILILAIGMV 
LVIIAGHIDL SVGSVAAFVG VTVALAIRDW GIPWYAGVLL GLALGALIGA WQGFWTAYVG
IPAFIVTLAG MLLFRGFNQF VGKSNTIPVP ADFQYLGSGY LPEVGPNTNF NNLTLLLGLL
AVAFVIFSEI RARRRALALG AEVPEAWVMI LKLVLICGAI LYATYLFATG RPGTSFPIPG
LILAVLVLIY GFISSKTIVG RHIYAVGGNR HAAELSGVQS KKVNFLVMMN MSILAGLAGM
IFVGRSTASG PFDGVGWELD AIAAVFIGGA AVTGGVGTVI GSIVGGLVMA VLNNGLQLLG
VGADLTQIIK GLVLLIAVAF DVYNKTQGKK SIIGMMMKNF GRGSTELQPD ETTATKDVIR
KEA
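
The gene and protein sequences above can be related to each other with a short script. This is a minimal sketch, not the tool used to produce this record: it builds the standard codon table (translation table 11 uses the same amino acid assignments as the standard code and differs in which start codons it allows), strips the spaces used to group the sequence into blocks of ten, and translates until the first stop codon. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Standard codon table, built in TCAG order. Translation table 11 shares
# these amino acid assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Remove the whitespace used for 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty sequence)."""
    dna = clean(seq)
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGAACGCGC TCAAGAAGCT ATTTGGCGGA"))  # MNALKKLFGG
```

Passing the full gene sequence to `translate` and `gc_fraction` allows a comparison with the protein sequence and the GC content field listed in this record.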