Gene Arth_3917 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3917 
Symbol 
ID4443730 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4420956 
End bp4424018 
Gene Length3063 bp 
Protein Length1020 aa 
Translation table11 
GC content67% 
IMG OID639691745 
Productglycoside hydrolase family protein 
Protein accessionYP_833392 
Protein GI116672459 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3250] Beta-galactosidase/beta-glucuronidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCGTTG AGCCCACAAC ATCCAGCCTC TCCTCCGCTG CGGGCCGCGG CGCTGGACCC 
GGGTCCGAGG CAGCGGTCAG GCAGGCGGTG CACGCCCCGG AGCGGATCCT CGACGTCGCC
GCCGAGCAGG TGGCCGCACT CGCTTCCGTA GCCCCGCCAC GGGGCACGAC GCCGGCACGG
GCCTACCTTG TGAGTGATGC CCCGCGTCTG CCGCTGAACG GTGTGTGGAA GTTCAGGCTT
CACGCGGGCA TCCGGCAGGC GCCACACGAC GGCTGGCAGT CGGGCGGCGA TCGTCCGGAT
TTCGGCGACC TCCCGGTACC GTCCAGCTGG CCCATGCACG GCCACGGGAC GCCCTGGTAC
ACGAACGTTC AGTTCCCCTT CGCCGTGGAA CCGCCGCATG TTCCTGACGC CAATCCCGTC
GGCGATCTCT TCGTCGAGTT TGAGGCGGGC CCGGAGTACT TCCCGTCAGC CTTGATCCGC
TTCGACGGCA TAGAATCGGC CGGAACTGTA TGGCTGAACG GCACCTTGCT TGGCACCACA
CGCGGCAGCC GGCTCGCCCA CGAATTCGAT GCCACGGGTG TGCTGGTACC GGGACGGAAC
GTGCTTGCAG TCCAAGTGGC GCAGTTCTCG GCAGCCAGCT ATGTGGAAGA CCAGGACATG
TGGTGGCTGC CCGGCATCTT CCGGGATGTG ACGCTGCAGG CGCGGCCGTC GGAGGGGCTC
GACGACGTCT TCGTCCACGC TGACTATGAC CATCGCAGCG GCGAAGGCAT CCTCCGCGTA
GAGGCGAGCC GGGCAGGGCA GCCGGCTGCC GCCGTCGTGC GCATCCCTGA ACTGGGGCTG
GAAGTGCCCG CCGGGGAGGA GCGTCGGCTC CCGGGCATCG AGCCGTGGTC GGCAGAATTG
CCGCGGCTGT ACGACGCCAC CGTCAGCACC ACGGGGGAGA CCGCCTCGGT GCAGCTCGGC
TTCCGCACCA TCAGTATCGA GGATGCCCAG TTCAAGGTGA ACGGCCAGCG GATCCTGCTC
CGAGGCGTCA ACCGCCACGA ACACCATCCC AAGCTGGGCC GCGTGGTGCC CAGGGAGGTC
ATGGAAAGCG AACTGCGGCT GATGAAGCAG CACAACATCA ACGCCATCCG GACCTCGCAC
TACCCGCCGC ACCCGGACTT CCTGGCGCTG GCGGACCAAC TTGGCTTCTA CGTGGTCCTT
GAATGCGACC TCGAAACGCA TGGCTTCGAA CAAGGCGGCT GGCAGCAGAA TCCCAGTGCC
GATCCGCAGT GGCAGCAGGC CCTCGTGGAC CGAATGTCCC GGACCGTGGA GCGCGACAAG
AATCACGCGT CGGTGGTTAT GTGGTCCCTG GGCAACGAGG CCGGCACGGG CGAAAACCTT
GCAGCGATGT CCAGCTGGAC CAAACTCCGC GACCCCTCCA GGCCCATCCA TTACGAGGGC
GACTGGAGCT CCGCCCACGT GGACGTATAT TCGCGGATGT ATGCAAGCCA GGCGGAAACG
GAGCTGATCG GCCGTGGCGT GGAACCTCCT TTGGATGACG CGGACCTTGA CGCGCGGCGG
CGCGCCATGC CGTTCGTCCT GTGCGAGTAC GTTCACGCCA TGGGCAACGG CCCGGGGGGC
ATGTCCGAAT ACCAGGAACT TTTTCACCGG CACCCGCGGC TGATGGGCGG CTTTGTGTGG
GAGTGGCTGG AGCATGGCAT CACCCGAACG GCTCCTGACG GGCAGGAATA CTTCGTCTAT
GGCGGCGACT TCGGCGAGGA AGTCCACGAC GGAAACTTCG TCACTGACGG CCTGGTGGAT
GCCAACCGGG TGCCGCGCCC GGGCCTGCTG GATTTCAAGA AGGTCATCGA GCCGCTAACC
ATTGACGTGG CGGCGGACTG GTCCGGGTTC TCGCTCACGA ACCGCTTCGA TTTCGCCGAC
ACATCGGGGC TGGAGTTCCG CTACTCTGTG GAGGCCGACG GCGTCACGGT CGACGGCGGT
CCGGTTGACG TTGCGCCTGT GGCGCCGCAC GAATCAGCCG ACCTCCGGTT GCCGGCCGGG
CTGCTGGAAC GAGCGGGGCT GGCTCCTGCC GTGCTGACGG TCAGGGCAGT CCTCAAGGAA
GACGCCGCCT GGGTGGACGC GGGCCACGAA GTTGCGTGGG GCCAGAAGTC TGCCAATGTG
CGGGTGGCTC CGGTGCCGGC GGCAACTTCC GCCGTGCAGG TGGACGGTGG CCTCCTTCGC
CTGGGGCCGG CTGCCTTCGA TCGCGCGACG GGTTCGCTTG TTCGGCTGGG CGACACTGCA
GTCGAGGGGC TTCGACTCCT GCTGTGGCGT GCACCCACGG ATAATGATCT TGGGGCGGAG
TGGGGTAGCC CGGATCCCCG CCCAGTAGCC ACACAGTGGC TGGATGCGGG ACTCAACCGG
ATGCACGCCC GGCTTGTCGG CATTTCCTCC CGGCCCGCGG CAGGCGGCGG GGACGAGCTG
GTGGTGCGGA CGCGGGTTGC AGCGGCAGGT AAGCAATTCG GGGTGCTGGC CGAATACGCC
TGGACCAGTG ACGGCTCCAG TGTCAGCGTG CGGACAACGG TCACACCCGA CGGTTCGTGG
GTCAACGCCG GCTGGCCAGT GCCGTGGGCG CGCATAGGAG TGGAACTCGT GATGGGTTCG
GCCGCCAAGT CCGTGGAGTG GTTCGGCCAG GGGCCGCACC ACAGTTACCC GGATACAGGC
CAGGGCACCA GGCTTGGCTG GTACGACTTG CCCTTGAAGG ACCTGGACGT GGAGTACGTC
CGGCCCCAGG AATCGGGCGC CCGTTCCGGC GTGTACTCGG CAGCGTTGGA GCTCGACGCC
GGCCGGCTTA CCATCGGCGG CGAGCCGTTC GCACTCACCG TGCGCCCCTA CAGCCTGGCG
GCACTGGAAG CCGCCACCCA TCGCCCGGAC CTTATTCCGG ACGGCCGCAC CTACGTGTAC
CTCGATCACG CCGTACTCGG AGTCGGAACT GCGGCCTGCG GTCCGGGGGT GCTGGAAGGT
TACCGGCTGG CGCCGCGGGA AGCCGACTTC TCGTTGGTCT TTGGTGTGAC TCAGTCCACA
TAG
 
Protein sequence
MPVEPTTSSL SSAAGRGAGP GSEAAVRQAV HAPERILDVA AEQVAALASV APPRGTTPAR 
AYLVSDAPRL PLNGVWKFRL HAGIRQAPHD GWQSGGDRPD FGDLPVPSSW PMHGHGTPWY
TNVQFPFAVE PPHVPDANPV GDLFVEFEAG PEYFPSALIR FDGIESAGTV WLNGTLLGTT
RGSRLAHEFD ATGVLVPGRN VLAVQVAQFS AASYVEDQDM WWLPGIFRDV TLQARPSEGL
DDVFVHADYD HRSGEGILRV EASRAGQPAA AVVRIPELGL EVPAGEERRL PGIEPWSAEL
PRLYDATVST TGETASVQLG FRTISIEDAQ FKVNGQRILL RGVNRHEHHP KLGRVVPREV
MESELRLMKQ HNINAIRTSH YPPHPDFLAL ADQLGFYVVL ECDLETHGFE QGGWQQNPSA
DPQWQQALVD RMSRTVERDK NHASVVMWSL GNEAGTGENL AAMSSWTKLR DPSRPIHYEG
DWSSAHVDVY SRMYASQAET ELIGRGVEPP LDDADLDARR RAMPFVLCEY VHAMGNGPGG
MSEYQELFHR HPRLMGGFVW EWLEHGITRT APDGQEYFVY GGDFGEEVHD GNFVTDGLVD
ANRVPRPGLL DFKKVIEPLT IDVAADWSGF SLTNRFDFAD TSGLEFRYSV EADGVTVDGG
PVDVAPVAPH ESADLRLPAG LLERAGLAPA VLTVRAVLKE DAAWVDAGHE VAWGQKSANV
RVAPVPAATS AVQVDGGLLR LGPAAFDRAT GSLVRLGDTA VEGLRLLLWR APTDNDLGAE
WGSPDPRPVA TQWLDAGLNR MHARLVGISS RPAAGGGDEL VVRTRVAAAG KQFGVLAEYA
WTSDGSSVSV RTTVTPDGSW VNAGWPVPWA RIGVELVMGS AAKSVEWFGQ GPHHSYPDTG
QGTRLGWYDL PLKDLDVEYV RPQESGARSG VYSAALELDA GRLTIGGEPF ALTVRPYSLA
ALEAATHRPD LIPDGRTYVY LDHAVLGVGT AACGPGVLEG YRLAPREADF SLVFGVTQST