Gene Arth_3325 details


Gene Information

Locus tag: Arth_3325
Symbol:
ID: 4443967
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 3733230
End bp: 3735245
Gene Length: 2016 bp
Protein Length: 671 aa
Translation table: 11
GC content: 67%
IMG OID: 639691148
Product: Beta-galactosidase
Protein accession: YP_832800
Protein GI: 116671867
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1874] Beta-galactosidase
TIGRFAM ID:
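
The coordinates and lengths listed above are mutually consistent, which a little arithmetic can confirm. The following is a minimal sketch, assuming the start and end coordinates are inclusive and that the final codon of the CDS is a stop codon not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Arth_3325.
length_bp = gene_length(3733230, 3735245)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "2016 671"
```

The result matches the listed gene length of 2016 bp and protein length of 671 aa.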


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0011475
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCATCCC AGGAAAATCC CACTCCACCC GCAGTGTGGA GCAGCCTCGA AGGTATCGCC 
TACGGGGGCG ACTACAACCC CGAGCAGTGG CCTGTCAGCA CCCGGCAGGA AGACCTGGAC
CTCATGAAGG AAGCCGGCGT CACGTTCCTC AGCGTGGGCA TTTTTTCCTG GGGCCTGCTG
GAACCGTCCG AGGGCGACTA CGATTTTGGC TGGCTGGACG ACGCCCTGGA CAACCTGGCA
GCAGCAGGCA TCAAGGTGGC CCTCGCGACC GCAACCGCCG CGCCCCCGGC ATGGTTGGTG
CGCAAGCACC CGGAAATCCT CCCGGTCACC GCTGACGGAA CCGTGCTGGG GCCGGGCTCC
CGGCGGCACT ACACGCCGTC GTCGGCCGTT TACCGGCGGT ACGCAACCGG CATCACCCGG
GTGCTGGCAG AGCGCTACAA GGACCACCCG GCACTGGCGC TGTGGCACGT GGACAACGAA
CTCGGCTGCC ACGTTTCGGA GTTCTACGGC GACGAGGACG CCACTGCCTT CCGCCGTTGG
CTTGAGCGTC GGTACGGGAG CATCGATGCC CTGAACGCGG CGTGGGGAAC GGCGTTCTGG
TCCCAGCACT ACTCCTCGTT CGAGGAGGTC CTGCCGCCAG CGGCGGCGCC GTCCACGCTT
AATCCGGGGC AGCAGCTGGA CTTCCAGCGC TTCAGCTCCT GGGCCCTGAT GGACTATTAC
CGCGCCCTCC TCGAGGTGCT GCGCGACGTA ACGCCCGGGG TGCCTGCCAC CACCAACCTC
ATGGTTTCCA GCGCCACCAA ATCCATGGAC TACTTCGACT GGGCCAAGGA CCTGGACGTG
ATCGCCAACG ACCACTACCT CGTGGCCGCC GATCCGGAGC GTCAGATCGA ACTTGCGTTC
AGTGCGGACC TCACGCGCGG CGTTGCCGGC GGTGACCCGT GGATACTGAT GGAACATTCG
ACGTCGGCAG TGAACTGGCA GCCCCGCAAC CAGCCCAAGA TGCCCGGCGA AATGCTGCGC
AACTCCCTGG CGCACGTGGC CCGCGGCGCT GATGCCGTTA TGTTCTTCCA GTGGCGGCAG
AGCTTCGCCG GTTCGGAGAA GTTCCACTCC GCTATGGTGC CCCACGGCGG ACGGGACACC
CGCATCTTCC ACGAGGTGGT GGGGCTCGGC GCCGCACTCC AGAAGCTGGC CGCGGTGCGG
GGATCGCGCG TTGAGTCGCG CGTGGCCATC GTCTTCGACT ACGAGGCCTG GTGGGCCAGC
GAACTCGATT CCCACCCCAG CCAGGACGTG AAATACCTCG ACCTGATGCG CGCCTTCCAC
CGCTCGCTGT TCCTGCGCGG GGTTTCGACG GATTTTGTGC ACCCTTCCGC CTCCCTGGAG
GGCTACGACC TGGTGCTGGT GTGCACCCTG TACAACGTCA CCGACCCGGC TGCCGCCAGC
ATCGCATCGG CTGCCCGCGC CGGCGCAACA GTCCTGGTGA GCTATTTCAG CGGGATTGTG
GACGAACAGG ACCACATCCG CCTCGGCGGG TACCCGGGCG CCTTCCGGGA GCTGCTGGGC
ATCAGGGTGG AGGAATTCCA CCCCCTGCTG ACCGGTGCCA CGGCCAAGCT CAGCGACGGC
CGAGTGGGCA CGGTATGGAG CGAGCACGTG CACTTGGCCG GGGCTGAACC GGTACAGACG
TTCACTGAAT ACCCGTTGGA GGGCGTCCCC GCCCTGACCC GGCGCTCCGT CGGAACGGGC
TCGGCCTGGT ACCTGGCCAC CTTCCCGGAC CGCGACGGCA TCGAGGCGCT GGTGGACCGG
TTGCTCGCTG ATTCCGGAGT TTCCGCAGTG GCGGAGGCGG ACCAGGGCGT TGAACTGACG
CGACGCCGCA CCGGCGACGG CACCAGCTTC ATCTTCGCCA TCAACCACTC ACGCGCCGAT
GCCACGGTCC GCGTGGCAGG GGCAGAGCTG CTGTCCGGCG AGCGGTTCAC CGGCACGGTT
CCGGCTGGCG GCGTGGCGGT GATCGCGGAG GACTGA
 
Protein sequence
MSSQENPTPP AVWSSLEGIA YGGDYNPEQW PVSTRQEDLD LMKEAGVTFL SVGIFSWGLL 
EPSEGDYDFG WLDDALDNLA AAGIKVALAT ATAAPPAWLV RKHPEILPVT ADGTVLGPGS
RRHYTPSSAV YRRYATGITR VLAERYKDHP ALALWHVDNE LGCHVSEFYG DEDATAFRRW
LERRYGSIDA LNAAWGTAFW SQHYSSFEEV LPPAAAPSTL NPGQQLDFQR FSSWALMDYY
RALLEVLRDV TPGVPATTNL MVSSATKSMD YFDWAKDLDV IANDHYLVAA DPERQIELAF
SADLTRGVAG GDPWILMEHS TSAVNWQPRN QPKMPGEMLR NSLAHVARGA DAVMFFQWRQ
SFAGSEKFHS AMVPHGGRDT RIFHEVVGLG AALQKLAAVR GSRVESRVAI VFDYEAWWAS
ELDSHPSQDV KYLDLMRAFH RSLFLRGVST DFVHPSASLE GYDLVLVCTL YNVTDPAAAS
IASAARAGAT VLVSYFSGIV DEQDHIRLGG YPGAFRELLG IRVEEFHPLL TGATAKLSDG
RVGTVWSEHV HLAGAEPVQT FTEYPLEGVP ALTRRSVGTG SAWYLATFPD RDGIEALVDR
LLADSGVSAV AEADQGVELT RRRTGDGTSF IFAINHSRAD ATVRVAGAEL LSGERFTGTV
PAGGVAVIAE D
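
The sequences above are displayed in blocks of 10 characters, 60 per line, so the whitespace has to be removed before any analysis. The following is a minimal sketch of that cleanup together with a simple GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first line of the gene sequence rather than the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First display line of the gene sequence, copied from the record.
first_line = (
    "ATGTCATCCC AGGAAAATCC CACTCCACCC GCAGTGTGGA GCAGCCTCGA AGGTATCGCC"
)
seq = clean_sequence(first_line)
print(len(seq))  # prints 60
print(f"{gc_fraction(seq):.2%}")
```

Applied to the full gene sequence, the same two calls should give a value close to the GC content reported in the Gene Information section, although the rounding used by the source database is not stated here.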