Gene Arth_0803 details


Gene Information

Locus tag: Arth_0803
Symbol:
ID: 4446674
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 867722
End bp: 870748
Gene Length: 3027 bp
Protein Length: 1008 aa
Translation table: 11
GC content: 66%
IMG OID: 639688609
Product: glycoside hydrolase family protein
Protein accession: YP_830301
Protein GI: 116669368
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0383] Alpha-mannosidase
TIGRFAM ID:
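
The coordinate and length fields above can be cross-checked with a short sketch. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 867722
end_bp = 870748

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 3027 1008
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.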


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.462233
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGCACGACG ACCGCCGCAT CACGGAAGTC CGTCTGGACC GCTTCATGCG CGAACGCGTG 
GACCCTGCGG TGTACTCCCG CAGCGTTCCG CTGAACCTCA GCGCCTGGGA CGTTCCGGAC
GAGCCTGTCT CCGTTCTGGA GGCCCTGCGC CACGACTTCG TGCCGCTGGA ACACGGATCG
GCGTGGGGCC GCCCCTGGAG CACCAAATGG CTGAGGCTGC AGGGTGAGGT GCCCGATTCC
TGGGGCACGG CTCCCGATAC CGCGGTGGAA ATAGTGGTGG ACCTGGGGTT CACCCGGGAG
CTCCCGGGCT TTCAGTGCGA AGGGATCGCC TGGCGGCCGG ACGGCACCAT CATCAAGGCC
ATCTCGCCCC GGAACCAGTA CATTCCACTG AAGCTCCTCG GCAGCGGGAT GGCAGTGGAC
TTCTACGTGG AGGCCGCCGC CAACCCCGAT GTTGCCCAGG GGTGGACTTT CGCAGCCATG
CCCTACGGTG ACAAGGCAAC AGCGGGAAGC GACCCCAAAT ACCGGCTGGG CGCCATGGCC
ATCGCCGAGC TCAACCAGAC GGTGTGGGAA CTGCAGCAGG ACGTATGGAC GCTCAGCGGA
CTCATGCATG AGCTCCCGAT GGAACTGCCG CGCCGCCACG AGATCCTGCG CGCCCTGGAA
CGGATGCTGG ACGTCATGGA TCCGGACGAT ATTCCCGGCA CCGCCGCGGC AGGCCGCGCA
GCCCTGGCTG AGGTTCTCTC CCGTCCGGCG TATGCCTCCG CCCATCAGCT GGTCGCCACA
GGACACGCGC ACATCGATTC GGCGTGGCTG TGGCCCGTCC GGGAGACCAT CCGCAAGTGC
GCGCGCACCT TCTCCAACGT CGTGGCCCTG ATGGACGAGT CTCCCGACTT CGTTTTCTCC
TGCTCGTCCG CGCAGCAGCT CGCCTGGATG AAGGAGTTCT ACCCCGAGCT GTTCGGCCGG
ATCCGCGAGA AGGTCAAAGC GGGCAAATTC GTGCCGGTCG GCGGCATGTG GGTGGAATCC
GACACCAACA TGCCGGGCGG TGAGGCCATG GCCCGGCAGT TCATCGAAGG CAAGGGGTTC
TTCCTCGACG AGTTTGGCGT GGAGTGCCGG GAGGCATGGT TGCCCGATTC CTTCGGCTAC
TCGGCTGCAT TGCCGCAGAT CGTCAAGGCT GCCGGCAGCA AGTGGTTCCT GACCCAGAAG
ATCTCCTGGA ACCAGGTCAA CAGGATGCCG CACCACACCT TCAACTGGGA AGGAATCGAC
GGCACGCGGC TGTTCACCCA CTTCCCGCCC GTGGACACTT ACAATTCGGA GCTGAGCGGC
CGGGAACTGG CACATGCGGA ACGCAACTAC CGGGACCACG GCCGCGGAAC CGTCTCCCTG
GTCCCGTTCG GCTACGGCGA CGGCGGCGGC GGACCGACAC GGGAGATGAT CGCCGCCGCC
CACCGTACGG CCGATCTCGA AGGGTCGCCG AAGGTCCGGA TCGGAACGGC TGCGAATTTC
TTCACGCAGG CGCAGGCCGA ATATGCGTCC CTGCCCGTCT GGGTGGGGGA GATGTACCTG
GAGCTGCACA GAGGGACCTA CACCAGCCAG GCGAAAACCA AACGGGGCAA CCGGCGCAGC
GAACACCTTC TCCGCGAGGC CGAACTGTGG TGTGCCACAG CATCAGTGCG CACCGCCGGC
GGGTTCGCGT ATCCGGCGGC CGAGTTGAAG CGCCTGTGGC AGCTGGTCCT GCTGCAGCAG
TTCCACGACA TCCTGCCCGG CAGTTCCATT GCCTGGGTCC ACCAGGACGC AGAGCGGAAC
TATGCGGCCA TCGCGGAAGG CCTTGAAGCC ATCATTGCCG ATGCCGCGCG CGCCATGCTC
GGTGAGGGCA GCCGCGAGTT CCTGCTGAAC GCCGCGCCGC ACGAACGCAG CGGAGTACCC
GCTCTTGCCG CCGCCGAACC GGTCCGGAGC GACCACCCGG TGACGGTCAC CGAGCATGCC
GGGGGATACA TCCTGGACAA CGGCGTGATC AGGGCCGTGC TGGACTCGAA CGGACTCCTG
ACTTCCCTCA TCGACCACGC AAGCGGCCGC GATGCCATCG CCCCCGGCCA GTACGGGAAC
CATCTGGAAC TACACCGCGA TACGCCCAAC GAGTGGGACG CGTGGGACAT TGACGAGTTC
TACCGCCGCA ACGTCACCTC ACTGACCGAA GCGCGTTCGG TGACGCTCGA GCGGGGCGGC
TGGGACGCCG TCGTCGTGGT GGAACGACTG GCGGGAGCAT CCGCGATCAC CCAGCGGATT
TCGCTGGAGG CGGGTTCCGG CTCGCTGGGC ATCCTGACCT CCGTGGATTG GCAGGAACGC
GAAAAGCTGC TCAAGATCGG ATTCCCCCTG GACGTGCGAG CAGACCGTTC GGCGTCAGAG
ACGCAGTTCG GGCATGTCTT CCGGCCCACC CACACCAACA CATCCTGGGA GGCAGCCAAG
TTCGAAATTT GTGCCCACCG CTGGATTCAT GTGGCGGAGC CGGGTTACGG CGTGGCGGTC
ACCAATGCCT CCAGTTATGG ACACGACGTC ACCCGCACCG TGAGGGACGA CGGCGGCACC
ACCACTACCG TCCGTACCTC GCTGCTGCGC GCACCCAAGT ATCCGGATCC CGACGCCGAC
CGCGGGCGGC ACGAGCTGCT GGTGACCATC AGGCCCGGGG CGGCCATTGC TGACGCCGTG
GAGGAGGGCT ACCGGACCAA CCTGGCCCCG CGGATCATGA GGGGCGCCAA CGCTGTCCTT
CCACTGTTCA CGGTGTTCAA CCAGGGAATC GTGGTTGAGG CGGTAAAGCT GGCGGAGGAC
GGTTCCGGTG ACGTCATTGT GCGTCTCTAT GAGTCCCTGG GGGAGCGGTC CGAGGGAATC
GTGACAGCCA ATTTCGAAAC CAGGCAAGTG CAGGTAGTGG ACCTGCTGGA GCGTCCGGTT
GCGGGCCCGG GTGTTGAAAC CGGCCGGGAT TCCGCAAAGC TGACGTTGCG TCCGTTCCAG
CTGCTCACCC TGCGGTTTGC CCGCTGA
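
The gene sequence above is printed in blocks of 10 bases, so the whitespace has to be removed before any computation. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. The function name `gc_fraction` is hypothetical, and the sample is only the first line of the sequence, so its value is not expected to equal the figure reported for the whole gene.

```python
def gc_fraction(blocked_sequence: str) -> float:
    """Return the fraction of G and C bases in a whitespace-blocked DNA string."""
    # Remove the spaces and line breaks that separate the 10-base blocks.
    bases = "".join(blocked_sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First line of the gene sequence above, used only as a small sample.
sample = "TTGCACGACG ACCGCCGCAT CACGGAAGTC CGTCTGGACC GCTTCATGCG CGAACGCGTG"
print(f"{gc_fraction(sample):.2%}")
```

Passing the full sequence block in place of `sample` gives the figure for the whole gene.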
 
Protein sequence
MHDDRRITEV RLDRFMRERV DPAVYSRSVP LNLSAWDVPD EPVSVLEALR HDFVPLEHGS 
AWGRPWSTKW LRLQGEVPDS WGTAPDTAVE IVVDLGFTRE LPGFQCEGIA WRPDGTIIKA
ISPRNQYIPL KLLGSGMAVD FYVEAAANPD VAQGWTFAAM PYGDKATAGS DPKYRLGAMA
IAELNQTVWE LQQDVWTLSG LMHELPMELP RRHEILRALE RMLDVMDPDD IPGTAAAGRA
ALAEVLSRPA YASAHQLVAT GHAHIDSAWL WPVRETIRKC ARTFSNVVAL MDESPDFVFS
CSSAQQLAWM KEFYPELFGR IREKVKAGKF VPVGGMWVES DTNMPGGEAM ARQFIEGKGF
FLDEFGVECR EAWLPDSFGY SAALPQIVKA AGSKWFLTQK ISWNQVNRMP HHTFNWEGID
GTRLFTHFPP VDTYNSELSG RELAHAERNY RDHGRGTVSL VPFGYGDGGG GPTREMIAAA
HRTADLEGSP KVRIGTAANF FTQAQAEYAS LPVWVGEMYL ELHRGTYTSQ AKTKRGNRRS
EHLLREAELW CATASVRTAG GFAYPAAELK RLWQLVLLQQ FHDILPGSSI AWVHQDAERN
YAAIAEGLEA IIADAARAML GEGSREFLLN AAPHERSGVP ALAAAEPVRS DHPVTVTEHA
GGYILDNGVI RAVLDSNGLL TSLIDHASGR DAIAPGQYGN HLELHRDTPN EWDAWDIDEF
YRRNVTSLTE ARSVTLERGG WDAVVVVERL AGASAITQRI SLEAGSGSLG ILTSVDWQER
EKLLKIGFPL DVRADRSASE TQFGHVFRPT HTNTSWEAAK FEICAHRWIH VAEPGYGVAV
TNASSYGHDV TRTVRDDGGT TTTVRTSLLR APKYPDPDAD RGRHELLVTI RPGAAIADAV
EEGYRTNLAP RIMRGANAVL PLFTVFNQGI VVEAVKLAED GSGDVIVRLY ESLGERSEGI
VTANFETRQV QVVDLLERPV AGPGVETGRD SAKLTLRPFQ LLTLRFAR
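
The protein sequence begins with M although the gene sequence begins with TTG. This is consistent with translation table 11, which uses the same codon assignments as the standard code but accepts TTG, among others, as a start codon. The sketch below is a minimal illustration, not the database's own method. It assumes the input is a complete coding sequence that begins at a valid start codon, and the names `CODON_TABLE` and `translate_cds` are hypothetical.

```python
# Standard codon assignments in TCAG order; table 11 shares these assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(blocked_sequence: str) -> str:
    """Translate a whitespace-blocked CDS, stopping at the first stop codon."""
    dna = "".join(blocked_sequence.split()).upper()
    usable_length = len(dna) - len(dna) % 3
    residues = []
    for position in range(0, usable_length, 3):
        amino_acid = CODON_TABLE[dna[position:position + 3]]
        if amino_acid == "*":
            break
        # Assumption: the first codon is a valid start codon, read as methionine.
        residues.append("M" if position == 0 else amino_acid)
    return "".join(residues)


# First line of the gene sequence in this record.
first_line = "TTGCACGACG ACCGCCGCAT CACGGAAGTC CGTCTGGACC GCTTCATGCG CGAACGCGTG"
print(translate_cds(first_line))  # prints: MHDDRRITEVRLDRFMRERV
```

The output matches the first 20 residues of the protein sequence above.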