Gene Arth_0404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_0404 
Symbol 
ID4447099 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp431280 
End bp435083 
Gene Length3804 bp 
Protein Length1267 aa 
Translation table11 
GC content70% 
IMG OID639688203 
Productlevanase 
Protein accessionYP_829905 
Protein GI116668972 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1621] Beta-fructosidases (levanase/invertase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAAAGA GGTTAGTTGC AGGTATTACC ACGGCCGCGC TAGGCCTGGG CGCGCTGGGA 
GCTGGCTCGC TGCTGGCCCC GGCCATGCTG GCCGCGCCCG CCCAAGCCGC TCCAGGCGGC
CCTTTCCCCG CCGGGCCGAT CACGCTGGGC AGCAGCAACA GTCCCATGCC GGACGTGGTG
GGCATCAGCG GAAACTGGAC CAGGCAGTCC GACGGCGGCC AGTCCGCCGT CGCCTCGCCG
GACCAAAACG CGGCCGCCAT CAGCGAACAG CGCATCAGCT CCACCGCCCG GTACACGGCC
GGAGTCACAG TCGATCCCGG CAGCCCGTAC GCCGTAGGCG CCCTGGTTTT CCGCAGTGCT
GCCGATGCCA GCAGCGGCTA TGCGGCCACC ATCGATCCCA ACCTGGACAG GGTGCGGCTG
TTCGATCTGG CCACTGGACA GGATGTGGCA CCTGCCGCCA CGGTCCCGCT GGACACGGGG
CGGAGCTACT CCGTGGACGT CCACGTGGAC GGGCCCCGGA TCTATGTGGC GGTGGACGGA
GTGCCGCGGA TCGACGCCAC CGACCAGCGG TACCAGGACG GCCATGTGGG CCTCCACGCC
TACAACGGCT CGGTGAACTT CAGCGAGCCC CGGGTGCGCA CCATCGACGC CAACGTCAGT
GGCTGGCAGG CGGCGGACGG TGCCGGCTGG ACCGCCTCGG CCACCGGCCT TCGGGGGACA
GCTCCGGGCG GCACCAACAT CAGGGCTGTC GCCACCGACC AGGCACCGCA GGTCACGGAT
TTCACCGCTG ATGTCCAGGT GACGTCGCCT TACGCCGTCG GCGCGGTCCT CTTCCGCACC
AACACTGCCG GAACCACCGG GTACGGGGCC GAAGCGGACG CCAACGCCGG CAGGCTGCGG
CTGTACCGTA TCGAGGACAA CGCCACCCTG GGCACCTTCG CGACCACCAT CACCGTCAAC
GCGGTATACC GGCTGCGCGT CACGGCCGAC GGCGGGCAGC TCGCCGTGCA CTGGCAGACG
GACCTGCTGG ACCCCAACGG CTACGCCCCG GCCATCACCG CAACAGACAC AGCGCACGGC
TCCGGGCACA TTGGCCTCCT CGCCTTCAAC GGCAGCACCG TGTTCCAGGG CATGACCCTC
AAAGGCCTGG ACACTTCCCT GCAGGGCTGG CGCACCGCCT CGGGAAGCTG GGAACCGGAC
GTCCGCGGCC TGCGCGGGGC AACCGACGGC CTGGCCGCCG GCGCGGATGC GGCACGCTTC
GTCCCGGCGG TGGCGTCCGA CGTCGTCGCG TCCCTTGACC TGGACGTCGC CGCCCCGGCA
ACGGCCGCCG TCGTGGTCCG GAGCGCGCCC GACGGCACCG GCGGCACCGA ACTCAGGGTG
GACCCCGGAG CAGGAACGGT GCAGCTGAAG GACCGCACCA CCGGGACCGT CCTTGCCGGC
GGGGCCGTAC CGCCGGACAG CTTTGCCGCC GGCCAGCTCA ACCGCATCCA ACTCACCGTC
CGCGGCGGGC AGGCAACCGC GCTGATCAAC GGCGTCCAGT CCGTTGCCGG GCCGGCCGGG
CCGGCGACAG GCAGCGGCTT CGGTCTGCGC GTCAGCGGCG GCGGCGCCTA CTTCCAAAAC
GTCCGGGCAG ACGACGTCGC CAGCTACATG AACGGCCTCT ACCAGCCCGG CTACCACTAC
AGCCAGAATT CCGGGAACAG TTCGGACCCC AACGGCCTGG TGTACTTCGA TGGCGAATAC
CACCTGTTCC ACCAGGACCG CGGCCGCTGG GCGCACGCCG TCAGCACCGA CCTGCTGCAC
TGGAAGCAGC TGCCCATAGC CCTGCCGCAC CTGGCTGCGG GGGAGTCATG GTCCGGCTCC
GCGGTGGTGG ATGCCAATGA TTCCAGCGGA CTGTTCGACG GCGGGCAGGG GCTCGTTGCG
TTCTACACGA GCTTCAACCA CGATGCCGCC AACGGCAACC AGTCCGTGCG CGCCGCGTAC
AGCAAGGACC ACGGCCGCAC CTGGTCCATC GTGCAGGCCC AGCCCGTGGT GGAAAACCCC
GGCGGTCCTG CCGGCAGCTG GGACTTCCGC GACCCCAAAG TCACTTGGGA CGCCGCCACC
GGCACATGGA TCATGGTGGT GGCCGGCGGG GACCACCTGC GCTTCCACAC GTCCACCGAC
CTGGTCCACT GGACCTTCAC CAGCGCCTTC GGCTACGGCG ACTGGGTCCG CGGCGGGGTC
TGGGAATGCC CGGACTTCTT CGAGCTGCCG GTGGAAGGGC AGCCCGGCGC CAGACGGTGG
GTGCTCTGGT GGAGCACCGG TGCCGTCAGG CCCACCAACG GCTCCGCGGC GCAATATGTC
ACCGGAACAT GGAACGGAAC GTCCTTCACC CCGGACACCG GCCCGGACGA GGTGCTCCAG
GCCGACTCCG GCCGCGACTA CTACGCTGCC ATGAGTTTCT TCGGTGCCCC GGACGGGCGG
CGGATCATGC TGGGCTGGAT GAGCAACTGG GACTATGCCT TCAGCCCGCC CACCGGCCGC
TGGAACGGGC AGCTGAGTGT CCCCCGGCAG CTGAGCCTGA AAGACATTCC GGGCGTCGGG
CCCCGGCTGG CGCAGGAACC CATAACCGAA CTGGCAGGGC TGCGCACGTC CACGTGGCAG
GCGTCCGACG TCACGGTCAC GCCGACGTCG GCCAACCCGC TGTCCGCTGC TTCCGGCCGG
TCGTTCGAGC TGGAGGCCGA GGTGGCCATC CCGTCCTCCG GCGGGGCTTC GGGCTTCACC
TTCGGGCTCC GCAAAGGCAC TGCGGGCGGC GCAGCCGGAG GGGAAGCCCA GGAGACACTG
CTCCGCTACG CCGCCGGCAC GGGGACAGCC TCGGCGACCG GCGCCGGCAC CGGCACCGGC
ACCGGCACCG GCACCATGAC GGTGGACCGC GGGCAGTCCG GCCGGGCAGA CTTCACCCGC
TATTTCGCGG GTGCCGCGGC GGACAATGCC AGCACGGCAT GGAGTTCCGA AACAGTGGCG
CTGGCCGGAG GGGGGAGCGA GCGGCGGGTG AAGCTGCGGG CCCTGGTGGA CTCCTCCTCG
GTGGAGCTCT TCGGCGGGGA CGGCACGGCA GCCATCACCT CCCTGGTCTT CCCGGACCCG
TCCTCCACCG GGCTGTCTTT CAGCACCACC GGCGGGTCCG CGCGGCTGGT GTCCGCCAAA
GTACACCAGC TGGCGGACAC CTCCCGCCTC ACTGCGGCGG TGCCGTCGGC CGTCCTTGGC
CCGGTGAGCG GCGCGGCCCG GCACAACCTG GATTCCTACT CCGTGGTTCC CGGCGGCCGC
TGGGAAAGCA CGGGAGCAGG GCTGGCCGGC ACGTTCGACA AGGACTCCAC CGCCCTGAGC
GCCGCCACGT ACTCGGATGT CCGGGTGGCC GCGACAGTCC GTTTCGGCAG CGGACCCTAC
GCGGGGGCCA TGCTCAACAG CGACAGGGTG CCGGAACGGG GTTACGGCGG CGCGGGGTCG
GTGCTGCTGC GGGCCTCGGC CGACGGTGCC ACGGCGTACT ACGTCAACCT CGACCCGAAC
CTGCGGCTGG CGCGCATCTT CAAGCTGCAG GACGGCGTCT TCGATTCCGC CGCCAGCGTC
CTGGCCAGTG TCCCCGTGCT GCTCAGCCAC GGCGTCAGTT ACAGCGTGGA AGCCGCCGCG
GCGGGGGAGC GGCTGACGGT GAAGCTCGAC GGCGTGGAAA TCCTCGCCGT CGATGATGCC
TCCCTGTCCG CCGGCAAGGT GGGACTGAAC GTGTTCGACG GCCGGGCAGC CTATCAGGAC
GTGCTGGTGA CAGGCTCGGG ATGA
 
Protein sequence
MSKRLVAGIT TAALGLGALG AGSLLAPAML AAPAQAAPGG PFPAGPITLG SSNSPMPDVV 
GISGNWTRQS DGGQSAVASP DQNAAAISEQ RISSTARYTA GVTVDPGSPY AVGALVFRSA
ADASSGYAAT IDPNLDRVRL FDLATGQDVA PAATVPLDTG RSYSVDVHVD GPRIYVAVDG
VPRIDATDQR YQDGHVGLHA YNGSVNFSEP RVRTIDANVS GWQAADGAGW TASATGLRGT
APGGTNIRAV ATDQAPQVTD FTADVQVTSP YAVGAVLFRT NTAGTTGYGA EADANAGRLR
LYRIEDNATL GTFATTITVN AVYRLRVTAD GGQLAVHWQT DLLDPNGYAP AITATDTAHG
SGHIGLLAFN GSTVFQGMTL KGLDTSLQGW RTASGSWEPD VRGLRGATDG LAAGADAARF
VPAVASDVVA SLDLDVAAPA TAAVVVRSAP DGTGGTELRV DPGAGTVQLK DRTTGTVLAG
GAVPPDSFAA GQLNRIQLTV RGGQATALIN GVQSVAGPAG PATGSGFGLR VSGGGAYFQN
VRADDVASYM NGLYQPGYHY SQNSGNSSDP NGLVYFDGEY HLFHQDRGRW AHAVSTDLLH
WKQLPIALPH LAAGESWSGS AVVDANDSSG LFDGGQGLVA FYTSFNHDAA NGNQSVRAAY
SKDHGRTWSI VQAQPVVENP GGPAGSWDFR DPKVTWDAAT GTWIMVVAGG DHLRFHTSTD
LVHWTFTSAF GYGDWVRGGV WECPDFFELP VEGQPGARRW VLWWSTGAVR PTNGSAAQYV
TGTWNGTSFT PDTGPDEVLQ ADSGRDYYAA MSFFGAPDGR RIMLGWMSNW DYAFSPPTGR
WNGQLSVPRQ LSLKDIPGVG PRLAQEPITE LAGLRTSTWQ ASDVTVTPTS ANPLSAASGR
SFELEAEVAI PSSGGASGFT FGLRKGTAGG AAGGEAQETL LRYAAGTGTA SATGAGTGTG
TGTGTMTVDR GQSGRADFTR YFAGAAADNA STAWSSETVA LAGGGSERRV KLRALVDSSS
VELFGGDGTA AITSLVFPDP SSTGLSFSTT GGSARLVSAK VHQLADTSRL TAAVPSAVLG
PVSGAARHNL DSYSVVPGGR WESTGAGLAG TFDKDSTALS AATYSDVRVA ATVRFGSGPY
AGAMLNSDRV PERGYGGAGS VLLRASADGA TAYYVNLDPN LRLARIFKLQ DGVFDSAASV
LASVPVLLSH GVSYSVEAAA AGERLTVKLD GVEILAVDDA SLSAGKVGLN VFDGRAAYQD
VLVTGSG