Gene Arth_0425 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_0425 
Symbol 
ID4447085 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp454316 
End bp455809 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content64% 
IMG OID639688224 
Productlevanase 
Protein accessionYP_829926 
Protein GI116668993 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1621] Beta-fructosidases (levanase/invertase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACACCG CCACGCCCAC CCTTGGGGCC TACCGGCCGG CGATGCACTA CGCCTGCAAG 
GACACCTGGC TCAACGACCC CAACGGTTTG GTCTTCCACG ACGGCATTTA CCACCTGTAC
TACCAGAACA ACCCGTTCGG GAACGTCCAC AGCAACATGT CCTGGGGCCA CGCCACATCC
ACGGACCTGG TGAACTGGGA CGAACAGCCG GTGGCCATCC CGTGCGACGA AACAGAGGAA
ATCTTCTCCG GCAGCATCGT GGTGGACCAG GACAACACCG CCGGCTTCGG CCTGTCCGGA
ACAACCCCGC TGGTTGCCGT CTACACCAGC GCCTACAAGG CCGGTTCGGC ACACGAGGGC
ATGCAGGCCC AGTCCATCGC GTGGAGCACC GACGGCGGAT ACACCTGGAC CAAGTACTCT
GGCAACCCGG TCCTCACCAG GAACTCGCCG GAATTCCGCG ACCCCAAGGT CTTCCGCTAC
GACGGCCCCG CCGGAAGCTA CTGGGTGATG GCCGCTGTCG AAGCCCACGA CTACGCGGTG
CTGCTCTATC GCTCCGGCGA CCTTAAGAAC TGGGAATACC TCAGCACCTT CGGCCCGGCC
AACGGCACCG GCGGCATCTG GGAGTGCCCG GACCTCTTTG AACTGCCCGT GGACGGAGAT
GCAGGAAACA CCAGGTGGGT CCTGACCGTG AACATGAACC CCGGCGGACC CAACGGAGGA
TCAGCAGGGC AGTACTTCGT GGGCGACTTC GACGGCGTGA CGTTCACCTC CGAGACCACC
GTCACGGACG GCATGCAGGA CCCGGAGCGC GTGGTTGACT ACCAGTGGCT CGATTGGGGC
CGGGACTACT ACGCGGCTGT GTCCTTCAGC AATGTCCCAG GCGGACGGCG GCTCATGATC
GGATGGATGA ACAACTGGCA GTATGCCAAC CACATCCCCA CTTCACCCTG GCGCAGCCCC
ATGAGCCTAG TGCGCGAAGT TGCCCTCGCC TCGATCGACG GCGAACCCCG GCTCGTCCAG
CAGCCCGTGC CTGCATGCAC CGAGGGTTCT GTTTCCGGAC CTACCCAGCG ACTGTCCCTG
AACGGCGGAG TAACAGTCGA CGGCGGCAAG GCAGTACAGC TGATCGAGGC CACCTTTACG
CCGGGGACAG CAAGCGAGTT CGGTTTGGTG GTCTGCGGCT CCGGGAGCGG TTCATCGGGG
ACCCGGATCA GCATCCGCCC CGGCAGCGGA CAACTCATGC TGGACCGAAC CAACTCCGGC
GACACCGGCT TCCACGAGGC CTTCCCGTCC ATCAGCACGG CACCACTGCA GGCAGACGCT
GGGACCTACT CCCTGCGGAT CTTCGTGGAC CACTGCTCGG TCGAAGTCTT CGCCCAGGGC
GGACTAGTGA CACTTACTGA GCTGATATTC CCGGATCCGG TGCACACCGG CATTACGGTG
TTCTCCACCG GGGGAACGGC GGAAGCTTCC CTGCAGCTTA CGGACTTGGC ATGA
 
Protein sequence
MNTATPTLGA YRPAMHYACK DTWLNDPNGL VFHDGIYHLY YQNNPFGNVH SNMSWGHATS 
TDLVNWDEQP VAIPCDETEE IFSGSIVVDQ DNTAGFGLSG TTPLVAVYTS AYKAGSAHEG
MQAQSIAWST DGGYTWTKYS GNPVLTRNSP EFRDPKVFRY DGPAGSYWVM AAVEAHDYAV
LLYRSGDLKN WEYLSTFGPA NGTGGIWECP DLFELPVDGD AGNTRWVLTV NMNPGGPNGG
SAGQYFVGDF DGVTFTSETT VTDGMQDPER VVDYQWLDWG RDYYAAVSFS NVPGGRRLMI
GWMNNWQYAN HIPTSPWRSP MSLVREVALA SIDGEPRLVQ QPVPACTEGS VSGPTQRLSL
NGGVTVDGGK AVQLIEATFT PGTASEFGLV VCGSGSGSSG TRISIRPGSG QLMLDRTNSG
DTGFHEAFPS ISTAPLQADA GTYSLRIFVD HCSVEVFAQG GLVTLTELIF PDPVHTGITV
FSTGGTAEAS LQLTDLA