Gene Arth_0054 details


Gene Information

Locus tag: Arth_0054
Symbol:
ID: 4447489
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 60394
End bp: 61929
Gene Length: 1536 bp
Protein Length: 511 aa
Translation table: 11
GC content: 64%
IMG OID: 639687848
Product: levanase
Protein accession: YP_829555
Protein GI: 116668622
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1621] Beta-fructosidases (levanase/invertase)
TIGRFAM ID:
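
The coordinate and length fields above can be cross-checked against one another. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and are not part of any database tool.

```python
# Values copied from the Gene Information fields above.
START_BP = 60394
END_BP = 61929
GENE_LENGTH_BP = 1536
PROTEIN_LENGTH_AA = 511


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks hold for the values listed in this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)                # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)   # True
```

Under these assumptions, 61929 - 60394 + 1 gives 1536 bp, and 1536 / 3 gives 512 codons, one of which is the stop, leaving 511 residues.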


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTGAAA CAACAATGCA CCCTGCCGCC CCCGCCGAAG ACACCGCCGC CGACTTCCGG 
CCGGTACTTC ACTACACAGC CAAGAACACC TGGCTGAACG ACCCCAACGG ACTCGTGTGG
CACCAGGGCG TCTACCATCT CTTCTACCAA AACAACCCCT TCGACAACGT CTGGGGCAAC
ATGTCCTGGG GGCACGCCAC CTCAACCGAC CTTCTGCACT GGACCGAACA CCCGGTTGCC
ATCGCCTGCG ACGAGGAAGA AGACGTCTTT TCCGGCAGCA TCGTGGTGGA CCACGGCAAT
ACGTCGGGAT TCGGCACAGT GGAAGACCCT GCCCTGGTGG CCATCTACAC GAGCGCCTTC
AAGGAAGGCT CGGTGCACCA AGGGACACAA GCCCAGTCTC TCGCGTTCTC CACGGACGCC
GGCATGACGT GGAACAAGTA CGCAGGCAAT CCGGTGCTTG GCCGCGACTC GGCCCATTTC
CGGGATCCCA AAGTATTCCG CTACGAGGGA GCTGCCGGTT CCTGCTGGGT CATGGTGGCG
GTGGAGGCCC GGCGCCAGCA GGTTGTGCTG TACCGCTCGG CCGACCTCAA GGATTGGGAA
CACCTGAGCA CCTTCGGCCC TGCAAACGCG ACGGGAGGCG AATGGGAGTG CCCCGACCTG
TTCCCGCTCC CCGTCGACAG AGACCCGGAC AACGTCAAGT GGGTCCTCGT AGTCAATGTC
AATCCGGGTG CCGTGGCCGG CGGCTCGGGA GGGCAGTACT TCGTCGGCGA CTTCGACGGG
GTGAAGTTCA CTGCCGACCC TGATTCACTC GTTCCAGCCG ATGCCGACGG GACCACTGAT
CTCAGCCGCT GTCTGTGGCT CGACTGGGGA CGTGACTACT ACGCCGCCGT CTCCTTCAGC
AATGCCCCGG AGAACCGCCG TATCATGATC GGCTGGATGA ACAACTGGGA CTACGCCAAC
TTCTTGCCCA CGTCTCCATG GCGTTCCGGG ATGTCGCTTG CCCGCGAGAT CGAGCTCGCG
ACGGTGGACG GTTTGCCCCG CCTGGTGCAG CGCCCGGTAC TGCCATTGGA CAGCGGCGAG
CCGGCCTGCG CCATCCAGGA CGTGGAGCTT CACGACTCCC TGCTGCAACT GCCCGACGCA
ATGCCCGGAT CAGCCCAGCT GATCGACGCC GAGATCTTGC CCGGCACGGC CCGGACCGTT
GTTTTCCGGC TTCTCGGCGC ATCCGGCGGG AGCGCCGCAA CGGTTCTCAG CTTCGATGCC
GTGACGGGCC TGCTCACCCT GGATCGCCGC AACTCCGGAA ACACCGCCTT CCACGGAAAG
TTCGCGTCTG CCGAGTCGGC ACCGGTGAAG CTCGAAGCCG GCGTGCTAAG GCTCCGCGTA
ATCGTCGACC AGTGCTCGGT GGAGGTCTTT GCCCAAGGCG GCAGGGTCGT CCTGAGCGAT
CTGGTCTTCC CGATGTCCGG AAGCCTGGGC ACCGAAGTGT GCGTGGAGGG CGGCGCGGCC
TTTGTTCGGA AACTGGCCGT CACGGGCTTG TCCTGA
 
Protein sequence
MTETTMHPAA PAEDTAADFR PVLHYTAKNT WLNDPNGLVW HQGVYHLFYQ NNPFDNVWGN 
MSWGHATSTD LLHWTEHPVA IACDEEEDVF SGSIVVDHGN TSGFGTVEDP ALVAIYTSAF
KEGSVHQGTQ AQSLAFSTDA GMTWNKYAGN PVLGRDSAHF RDPKVFRYEG AAGSCWVMVA
VEARRQQVVL YRSADLKDWE HLSTFGPANA TGGEWECPDL FPLPVDRDPD NVKWVLVVNV
NPGAVAGGSG GQYFVGDFDG VKFTADPDSL VPADADGTTD LSRCLWLDWG RDYYAAVSFS
NAPENRRIMI GWMNNWDYAN FLPTSPWRSG MSLAREIELA TVDGLPRLVQ RPVLPLDSGE
PACAIQDVEL HDSLLQLPDA MPGSAQLIDA EILPGTARTV VFRLLGASGG SAATVLSFDA
VTGLLTLDRR NSGNTAFHGK FASAESAPVK LEAGVLRLRV IVDQCSVEVF AQGGRVVLSD
LVFPMSGSLG TEVCVEGGAA FVRKLAVTGL S
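
The sequences above are printed in space-separated blocks of 10 characters, 60 per line, so they need to be joined before any downstream use. The following is a minimal sketch of that clean-up plus a GC-fraction calculation; the helper names are hypothetical, and only the first line of the gene sequence is used here for illustration.

```python
def normalise_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence as printed above.
first_line = "ATGACTGAAA CAACAATGCA CCCTGCCGCC CCCGCCGAAG ACACCGCCGC CGACTTCCGG"
seq = normalise_sequence(first_line)

print(len(seq))              # 60
print(seq.startswith("ATG")) # True
print(round(gc_fraction(seq), 2))
```

Applying the same two functions to all lines of the gene sequence gives a value that can be compared with the GC content field reported in the Gene Information section.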