Gene Arth_0403 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_0403 
Symbol 
ID4447130 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp429621 
End bp431192 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content70% 
IMG OID639688202 
Productglycosyl hydrolase family 32 protein 
Protein accessionYP_829904 
Protein GI116668971 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1621] Beta-fructosidases (levanase/invertase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0404573 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGAAA TGACCCATCC TTTGGCCACG GTGCCGCGGA ACGAACTCGT GGCGCGGGCC 
GAGGCGGACC CGCTCCGCCC GCGCTTCCAC TTTGTCTCGC CCGCCGGCTG GCTCAACGAT
CCCAACGGTG TGGCCCAGTG GAGCGGGACT TACCACCTCT TCTACCAGTA CAACCCGGAG
GGAGCCTTCC ACCACCGCAT CCTGTGGGGG CACGCCACCA GCCCGGACCT GGTCCACTGG
ACCGACCAGC CCGTGGCACT GGAGCCTTCC GGCGGTCCGG ATGCCGACGG CTGCTGGTCC
GGTGTGCTGG TGAACGACGG CGGGACGCCC ACCCTGGTGT ACTCCGGACG GCACGGCGGC
AGCGAACTGC CCTGCGTCGC GGTCGGCTCG CCGGACCTCG TGAACTGGAC CAAAGCCCCG
GAAAACCCGG TGATCCCGGC GCCGCCGGCC GGCGTGGACA TCACCGCCTA CCGCGACCAC
TGCGTCTGGC GGGAAGGCAC GCGCTGGCGC CAGCTCGTGG GATCCGGCAT CCGCGGGCGC
GGCGGCACGG CGTTCCTCTA CGAATCCGCG GATCTCCGGC GCTGGGACTA CATCGGCCCG
CTGGTCATCG GCGACGCGTC CTCCGGAGAC CCGGCAGCCA CTAACTGGCA GGGCACCATG
TGGGAATGCG TGGACCTGTT CCGCGCAGGC GACGGGATTC TGGGGGATAG GGCATTGGAA
TCCCAGACGC CGGGCACGGA CGTGCTCGTC TTTTCGGCCT GGCACGACGG CGATACCCGC
CACCCGCTGT ACTGGACCGG CAGTTACGCC GGGGATTCCT ACACGCCGCG CGAACTCCAC
CGGCTGGACT ACGGCGGCCG CTACTTCTAC GCCCCGCAGT CCTTCGCGGA CGAGTCCGGC
CGCCGGGTCA TGTTCGGCTG GCTGCAGGAG GGCAGAACGG ATGGCGCCAT GGTGGAAGCC
GGCTGGTCCG GCGTGATGAG CCTGCCGCGG GTGGCCTCCC TGGATGCCCA CGGCGGGCTG
GCCTTCGCTC CCGTGCCGGA GGTGGAGCTG CTGCGGCGCG ACCACGTCCG GACCGGTCCC
CGAACGGTGG GCACCGGCGA GGTCCTGGCC GGGGTGTCCG GGAACCAGCT GGACCTGGAG
CTTGACCTGG AACTGGAGCC CGGGAGCGTC TTCCGGCTGG GCGTGCTTGG CTCCGGCCCA
GGAGGCCCTG ACGGTGTGCC GGCCGGGGCG GAAGAAACAG TCATCGAGGT CGGCTACACC
GTGGGCAGCG GCGGCTCCGA GCAGTCCTAT GTGCTCCTGG ACCGCGTGAA CAGCAGCCTG
GACCGGACCG TGGACGCGGA GGAAAAGTCC GGCCCGGTGC AGCTGCCCGG CGGAAAACTG
CACCTCCGCG TGCTGGTGGA CCGCTCGGCG CTGGAGATCT TCGCCAACGG CAAACCGCTC
ACGGCCCGGG CCTATCCGAC GCTCGGCGGG GAAAACGTAA GGCTGTCCGC CGCCGGTACA
GTCCGGCTGC TGCAGCTGGA CGCCTGGCGG ATGGAAGGAG TCTTCGGCGC CCCGCGCCCG
CTGTTCCCGT AG
 
Protein sequence
MTEMTHPLAT VPRNELVARA EADPLRPRFH FVSPAGWLND PNGVAQWSGT YHLFYQYNPE 
GAFHHRILWG HATSPDLVHW TDQPVALEPS GGPDADGCWS GVLVNDGGTP TLVYSGRHGG
SELPCVAVGS PDLVNWTKAP ENPVIPAPPA GVDITAYRDH CVWREGTRWR QLVGSGIRGR
GGTAFLYESA DLRRWDYIGP LVIGDASSGD PAATNWQGTM WECVDLFRAG DGILGDRALE
SQTPGTDVLV FSAWHDGDTR HPLYWTGSYA GDSYTPRELH RLDYGGRYFY APQSFADESG
RRVMFGWLQE GRTDGAMVEA GWSGVMSLPR VASLDAHGGL AFAPVPEVEL LRRDHVRTGP
RTVGTGEVLA GVSGNQLDLE LDLELEPGSV FRLGVLGSGP GGPDGVPAGA EETVIEVGYT
VGSGGSEQSY VLLDRVNSSL DRTVDAEEKS GPVQLPGGKL HLRVLVDRSA LEIFANGKPL
TARAYPTLGG ENVRLSAAGT VRLLQLDAWR MEGVFGAPRP LFP