Gene Arth_4088 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_4088 
Symbol 
ID4447678 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4612270 
End bp4613961 
Gene Length1692 bp 
Protein Length563 aa 
Translation table11 
GC content64% 
IMG OID639691919 
Productalpha amylase, catalytic region 
Protein accessionYP_833563 
Protein GI116672630 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID[TIGR02456] trehalose synthase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGATCG CAGAGACGTC CGATCTGTGG TGGAAGAATG CCGTGGTCTA CTGCCTGGAC 
CCGGAAACCT TTTTCGACGA CGACGGCGAC GGCACCGGAG ATTTCGGCGG CCTGATTCAG
CGCGTGGACT ACCTGGCGGC CCTCGGGGTC ACGTGCATCT GGCTCATGCC GTTCTACCCC
TCGCCGGACC GGGACGACGG CTACGACATT ACCGACATGT ACGGTGTGGA CCCGCGGCTC
GGCACCCTGG GCGACGTGGT GGAGTTCATC AGGACGGCGA AGGACCGGGG AATGCGGGTG
ATCGCGGACT TCGTCATCAA CCACACCTCG GACAAGCACC CCTGGTTCAA GGAATCCAGG
AAGTCCGTCG ACAATCCCTA CCGTGACTAC TACGTGTGGC GGAAGGACAC TCCCCCGGAC
ACGTCGGAAC AAGTGGTTTT CCCCGGCGAG GAAACGTCCA TCTGGACCCA GGACAAGGCG
ACGGGCGAGT GGTATCTGCA CATGTTCGCC AAGCACCAGC CGGACCTGAA TGTCGCCAAT
CCGAAGGTCC GCGACGAGAT CGCCAAGTCC ATGGGGTTCT GGCTCCAGAT GGGGCTGGAC
GGATTCCGGC TGGATGCCGT GCCGTTCTTC CTGGAGCTTC AGGGCGTGTC CAAGGAGGAC
GCGGCCAAGA TCGATCCGCA CGACTACCTG GCCGCGCTGC GCAGCTTCCT GAACCGGCGC
AACGGCAGCG CGGTGCTGCT CGGAGAGGTC AACCTCCCCT ACAAGGAGCA GTTGAAATAC
TTCGGCGGTC CGGACGGCAA CGAGCTGAAC ATGCAGTTCG ACTTCCTGAG CATGCAGAAC
CTCTATCTTT CCCTGGCCAG GGAGGACGCC CGTCCGCTGG CCAAAACCCT GGCCGGCAGG
CCGGCCATCC ATCCGGACAA CCAGTGGGCC ATGTTCGTCC GCAACCATGA TGAACTGACG
CTGGACAAGC TGAGCGACGA GGAACGCGCG GAGGTTTTCG CCGCCTTCGG GCCGGACCCG
GACATGCAGA TGTACGGGCG GGGACTCCGG CGCCGCCTGC CGCCGATGCT CGACGGCGAT
CCCGCCCGGA TCCGAATGGT CTATTCGCTG ATGTTCTCCC TCCCCGGAAC CCCGGTCCTT
TTCTATGGCG AGGAGCTCGG GATGGGTGAG GACCTGCGGG CGAAGGGCCG CTCCGCCGTG
CGCTCCCCCA TGCAGTGGAC TGACACGGCA AACGGCGGGT TCTCCACCGC TCCGGCGGAC
AAGCTGGTGG CGCAGGTAGT GGACGGCTAC TTCGGGCCTA AGAACATCAA CGCGGCACAG
GCGAAGCGTG ACCCGGACTC GCTGTGGAAT TTCATCGCCG CACTGATCCG AAGCTACCGG
GAGAGTCCGG AGCTTGCCTG GGGCGACTTC GAGCTCATCA AGCAGTCCAA TCCCGGAGTG
CTCCTGCACA GCTGCACCCG CGCCGGTTCA ACGCTGGTCC TGGCGCACAA CATGGCCGCG
CAGCCGGCCT CCGTCTCGGC AAAGGTTTCC TCGCCGGAAG ATCCGGAGGA GGCATTCGGC
GGTGCCATCC TGCGGGATCT GCTCGACGGC GACAATGTCC CCTTGGCGGA CGACGGCGGC
TTCGAACTCG AGCTGGAACG CTACGGCTAC CGCTGGTTCC GGATCCAGCA CCCCGCCGAC
AGGCGGATAT AG
 
Protein sequence
MRIAETSDLW WKNAVVYCLD PETFFDDDGD GTGDFGGLIQ RVDYLAALGV TCIWLMPFYP 
SPDRDDGYDI TDMYGVDPRL GTLGDVVEFI RTAKDRGMRV IADFVINHTS DKHPWFKESR
KSVDNPYRDY YVWRKDTPPD TSEQVVFPGE ETSIWTQDKA TGEWYLHMFA KHQPDLNVAN
PKVRDEIAKS MGFWLQMGLD GFRLDAVPFF LELQGVSKED AAKIDPHDYL AALRSFLNRR
NGSAVLLGEV NLPYKEQLKY FGGPDGNELN MQFDFLSMQN LYLSLAREDA RPLAKTLAGR
PAIHPDNQWA MFVRNHDELT LDKLSDEERA EVFAAFGPDP DMQMYGRGLR RRLPPMLDGD
PARIRMVYSL MFSLPGTPVL FYGEELGMGE DLRAKGRSAV RSPMQWTDTA NGGFSTAPAD
KLVAQVVDGY FGPKNINAAQ AKRDPDSLWN FIAALIRSYR ESPELAWGDF ELIKQSNPGV
LLHSCTRAGS TLVLAHNMAA QPASVSAKVS SPEDPEEAFG GAILRDLLDG DNVPLADDGG
FELELERYGY RWFRIQHPAD RRI