Gene Arth_3038 details

Gene Information

Locus tag: Arth_3038
Symbol:
ID: 4444302
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 3404242
End bp: 3405948
Gene Length: 1707 bp
Protein Length: 568 aa
Translation table: 11
GC content: 68%
IMG OID: 639690862
Product: alpha amylase, catalytic region
Protein accession: YP_832517
Protein GI: 116671584
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
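
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Codon count minus one, assuming a single terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3404242, 3405948)
print(length)                           # 1707
print(expected_protein_length(length))  # 568
```

Both values agree with the Gene Length and Protein Length fields of this record.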


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGTCCACCA CCGCCACTCT GGCAACCCTG TCCGACTCAA ACCTCGCAGC CGATCCCAAC 
TGGTGGCGCC AGGCCTCCGT GTACCAGATC TATCCGCGCA GCTTCTCGGA TTCGAACGGT
GACGGCCTGG GCGACATCAA GGGCATCACC GCCAAAGTGC CCTACCTGAA GGAACTGGGG
ATCGACGCCG TGTGGCTCAG CCCGTTCTAC CCCTCCGCGC TCGCAGACGG CGGTTACGAC
GTCGACGACT ACCGCAACGT TGACCCCAAG CTGGGCACCC TGGAGGACTT CGCCGAAATG
TCCGCCGCGC TGCACGAAGC CGGCATCAAG CTCATCGCGG ACATCGTCCC CAACCACTCC
TCGAACCGGC ACGAATGGTT CAAGGAAGCA CTCGCCGCAC CGCGGGGCTC CGCCGCCCGC
GAACGCTACA TCTTCCGTGA CGGCCTGGGC GAGAACGGCG AGCTGCCGCC GTCGGACTGG
GACTCCGTCT TCGGCGGTCC CGCCTGGGAG CGCATCACCG AACCGGACGG CACGCCGGGC
CAGTGGTACA TGCACATCTT CGCCAAGGAG CAGCCGGACC TGAACTGGTC CAACCGCGAA
ATCCGCGATG ACTTCCTGAA GACCTTGCGC TTCTGGTCCG ACCGCGGCGT GGACGGCTTC
CGCGTGGACG TGGCACACGC CCTCACCAAG GACCTCACCG AGCCCCTTCT CTCCAAGGTC
GAACTCAGCG AGGCAAACAC CGGCACTGAC GGTTTCGCCG ATGGCTCGCA CCCCTTCTGG
GACCGCGACG AAGTCCACGA GATCTACGCC GAATGGCGTG AGGTGTTCAA CGAGTACAAC
CCGCCGCGCA CCGCCGTCGC CGAAGCCTGG GTGCACGCCA CCCGGCGCGC CCGCTACGCC
AGCCCGCAGG GCCTGGGCCA GGCGTTCAAC TTCGACCTCC TGCAGGCCGA TTTTGATGCC
GAGGAGTTCC AGGAAATCAT CACCCGGAAC CTCGCCGAGG CAACGGCTAC CGGCGCATCC
TCGACGTGGG TCTTCTCCAA CCACGACGTC GTCCGGCACG CCACCCGCTA TGGCCTCCCG
ATGGGCGGCG GAGTCCACGC CAAGGGCCAG GACGGCAAGG GCTGGCTGCT GGCCGGCGCG
CCGGCTGAGG AACTGGATGT CGAGCTCGGC CTGCGCCGCG CCCGTGCGGC CAGCCTGCTG
ATGCTTGCCC TCCCGGGCTC GGCCTACCTG TACCAGGGCG AGGAACTTGG CCTGCAGGAA
GTGGCCGAGA TCCCGGACTC GGAGCGCCAG GACCCGTCCT TCTTCCGCAA CAAGGGCGTG
GAGATCGGCC GTGACGGCTG CCGCGTGCCG CTGCCGTGGT CCACGGATGG AACGTCCTTC
GGCTTCGGCG CCGGCGATGC CCACCTGCCC CAGCCGGCAT GGTTCGCCCG CTACGCAGTG
GACGCCCAGG ACGGCGTGGA GGGCTCCACC CTGGAGCTGT ACCGCAAGGC GCTGAAACTG
CGTCGCGAGC TGCAGACGGC CGAAGAGCTG GAATGGGTGG AGACCGGCAA CCCCGAGGTC
CTGCACTTCA GTCGCCACGG CGGCTGGCAG TCGGTGACCA ACTTCGGTGC GGAGCCGGTG
GAACTCCCGG CAGGCGAGGT CCTGCTCAGC AGCGCCCCGC TGGACGGAAA GCTGCTGCCC
GGCAACACCA CTGTCTGGCT GCGGTGA
 
Protein sequence
MSTTATLATL SDSNLAADPN WWRQASVYQI YPRSFSDSNG DGLGDIKGIT AKVPYLKELG 
IDAVWLSPFY PSALADGGYD VDDYRNVDPK LGTLEDFAEM SAALHEAGIK LIADIVPNHS
SNRHEWFKEA LAAPRGSAAR ERYIFRDGLG ENGELPPSDW DSVFGGPAWE RITEPDGTPG
QWYMHIFAKE QPDLNWSNRE IRDDFLKTLR FWSDRGVDGF RVDVAHALTK DLTEPLLSKV
ELSEANTGTD GFADGSHPFW DRDEVHEIYA EWREVFNEYN PPRTAVAEAW VHATRRARYA
SPQGLGQAFN FDLLQADFDA EEFQEIITRN LAEATATGAS STWVFSNHDV VRHATRYGLP
MGGGVHAKGQ DGKGWLLAGA PAEELDVELG LRRARAASLL MLALPGSAYL YQGEELGLQE
VAEIPDSERQ DPSFFRNKGV EIGRDGCRVP LPWSTDGTSF GFGAGDAHLP QPAWFARYAV
DAQDGVEGST LELYRKALKL RRELQTAEEL EWVETGNPEV LHFSRHGGWQ SVTNFGAEPV
ELPAGEVLLS SAPLDGKLLP GNTTVWLR
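
The sequences above are laid out in space-separated groups of 10 letters, 60 per full row, so they need to be cleaned before any analysis. The following is a minimal sketch, with hypothetical helper names, that strips the layout and computes a GC fraction. It uses only the first and last rows of the gene sequence as input. Note that the gene begins with TTG, which is an alternative initiation codon under translation table 11 and is read as methionine at the start of a protein; this is consistent with the protein sequence beginning with M.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First and last rows of the gene sequence, copied from the record.
first_row = "TTGTCCACCA CCGCCACTCT GGCAACCCTG TCCGACTCAA ACCTCGCAGC CGATCCCAAC"
last_row = "GGCAACACCA CTGTCTGGCT GCGGTGA"

first = clean_sequence(first_row)
last = clean_sequence(last_row)
print(len(first), first[:3])  # 60 TTG
print(len(last), last[-3:])   # 27 TGA
```

Passing the full cleaned gene sequence to `gc_fraction` would give a value to compare against the GC content field of the record.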