Gene Arth_0743 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_0743 
Symbol 
ID4446748 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp800116 
End bp801837 
Gene Length1722 bp 
Protein Length573 aa 
Translation table11 
GC content67% 
IMG OID639688548 
Productalpha amylase, catalytic region 
Protein accessionYP_830241 
Protein GI116669308 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.802881 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCAAAG CCCCCGCCGT CGCGGCAGGG CCGCTGACCC TGGTCCACAC TGCTGATGAA 
GCTTCCGGCT GGTGGCGGTC CGCCGTCATC TACCAGGTCT ATCCGCGCTC CTTTCGGGAC
CTGAACGGAG ACGGCATCGG CGATCTTGCC GGCATAACAG CGGAGCTGCC CCAGTTGGCC
CGGCTCGATG TGGATGCCGT CTGGCTGTCG CCGTTCTACC GCTCACCGCA AAAGGACGCC
GGCTATGACG TCAGCGACTA CTGTGACGTC GATCCGCTGT TCGGCACCCT GGCTGACTTC
GACGCCATGA TGGTGGAGGC AACCCGCCTG AAGCTGCGGG TGATCGTGGA CCTCGTTCCC
AATCACTGCT CGGACCAGCA CGCCGCTTTC CAGGCTGCCC TCGCCGCCCC CGCCGGCAGT
GCGGAACGTG ACATGTATAT CTTCCGGGAC GGCCTGGGAA CCTATGGTGA AGAGCCTCCC
AACAACTGGC AGTCGCACTT TGGCGGACCC GCCTGGACAC GAATCACGGA GCCGGACGGC
CGGCCCGGGC AGTGGTACCT CCACCTCTTC GACACCTCCC AGCCCGACTT CAACTGGGAC
AACCAGGCAG TCCACGATGA GTTCGAGCGG GTGCTGCGTT TCTGGCTGGA CCGGGGCGTG
TCCGGATTCC GCGTGGATGT GGCCCACGCA CTGGTGAAGG CTCCCGGCCT GCCGGAGTGG
GGCGGCCGGG CCGACGGCAA CAGCTGCGAC GGCTACCCCG GCCATGACGC ACCCATGTTC
GGCCAGCCGG CCCTGCATGA CATCTACCGG GCCTGGCGCC GTATCCTGGC CGAGTACGGT
CCGGACCGCA TCCTGTGTGC CGAAGCCAAT GTGGACCCGC TGCCCCGTTT GGCCGACTGG
GTCCGCCCGG ACGAAATGCA CCAGGCGTTC AATTTCCCGT ATCTCCATGC GGGCCTGGAC
GTCCACCGCC TGCGCGGTGT CATCACTGAC TCCCTGGTGG CGCTGGACGC CGTCGGCGCG
CCAAGCACAT GGGTACTGTC CAACCACGAC GTCGTCCGCC ACGCCACCCG TTTCGGCTAC
GACGGCCCTG CCCCGCGCGA CGGCGACGGG ATCGGCACCT TTGACCGGCA GCCCGACCTG
GCCCTGGGCA GGACCAGGGC TGCCGCCGCC TCCATGTTCA TGCTGGGGCT TCCCGGCGGG
GCGTACCTCT ACCAGGGTGA AGAACTGGGC CTTCCGGACG GAATCGATAT CCCGGACAGC
CAGCGCCAGG ACCCCACGTT CGCGCGCACC GGCGGAGAGC GGTTGGGCCG CGACGGCTGC
CGGGTGCCCC TTCCCTGGCG TGCCGCTGAA CTGCACGCCG GCTTCGGCTC GGGACAGGAT
CCGTGGCTGC CGCAGCCTGC GAGCTTCAGC GAACTGGCGC GCGACGCGCA GGCCGAAGAG
CCGACGTCGC ATTTGAACCT CTACCGTCGG ATGCTGTCCA TGCGGAGGGA GCTGGACCTG
GGCAGAGGCT CGCTGGCTTG GCTGGAAAGC TGGTGCAGCG ATTCGTCCCT GGCCTACGTC
AACGGCACCA CCCTTGTGGT CATGAACACG GGACGTGAAC CGTTGGAACT CCCGGCCGGC
CGGGTGCTGC TGCGCAGCTC CGCGGAGTCA GCTGCGACGA ACTCGGGCGG TCATCAGCTA
GGCTCGGGTG AGACGGCCTG GTTGACCCTT GAGGTTGGGT GA
 
Protein sequence
MLKAPAVAAG PLTLVHTADE ASGWWRSAVI YQVYPRSFRD LNGDGIGDLA GITAELPQLA 
RLDVDAVWLS PFYRSPQKDA GYDVSDYCDV DPLFGTLADF DAMMVEATRL KLRVIVDLVP
NHCSDQHAAF QAALAAPAGS AERDMYIFRD GLGTYGEEPP NNWQSHFGGP AWTRITEPDG
RPGQWYLHLF DTSQPDFNWD NQAVHDEFER VLRFWLDRGV SGFRVDVAHA LVKAPGLPEW
GGRADGNSCD GYPGHDAPMF GQPALHDIYR AWRRILAEYG PDRILCAEAN VDPLPRLADW
VRPDEMHQAF NFPYLHAGLD VHRLRGVITD SLVALDAVGA PSTWVLSNHD VVRHATRFGY
DGPAPRDGDG IGTFDRQPDL ALGRTRAAAA SMFMLGLPGG AYLYQGEELG LPDGIDIPDS
QRQDPTFART GGERLGRDGC RVPLPWRAAE LHAGFGSGQD PWLPQPASFS ELARDAQAEE
PTSHLNLYRR MLSMRRELDL GRGSLAWLES WCSDSSLAYV NGTTLVVMNT GREPLELPAG
RVLLRSSAES AATNSGGHQL GSGETAWLTL EVG