Gene Arth_2030 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2030 
Symbol 
ID4445439 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2288659 
End bp2290575 
Gene Length1917 bp 
Protein Length638 aa 
Translation table11 
GC content65% 
IMG OID639689838 
Productalpha amylase, catalytic region 
Protein accessionYP_831510 
Protein GI116670577 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.194844 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTGAAG CCGCTGGTGT GGACCAGGCA GCGAGGCAAC GTGCACTGAG CGCGCTGGCT 
GCGCAGGAGG ACCTCATGTC CCGCGGACAG GACAGTGTGG ACTTCCGGCG GCGCCTCGAC
CGTCATTTTC CTGACCTGTA CCGGCTCTTC CACACCCTGT ATGGCAGGCG CCCGGACTTC
GATGAGCAGT TGACGGCGCT GGTCCTCCAG ACGGCCCGTT CCTGGAACGA ACGCCCGGCG
GACCTCAAAG CCCTGGACGC AGAACGGGAA CACCATGCCG GGTGGTTCCT CGCCAACACC
ATGCTAGGCG GCGTCTGCTA CGTCGACCGC TATGCGGAGG ACCTCGAGGG AGTCCGCGCC
CGGATTCCGT ATTTCAAAGA GCTCGGCCTC ACCTACCTGC ATCTGATGCC GCTGTTCCTG
GCACCTGAAC CGCATTCCGA CGGCGGGTAC GCCGTCTCCA GTTACCGCCA GGTCAACCCG
AAACTGGGGA CCATGGAACA GCTGCGTGAA CTGGCCGCTG AGCTGCGGTC CAACGGCATC
AGCCTGGTGG TTGACTTCAT TTTCAACCAC ACCTCGGATG AGCACGCGTG GGCAAAGCGG
GCCGCGGCGG GCGATCCCGG GTACAGCGAT TACTACTGGA TCTACCCGGA CCGGGCCATG
CCGGACGCTT TCGAGCAGAA TGTGCGCGAG ATCTTCCCGG AGAACCACCC GGGGTCCTTC
ATCCAGATGG AGGACGGCCG CTGGGTCTGG GCCACGTTCC ACACGTACCA GTGGGACCTG
AACTACTCCA ATCCGGACGT CTTCCGGGCC ATGGCCGGGG AGATGCTCTT CCTGGCCAAC
CAGGGTGTGG ACATCCTGCG GATGGATGCA GTCGCCTTCA TCTGGAAGCA GCTCGGAACC
CCGTGCGAGA ACCTCCCCGA AGCCCACACC CTGCTGCAGG CCTTCAACGC CGTCTGCCGC
CTGGCCGCGC CGTCGCTGCT GTTCAAGTCC GAGGCGATCG TCCACCCCGA CGAAGTGGCG
CTGTACATCG ATCCGGCAGA GTGCCAGCTT TCCTACAACC CGCTGCAGAT GGCGCTGATC
TGGGAGTCGA TGGCCACCCG CGATGTGTCG CTCCTGTCGC AGGCCCTCGA ACGCAGGCAC
AACATTCCGG AAGGCACTTC GTGGGTGAAC TACGTACGCA GCCACGACGA TATCGGCTGG
ACCTTCGCGG ACGAGGACGC CGCGGAACTG GGCATCAACG GCTTCGACCA CCGACGCTTC
CTCAACGCCT TCTACGTCAA TCGCTTCCCT GGCAGCTTCG CGCGCGGCGT TCCCTTCCAG
GACAACCCGC GTACGGGTGA TTGCCGCATC TCCGGCACGA CGGCGTCGCT CTGCGGTCTG
GAGGGCGGCA CCGGCCAGGC GGTGGACCGG ATCCTGCTCG CCCATTCGGT AACCTTCAGC
ACCGGCGGCA TCCCCCTTCT CTACCTTGGC GATGAGGTGG GACAGCTCAA TGACTACGGC
TACGCCCTCG AGGAAGGCCA CGGCGCGGAC AGCCGCTGGG TTCACCGCCC GCACTACCCT
GCCGAGAGGT ACGCAAAGCG CCAGGACCCG GCCGCACCCG AAGGTGCGGT GTTTGAGGGC
ATTCGCGCCA TGATCTCGGC GCGGTCAGCA ACCCCGGAAT TCGCCGGAAC CCGGCTGGTC
CACTTCGACA CCAATAACCG CGGCGTCCTG GGCTACCAGC GGCCGGGCGA AGGAACCCTG
ATCCTGGTCC TCGCCAACTT CAGCGACGGA AACCAGACCA TCACCGCGCA GACCCTTTCC
GGATTCGCGC CCGGGGCAGT GGACCTGCTG ACCGGCAACC CGGTGCGGAT AGAAGGAGGG
GTCAGTCTCC GGCCACAGGA GTTCCGCTGG CTGCGGGTCA CCCCGGCCGG CGGCTGA
 
Protein sequence
MIEAAGVDQA ARQRALSALA AQEDLMSRGQ DSVDFRRRLD RHFPDLYRLF HTLYGRRPDF 
DEQLTALVLQ TARSWNERPA DLKALDAERE HHAGWFLANT MLGGVCYVDR YAEDLEGVRA
RIPYFKELGL TYLHLMPLFL APEPHSDGGY AVSSYRQVNP KLGTMEQLRE LAAELRSNGI
SLVVDFIFNH TSDEHAWAKR AAAGDPGYSD YYWIYPDRAM PDAFEQNVRE IFPENHPGSF
IQMEDGRWVW ATFHTYQWDL NYSNPDVFRA MAGEMLFLAN QGVDILRMDA VAFIWKQLGT
PCENLPEAHT LLQAFNAVCR LAAPSLLFKS EAIVHPDEVA LYIDPAECQL SYNPLQMALI
WESMATRDVS LLSQALERRH NIPEGTSWVN YVRSHDDIGW TFADEDAAEL GINGFDHRRF
LNAFYVNRFP GSFARGVPFQ DNPRTGDCRI SGTTASLCGL EGGTGQAVDR ILLAHSVTFS
TGGIPLLYLG DEVGQLNDYG YALEEGHGAD SRWVHRPHYP AERYAKRQDP AAPEGAVFEG
IRAMISARSA TPEFAGTRLV HFDTNNRGVL GYQRPGEGTL ILVLANFSDG NQTITAQTLS
GFAPGAVDLL TGNPVRIEGG VSLRPQEFRW LRVTPAGG