Gene Arth_3720 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3720 
Symbol 
ID4443721 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4187670 
End bp4189076 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content68% 
IMG OID639691544 
Productalpha amylase, catalytic region 
Protein accessionYP_833195 
Protein GI116672262 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGAAC CGGACTGGGT AAAACACGCA ATCTGGTGGC AGGTCTACCC GATCGGCTTT 
GTCGGCGCGG AGCAGGCTGC CGCTGAACGG GCGTCCAAGG CGCCGACGTC CGAGGCGCCG
GGCCAGACTG TGGCCCATCG GCTGGGGCAG CTGGTTCCCT GGCTTGACTA CGTGCTGGAA
CTCGGCGCGT CCGGGCTCGC GCTGGGGCCC GTCTTCGCCT CGGAAACCCA CGGCTATGAC
ACCACGGACT ACTTCAGGAT CGATCCCCGG CTGGGCGATG ACGCGGACTT TGACGAGCTC
ATCGCCCAGT GCCACGCCCG CGGACTGAAA GTCCTGCTGG ACGGCGTCTT CAACCACGTG
GGGCGCAGCT TCGGGGCGTT CCAGGGTGTG CTCACGGACG GTCCCGGGTC TCCTGCCGCC
TCCTGGTTCC GCCTGCGGTG GCCGGAGTCC GGATGGGCGC CGGGGACCGA ACCGGGCTAC
GAGGATTTTG AGGGCCATCA TCACCTGGTG GCGCTCAACC ACGATGAACC GGCAGTTGCC
GCCCTGGTCA CGGACGTGAT GAAGCACTGG CTAGGCCGGG GAGCGGACGG CTGGCGGCTG
GACGCGGCGT ACGCTGTGCC GGCGTCGTTC TGGGCCCCGG TGCTGGCTGA GGTGCGCCGC
GAGTATCCGG ACTCCTATTT CGTGGGCGAG TACATCCACG GCGACTTCGC CGAGGAGGTG
GAGCGGAGCA CGCTCGACTC GGTCACGCAG TACGAACTGT GGAAGGCCGT CTGGAGTTCA
CTCAACGATG CCAACTTCTA CGAACTCGCA TCCGCGCTCG AGCGGCACAA CGGGTTCCTG
GACACCTTCG TGCCGCTCAC GTTCGTGGGC AACCACGACG TCACCCGCCT GGCCAGCAAG
CTGACGAACC CGGACCAGCT GGCGCTTGCG CTCACAGTCC TCCTGACCGT GGGCGGGACG
CCCTGCATCT ACTACGGAGA CGAGCAGGCT TTCCGCGGCG TCAAGGAGGA CCGGGCCGGC
GGAGATGACG CCGTCCGTCC GGCGTTCCCC GCGGGGCCCG CAGAACTGGC GGAGGACGGC
TGGTCCGTCT ACCACCTGCA CCAGGAACTG ATCAGCCTCC GACGGCGGCA TGCCTGGCTG
CACCGGGCCC GCACCACGGT CCTGGCGCTC AGCAACGAAC ACCTCGTTTA CCAGGTCCGC
GGCGACGGTA AGGCAGCAGC AGGTAGCACA GAGGAGGGCG GCGCAGGCGC CGCGCTGACA
GTCGCGCTGA ACCTTTCCGG CACGCCGGCG GACCTGCCCG TACCGTCCGG GTCAGGCGGC
CTCCTGGCCG GGCGGGCCCA CCGGCATCCG GATCGGGATG CCGTGGGCCT GCCCGGTTAC
GGGTGGGCAG TGCTCGGGAA CAGCTAG
 
Protein sequence
MTEPDWVKHA IWWQVYPIGF VGAEQAAAER ASKAPTSEAP GQTVAHRLGQ LVPWLDYVLE 
LGASGLALGP VFASETHGYD TTDYFRIDPR LGDDADFDEL IAQCHARGLK VLLDGVFNHV
GRSFGAFQGV LTDGPGSPAA SWFRLRWPES GWAPGTEPGY EDFEGHHHLV ALNHDEPAVA
ALVTDVMKHW LGRGADGWRL DAAYAVPASF WAPVLAEVRR EYPDSYFVGE YIHGDFAEEV
ERSTLDSVTQ YELWKAVWSS LNDANFYELA SALERHNGFL DTFVPLTFVG NHDVTRLASK
LTNPDQLALA LTVLLTVGGT PCIYYGDEQA FRGVKEDRAG GDDAVRPAFP AGPAELAEDG
WSVYHLHQEL ISLRRRHAWL HRARTTVLAL SNEHLVYQVR GDGKAAAGST EEGGAGAALT
VALNLSGTPA DLPVPSGSGG LLAGRAHRHP DRDAVGLPGY GWAVLGNS