Gene Acry_0570 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcry_0570 
Symbol 
ID5160353 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidiphilium cryptum JF-5 
KingdomBacteria 
Replicon accessionNC_009484 
Strand
Start bp641181 
End bp642311 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content68% 
IMG OID640552486 
Productendo-1,4-D-glucanase 
Protein accessionYP_001233713 
Protein GI148259586 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3405] Endoglucanase Y 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGACA TGGCGAGACG CAATTTTCTC GGTCTTCTCG GCGCGGCCGC GCTCACGGGC 
GGCCCGGTGA TGATGGCGCG GGAGCTGTGG CAATCGTCCT GGCGCGGCTA CCGGCACGGA
TTCATCGACG GCCAGGGGCG GGTCATCGAC TATTCCGCCA ACAAGGGATT CAGCACTTCG
GAGGGGCAGT CCTATGGCAT GTTCCTCAGT CTCGTCGCCG GCGACCGCGC GACGTTCCGC
CGCATCCTGA ACTGGACGAA CACGAACATG GCGGGCGGGC GCCTCGGGGA GGTGCTCGCG
GCGTGGAAAT GGGGGCTGCA CGGCGGCAAA TGGGGGGTGA TCGGCGCCAA CTCGGCGGCG
GACGCGGATG CGTGGATGGC CTATTCGCTG CTCGAGGCCG CCCGGATCTG GAAGGATCAC
AATCTCGGCG CCGAGGGGCA CAAGCTGGCG ACGCGCATCG CCGATGACGA GAGCGTGGCG
ATCAACGGAT TTGGCCGGGT CCTGATACCC GGCGCCTCCG ATTTTCCGGA CACGCCGCCC
GTCATCGTCG ATCCGAGCTA TACGCCGCTG TTTCTGGCCC GCGGCATCGC GCGCGCGACC
AACCTGCCGA AATGGCAGGC GATCGCGGCC ACGCTGCCAC GGCTGATGAC GACGATCTGC
CGCAACGGAT TCGCCCCCGA CTGGGCCTGG GCGCCGCAAG CCCCCGCCTC GCCGCCGGCG
GGCCTGCCGG AGACCGGCAC CGGATCGTTC GATGCGATCC GGTGCTATCT CTGGGCGGGC
CTGACCGCGC CGGAAACGGA GGGGAGCGCG ACCGTTCTCG CCTCGCTGAA GGGCATGGCC
CGATACTTGG CCACCCACCG CGCGCCGCCG CAGAGCGTCG ATCTCGCGAG CCAGGCGACG
CACGGGACGG GCGGGATCGG GTTTTCCGCG GCGCTGCTGC CCTACCTCGC GGCACTCGGG
CGCCACCGGC TGCTCCATCA GCAGCTCGGC CGCGTGCTGG CCCAGCGGGA GACCAGCGGC
CTGTTCGGCC AGCCCGCCGA CTATTACTCG GAAAACCTGA TCCTGTTCGG ACTTGGCGGA
CTTTCGGGAA GCATCCGCTT CGACAAGCAA GGAGGTCTGA TCACGTCATG A
 
Protein sequence
MTDMARRNFL GLLGAAALTG GPVMMARELW QSSWRGYRHG FIDGQGRVID YSANKGFSTS 
EGQSYGMFLS LVAGDRATFR RILNWTNTNM AGGRLGEVLA AWKWGLHGGK WGVIGANSAA
DADAWMAYSL LEAARIWKDH NLGAEGHKLA TRIADDESVA INGFGRVLIP GASDFPDTPP
VIVDPSYTPL FLARGIARAT NLPKWQAIAA TLPRLMTTIC RNGFAPDWAW APQAPASPPA
GLPETGTGSF DAIRCYLWAG LTAPETEGSA TVLASLKGMA RYLATHRAPP QSVDLASQAT
HGTGGIGFSA ALLPYLAALG RHRLLHQQLG RVLAQRETSG LFGQPADYYS ENLILFGLGG
LSGSIRFDKQ GGLITS