Gene Acry_1895 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcry_1895 
Symbol 
ID5160442 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidiphilium cryptum JF-5 
KingdomBacteria 
Replicon accessionNC_009484 
Strand
Start bp2088916 
End bp2091777 
Gene Length2862 bp 
Protein Length953 aa 
Translation table11 
GC content79% 
IMG OID640553816 
Productglycosyl transferase family protein 
Protein accessionYP_001235015 
Protein GI148260888 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.207282 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGTCGACCT TGCCGCCGGG CCGGAACCCG CCGCCAGCCG TGCCGCCGGG GACTGACGCC 
GCCTTCGCGC GCGTGCGCGA CCGCCGCAAC GACGAAGCGG TGGACGCCTG GCGCCGCGCC
CGCCTCGCCA TCGCCGCCGG CGATCCGGCG GAGGCGGCGT TCTGGCTCGA CCGCGTCTGC
CGCCTCGCGC CGGATGACGG CCGCCCGCGT TACGAGCGCG CCCTCCTGCG CGCCGCGACC
GGCGATCCCG AGGCCGCCGC CGCCCTCGCC GACATCGCCA CCCGCTTCGA CCATGCCGGG
GCCTGGATCG CGCTCGCCCG CCTGCGCCAT GCCGCCGGCG CGCATGACGA GGCCGCCGCC
GCTCTCGCCG AACTCCTCGC CCGCCACGAG GTTCCGCCGG CGCTCGACGA TGGCGGCCGG
TTCGAAACCC TCGCCACCGC CATCGCCCGC GCCGCCGGGT TCGCCGGCTG GCGCGGCATC
GCCGAGGGCC GGCGCGTCGG CGACGGTCCG CACGGCGCGA ATCTGCTCGG CGCGCCCGAC
CTCGCCGCGA TCCGCCATCA CGACGGCATC GCCATGCCGG CGGGGCAGGG GATCGAGGGC
TGGGTCGCCC ATCCCGCCGC CCCCTGGCGG GTGCCGCGCC TCACCCTGGT CGACCGCGCC
GGCACCCGCC GCGCCCTCAC CCCGCGCCGC CGCCTCGGCC CGTCCGACCA GCGGCCCTTC
GTCGAGCGCC ACGGCTTCCG CATCGGCCCG GCGCGCCTCG CCGGTCTCGT CCCGCCCTTC
ATCCTCGAAA CCGCGTCCGG CGCCGCCCTC GCCGGCACGC CGATCGACCC CGCCGCCCTC
GCCGCGCTGA AGCCGGCCCG CATCCGCCGC CGCCGCCCCG TCCTGCCGCC GCGCGCCCCG
CTCTGCGCCG TCATGCCCGC CTATCGCGAC GCGGCGGCGA CGGCGGATGC CATCGCCTCC
TTCCGCCGCG CCGCCCCGCG CTGCCCGCTG GTGGTGGTGG ACGATGCCTC GCCCGAGCCC
GCCCTCGTCC GCCTGCTCGA TGAAGCGGCC GCGGCCGGGC GCATCGCGCT CCGCCGCCAC
CGCCGCAACC GCGGCTTCCC CGCCGCCGCC AACGCCGGCA TCGCCGCCGC CCGCCGCCTC
GTCCCCGGCT GCGACATCCT GCTGCTCAAC GCCGACATCC TGCTGCCGCC GCGCGCCCCC
GCCCGGCTGC ACGCCGCGCT CTACCGCGCG CCCGACATCG CCAGCGTCAC CCCGCTCTCG
AACGAGGCGA CGATCATGAG CTACCCGGCC CGCCAGGGCG GCAACCCCGC GCCGGACCTC
GCCGCCACCA CCCGGCTCGA CGCGCTGGCC CGCCGCGTCA ACGGCGCCGC CCTGGTCGAC
ATCCCGACCG CGATCGGCTT CTGCATGGCG ATCCGGCACG ACGCGCTCGA CGCCATCGGC
GGGTTCGACG CCACGCTCTT CGCCCAGGGC TACGGCGAGG AGAACGACTG GTGCCGCCGC
GCCGCCCTGG CCGGCTGGCG CCATGCCGGC GCGCCCGGCA TCTTCGTCGC CCATCGCGGC
GGCGCCTCCT TCGGCGCCAC CACCGCCGCG CTCTGCGCCC GCAACCTCGC CGTCCTCGAA
CGCCGCCATC CCGGCTACGC CGCCCTCATC GCCGCCCATC ACGAGGCCGA TCCGCTGCGC
GACGCCCGCC GCCGGCTCGA CCTTGCCCGC TTCGCCGCCG GCCGCGTCCG CCACGGCGCG
GTCGCCCTCG TCGCGCATGA CCACGGCGGC GGCATCGCCC GCGTCGTCGC GGCGCGGATG
GCGGCGCTGC GGGAGGCCGG GCTGCGGCCG ATCCTGCTCA GCCCCGACCT GATCCGCGCG
GACGGCTCGA TCGCCGGCAC CGGCCCCGCC CGCCTCGCCG ATGGCGGCGC CGAACCGTTC
CCCAACCTCG TCTTCGATCC CCGCACCGGG ATGCGCGCCC TCGTCGCCCT GCTCCGGCGC
GAGGGGGTGC GCCATGTCGA GTTCCACCAC ACGCTCGGCC ACGACCCGGC GATCCTCGCC
CTGCCGGCCC GGCTCGGCGT CCCGGCGGAT CACGTGGTCC ACGACTACGC CCATTTCTGC
AAGCGGGTGA ACCTGATCGG CCCGGCCAGG CGCTATTGCG GCGAACCCGG CATCGACGGC
TGCGAAACCT GCCTCGCCGC CGCCGGCAGC GCCTTGGCCG ACCCGATCGC GCCCGCCGCC
CTGCGCGCCC GCTCCGCCGC CGCCTTCGCC GCCGCCCGCC GCGTCGCCGC CCCCTCGGCC
GATGCCGCCG CCCGCCTGCG CCGCCATTTC CCCGGCCTCG CCGTCGCCGT CACCCCCTGG
GAGGACGATG ACGACCTGCC GCCGCCCCGC CCGCCCGCGC CCGGCGCGCC CCGCCGGATC
GCGCTCGCCG GCGGCATCGG CCCCGCCAAG GGGTTCGACG TCCTGCTCGC CTGCGCGCGC
GATGCCGCCG CCCGCGCCCT GCCGCTCTCC TTCGTCCTCG CCGGCACCAG CGAGGACGAT
GCCGCGCTGA TCGCCACCGG CCGCGTCCGC GTCACCGGCG CCTATGAGGA GGGCGAGGCC
CAGGCCCTGC TGCGCGATGC CGGCGCCGCC CTCGGCTTCC TGCCCTCGAT CTGGCCGGAA
ACCTGGTGCT TCGCCCTCGG CGAGCTCTGG CGCGCCGGGC TCTACGTCCT CGGTTTCGAC
ATCGGCGCCC CGGCCGCCCG CATCCGCGCG ACCCGGCGGG GCGACGTCCT GCCCCTCGGC
CTGCCGCCGG CCCGCATCAA CGACATCCTG CTCGCCTGGC AACCGCCCGC GCCCGCCGCC
CGTCCTCCGG CCCTGACGGC CGCATCCCTT TCCCCCGCCT GA
 
Protein sequence
MSTLPPGRNP PPAVPPGTDA AFARVRDRRN DEAVDAWRRA RLAIAAGDPA EAAFWLDRVC 
RLAPDDGRPR YERALLRAAT GDPEAAAALA DIATRFDHAG AWIALARLRH AAGAHDEAAA
ALAELLARHE VPPALDDGGR FETLATAIAR AAGFAGWRGI AEGRRVGDGP HGANLLGAPD
LAAIRHHDGI AMPAGQGIEG WVAHPAAPWR VPRLTLVDRA GTRRALTPRR RLGPSDQRPF
VERHGFRIGP ARLAGLVPPF ILETASGAAL AGTPIDPAAL AALKPARIRR RRPVLPPRAP
LCAVMPAYRD AAATADAIAS FRRAAPRCPL VVVDDASPEP ALVRLLDEAA AAGRIALRRH
RRNRGFPAAA NAGIAAARRL VPGCDILLLN ADILLPPRAP ARLHAALYRA PDIASVTPLS
NEATIMSYPA RQGGNPAPDL AATTRLDALA RRVNGAALVD IPTAIGFCMA IRHDALDAIG
GFDATLFAQG YGEENDWCRR AALAGWRHAG APGIFVAHRG GASFGATTAA LCARNLAVLE
RRHPGYAALI AAHHEADPLR DARRRLDLAR FAAGRVRHGA VALVAHDHGG GIARVVAARM
AALREAGLRP ILLSPDLIRA DGSIAGTGPA RLADGGAEPF PNLVFDPRTG MRALVALLRR
EGVRHVEFHH TLGHDPAILA LPARLGVPAD HVVHDYAHFC KRVNLIGPAR RYCGEPGIDG
CETCLAAAGS ALADPIAPAA LRARSAAAFA AARRVAAPSA DAAARLRRHF PGLAVAVTPW
EDDDDLPPPR PPAPGAPRRI ALAGGIGPAK GFDVLLACAR DAAARALPLS FVLAGTSEDD
AALIATGRVR VTGAYEEGEA QALLRDAGAA LGFLPSIWPE TWCFALGELW RAGLYVLGFD
IGAPAARIRA TRRGDVLPLG LPPARINDIL LAWQPPAPAA RPPALTAASL SPA