Gene Acry_0120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcry_0120 
Symbol 
ID5159461 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidiphilium cryptum JF-5 
KingdomBacteria 
Replicon accessionNC_009484 
Strand
Start bp132301 
End bp133428 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content68% 
IMG OID640552036 
Productglycosyl transferase family protein 
Protein accessionYP_001233267 
Protein GI148259140 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID[TIGR03469] hopene-associated glycosyltransferase HpnB 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCTGC TTGCCCTGCT CGCGCTGCTG GCTTGGATCT ACCTATATCT GCTGCACGGC 
CAGTTCTGGC AGAGCGGCCC GGAACTCGCC CCGGCGCGGC CGGCAACCGC GGTTCCGGTA
GACATCATCG TGCCGGCGCG CGACGAGGCG GAAACGATCG GGGCGGTCGT GCAATCGCTG
CTGGCTCAGG ATTATGCCGG GCCGTTCCGG GTGATCCTGG TCGACGATGG CAGTACGGAT
CGTACAGGCG ACATCGCGAT CCGGGCGGCG AACGGCGATC CGCGCTTTGC CCTGCTGCGC
GGCGGCGAAA AGCCAGCCGG CTGGTCGGGC AAACTCTGGG CATTGGAGCA GGGAGTGGCG
CACGGCGCGG CGCCAGTGCT GCTGTTCACC GATGCCGATA TTGTTCACGA TCCGCGGCAT
CTGGCGACGC TGGCCGCGAG GTTGGTGACA CCGGAGCGCG GCGCGCGGCT CGACATGGTT
TCGGAAATGG TGCGTCTGAA CTGCGAAAGC GCCGCCGAAC GCGCGCTGGT GCCGGCTTTC
GTCTACTTCT TCCAGATGCT CTACCCGTTC GCCCGCGTGA ACGATCCGCT CGACGGCACC
GCCGCCGCGG CCGGCGGCAC GGTGCTGATC CGGCGCGAGG CGCTGGAGCG GGCTGGCGGG
CTCGCGGCGA TGCACGGCGC GCTGATCGAC GACGTCACGC TGGCCGGCCG GGTCAAGCGC
GGCGGCGCCG TGTTCCTCGG GCATTCCGGC CTCGCTCGCT CGATCCGCCC CTATCCGCGG
CTTGCCGACA TCAGGGCGAT GATCTCGCGC ACTGCCTTTA CCCAGCTGCA TTATTCCGGG
CTGCTGCTCG CGCTCACGCT GGCTGGGCTG GCGGTCGTCT GGCTGGTGCC GCCGCTTGCC
CTCGTTTTCG GGCATGAAGT CGCGGCCTTG TGCGGGCTGA TTGCCTCGCT GCTCGCGGTG
CTGAGCTATC AGCCGACGCT GCGGCGCTAC GGACGCGGCT GGTATTGGGG GCTGGCACTA
CCGCTGATCG CGCTTGTCTA TATGGAGGCG ACGTTGGCCT CCGCACTGCG CTATTGGCGT
GGCACGGGGG CTGCCTGGAA AAGCCGCGAT TATGGAGCCG ACGCATGA
 
Protein sequence
MTLLALLALL AWIYLYLLHG QFWQSGPELA PARPATAVPV DIIVPARDEA ETIGAVVQSL 
LAQDYAGPFR VILVDDGSTD RTGDIAIRAA NGDPRFALLR GGEKPAGWSG KLWALEQGVA
HGAAPVLLFT DADIVHDPRH LATLAARLVT PERGARLDMV SEMVRLNCES AAERALVPAF
VYFFQMLYPF ARVNDPLDGT AAAAGGTVLI RREALERAGG LAAMHGALID DVTLAGRVKR
GGAVFLGHSG LARSIRPYPR LADIRAMISR TAFTQLHYSG LLLALTLAGL AVVWLVPPLA
LVFGHEVAAL CGLIASLLAV LSYQPTLRRY GRGWYWGLAL PLIALVYMEA TLASALRYWR
GTGAAWKSRD YGADA