Gene Acry_2183 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcry_2183 
Symbol 
ID5161348 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidiphilium cryptum JF-5 
KingdomBacteria 
Replicon accessionNC_009484 
Strand
Start bp2409226 
End bp2412213 
Gene Length2988 bp 
Protein Length995 aa 
Translation table11 
GC content71% 
IMG OID640554105 
Productglycosyl transferase family protein 
Protein accessionYP_001235300 
Protein GI148261173 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.610183 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCCCCAGC CCGCCCATCT CCTATTGCGC CTCCCCGCCG AGCAACGACC CGCCTGGGCG 
CTCTTCGATC CGGTCTGGTA CGCCCTCCGC CATCCGGACA TCGCCGCCGG GCGCGACGAG
GCCGCCCTGC TCGAATACTA CCTCGCCGTC GGCGGCCGGC TGGGGCACAG CCCGAGCCCG
CTCTTCGACG AAGGCTTCTA CCGCGCCAGC AACCCCGACA TCGCGGCGCT GGTCGCCGAG
GGCGGCTACG CCTCGGGCTT CGATCATTTC TGCCAGTACG GCCATCGCGG CCTCTCGCCG
CACTGGCTGT TCGACGACAC GCTCTACGGC GACCTGCACG CCGACATGTC GCTCGAGAAT
CTCGACCGCC ACCGCTGCTT CGGCCGCTAC GACCACTGGC TGAAATCCGG CCAGCGCGAG
CAGCGGATCT GCCACTTCCT GTTCGACAGC GCGTTCTACC GCCGCGAGGT CGAGGCGGCG
GGAGAGGGCG CGGCGCGGCT CGAGTCCATG GGCCCCTTCG TCCACTATCT CTACGCGCTC
TGGGACGACA CGCCCGAACG CGCCACCTCG CCCTATTTCG ACCCCGCCTG GTACGCCGCG
ATGCACGAAT CCGCGCGCGA CGACATCGCC GAGGGCCGCG CCCGCAACGC GCTGCACCAC
TACATCACGC GCGGCGAGGC CGCCGGCTTC GACCCGGTGC CGGAATTCTC CGAGCGCTAC
CATCTCGACA CCTATGACGA CATCGCCGCC GCCGTCGAGG CCGGCGTGTT CCGCAGCGGC
TACCATCATT TCGTCCGCTA CGGCACGCTC GAACTACGCC GCCCCCGCGC CGACATCGAC
CTGATCTACT ACCGCGACAC CAACCCCCGC GTCCGCAACG ATCTCGCCGC CGGCGCGGTG
CGCGACGCCT TCGCCCATCT GCGCACGATC GGCCTGCGCG AGGGGCTCGA CCACTGCCCG
CCCGAGCGCG TGCCGGACCT GCCCGAACGC GCCGCCCGCC AGCTCTTCGA ACTGCGCGCG
CGCAACCAGT CCGCCCTGTT CGCCCGCCAG CGGATCGACT TCGCTCCGTC CGCCCCGCCC
GCGCTTTCGG TCATCATGGT GCTGCACGAG AAATTCGAGC TGACCATGCA GGCGCTCGCC
TCCCTGCGCG CCAACTTCCC CGGCGATCTC GACCTCATCC TGATCGACAA CGGCTCGACC
GACGCCACCG CCCGGATCGA GACCTATGTC CGCGGCGCGA CCATCCTCCG CGAGCCCGGC
AATCCCGGCT TCCTGCTCGC CTGCAATCGC GGCCTGCGCG AGGTCCGCGC CCCGGCGGTG
CTTTACCTCA ACAACGATGT CGAGCTCGGC TTCGGCGCCA TCGCCGCCGC CCTGCGCCGC
CTCGCCTCGG CACCCGATAT CGGCGCCGTC GGCGGCAAGG TCATCCGCAC CCATGGCCGC
CTGCAGGAAG CCGGCTCGAT CGTCTGGGCC GACGGCTCGA CCCAGGGCTA CCTGCGCGAC
GCCTCGCCGC TGGCCCCCGA GGCCAATTTC GTCCGCGACG TCGATTTCTG CTCCGGCGTC
TTCCTGCTCT GCCGCACCGA TCTCGTGCGC CGCCTCGGCG GCTTCGACGA AGCCTTCCGC
CCCGCCTATT ACGAGGAGGT CGATCTCTGC GTGCGGATGA TCGAGGCCGG CTTCCGCGTC
ATCTACGACC CGGACGTCAC CATCCACCAC CTCGAATACG GCAGTTCGAG CAACGCCGAG
GCGGCGATGC AGCAGATGCG CCGCAACCGC CGCGTCTTCG TCCGCAAGCA CAAGGCGTTC
CTGCACTTCC AGCAGGCGCC GGCGCCGGGC GGCGAAATCC GCGCCCGCGC GCGCAACGGC
GCCGGACGGC GGCTGCTGTT CCTCGAGGAT ACCGTGCCGC TCCGCCGCCT CGGCTCCGGC
TTCGTCCGCG CCAACGACGT GGTGCACGCC ATCGCCGATG CCGGCTGGCA GGTCTCGGTC
CTGCCGGTGA ACGGCGCGCG GCACGACCTG ATGAGCCAGT TCGGCGACCT GCCCGAAACC
GCCGAGGTGC TGCACGACCG CTCGATCATG ACCCTGCCGG CCCTTCTCGC CGAGCGCCCC
GGCTTCTACG ACGCGGTCTG GATCTCCCGC ACCCACAACC TGATGCGCAC CGCCCCGATC
TTCGAGGCGG CGGGGATCGA TCCGGCCACA ACGCCCTTCG TGGTCGATAC CGAGGCCGTC
GCCGCCACCC GCGAGGCCGA GGCCGCCGCC CTGCGCGGCG AGGCCGCGTT CGACCTCGCC
GCCGCCGTCG CCGCCGAAAT GCGCCCGGCC ATGGTCTGCC GGCATGTCAC CGCGGTGAAC
GAGGCGGAAG CCGCCCTGCT CCGCGCCGCC GGCCTGCCCG GCGTCGCCGT GCTCGGCACC
ATCCGCGCGC CCGATCCCAC GCCGCGCCCC TTCGCCGCGC GCGCCGGCCT GCTCTTCGTC
GCCAGCATCC ACCGCGAGGA CAGCCCGAAT CTCGACAGCC TGCGCTGGTA TCGCGACGAA
ATCCTGCCCG CCCTGCGGCG GATCATGGAC GAGCCGCCGA CCCTCAGCTT CGTCGGCTAC
ACCGCGCCGG ATCTCGACCT CGCCGAATTC CGCGGCATCC CGGGCATCGA GCTGCGCGGC
ACGGTGGCCG AGCTGCGCCC CGCCTATGAC GAGCACCGCC TGTTCATCGC CCCCACCCGC
TTCGCCGCCG GAACACCCTA CAAGGTCTAC GAAACCGCCT CGTTCGGCCT GCCCTGCGTG
GCAACGCCAC TGCTCTGCCG CCAGCTCGGC TGGACCCCGG GCAAGGACAT CGCCACCCCG
GCCACCGCCG ACGCCGCCGC CTTCGCCGCC GAGATCGCCT CGCTCTACCG CGACGAAACC
CGCTGGACCG CGCTGCGCGA TTCCGCGCTG CGCCGCCTCG CCGCGGAAAA CGGCCGGGCC
CCCTTCGAAG CCACGGTCCG CCGGATCCTC GACGCCATCA GCGCCTGA
 
Protein sequence
MPQPAHLLLR LPAEQRPAWA LFDPVWYALR HPDIAAGRDE AALLEYYLAV GGRLGHSPSP 
LFDEGFYRAS NPDIAALVAE GGYASGFDHF CQYGHRGLSP HWLFDDTLYG DLHADMSLEN
LDRHRCFGRY DHWLKSGQRE QRICHFLFDS AFYRREVEAA GEGAARLESM GPFVHYLYAL
WDDTPERATS PYFDPAWYAA MHESARDDIA EGRARNALHH YITRGEAAGF DPVPEFSERY
HLDTYDDIAA AVEAGVFRSG YHHFVRYGTL ELRRPRADID LIYYRDTNPR VRNDLAAGAV
RDAFAHLRTI GLREGLDHCP PERVPDLPER AARQLFELRA RNQSALFARQ RIDFAPSAPP
ALSVIMVLHE KFELTMQALA SLRANFPGDL DLILIDNGST DATARIETYV RGATILREPG
NPGFLLACNR GLREVRAPAV LYLNNDVELG FGAIAAALRR LASAPDIGAV GGKVIRTHGR
LQEAGSIVWA DGSTQGYLRD ASPLAPEANF VRDVDFCSGV FLLCRTDLVR RLGGFDEAFR
PAYYEEVDLC VRMIEAGFRV IYDPDVTIHH LEYGSSSNAE AAMQQMRRNR RVFVRKHKAF
LHFQQAPAPG GEIRARARNG AGRRLLFLED TVPLRRLGSG FVRANDVVHA IADAGWQVSV
LPVNGARHDL MSQFGDLPET AEVLHDRSIM TLPALLAERP GFYDAVWISR THNLMRTAPI
FEAAGIDPAT TPFVVDTEAV AATREAEAAA LRGEAAFDLA AAVAAEMRPA MVCRHVTAVN
EAEAALLRAA GLPGVAVLGT IRAPDPTPRP FAARAGLLFV ASIHREDSPN LDSLRWYRDE
ILPALRRIMD EPPTLSFVGY TAPDLDLAEF RGIPGIELRG TVAELRPAYD EHRLFIAPTR
FAAGTPYKVY ETASFGLPCV ATPLLCRQLG WTPGKDIATP ATADAAAFAA EIASLYRDET
RWTALRDSAL RRLAAENGRA PFEATVRRIL DAISA