Gene Acry_1040 details

Gene Information

Locus tag: Acry_1040
Symbol:
ID: 5159983
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidiphilium cryptum JF-5
Kingdom: Bacteria
Replicon accession: NC_009484
Strand:
Start bp: 1157682
End bp: 1159583
Gene Length: 1902 bp
Protein Length: 633 aa
Translation table: 11
GC content: 74%
IMG OID: 640552958
Product: polysaccharide biosynthesis protein CapD
Protein accession: YP_001234175
Protein GI: 148260048
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1086] Predicted nucleoside-diphosphate sugar epimerases
TIGRFAM ID:
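
The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, excluding the final stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Coordinates taken from the record above.
length = gene_length_bp(1157682, 1159583)
print(length, protein_length_aa(length))  # prints: 1902 633
```

Under those assumptions the result agrees with the listed gene length of 1902 bp and protein length of 633 aa.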


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGTTGC AGCGCAGCCA GATCCGCATC GCGGTCAATT TCGCCCTCGA CGGTCTGCTG 
GCCGCGCTGG CGGTGATCGG CGCCTGCTGG CTCGCCGATC CGGCGCATCC GATGCCCTCG
CCGGCGGTCC TGCCGCTGGC CGGGTCGGCG GCGATCTGGC TGGTCGGCGT GCCGTTCGGC
CTCGCCCGGC AGCACTGGCG CTTCACCGCC CTGCCCGACG CGATCGCGGT GGGGGCGAGT
GCGGTGTTCG CCGCGCTGCT GCTCGTGCTG CTGCTCGTCG GCGTCGGCGC CAGGCTTCCC
TCGGCGAGTT TTCCGCCGCT GCTGATGATC ACGCTCGGCC TCGCGCTGCT GGCGCCGCGC
GTGCTCTACC GGATGGCGCG CAGCCGCCGC GAGGTGCTGT CGGACGATGC CGAGACCGCG
CTGCTGCTCG GCGATGGCGA GGGCGCGGAA CTGTTCCTCG CCGCCCTGTC GCGCGAGAGC
CACCAGCGCT ACCGGGTGAT CGGCGTGCTG GCGGGCTCGG CGCGGGAGAC CGGCCGGCGC
ATCCATAACG TGCCGATCCT CGGCGAGGTG AGCGGGCTGG CGGCGGCGCT CGACCGGCTG
GCCGAGGCCG GGCAGATGCC GGCCGTGCTG GTGGTGGCGA GCCGCGAGCT CGTCGGCCCG
GCGCTGCGCG AGATCATGGA CGAAGCCGAG CGCCGGGGCA TCCGCGCCGC CCGCGCGCCG
CGGCCGACCA CGCTCTCGCC CACCACGCCG GGCGAGCCGG AGACCGCGCT GCGGCCGATC
GCCATCGAAG ACCTGCTCAA CCGCCCGCAG GTCGCGCTCG ACCGCGAGGG CATGGCGCGG
ATGATCCAGG GACGCTGCGT GCTGGTGACC GGCGCCGGCG GGTCGATCGG CTCGGAACTG
GCCCGCCAGG TCGCGGGGTT CGGACCGGCG CGGCTGATCC TGCTCGATTC GAGCGAGTTC
GCGCTGTGGC GGATCGATCT CGAACTCTCG GAGCAGGTGC CGGGCCTGGC GCGCGCGGCG
GTGATCGCCG ATGTCCGCGA CCGCGCGCGG ATCGAGGCGC TCTGCGCCGA ATGGCGGCCG
GACCTCGTGT TCCACGCGGC GGCGCTGAAG CATGTGCCGA TCGTCGAGGC CAATCCGCTG
GAGGGCATCG CCACCAACGC GCTCGGCACC CGCAACGTGG CCGATGCCGC CCGCGCCGCC
GGCGCCGGGC TGATGGTGCT GATCTCGACC GACAAGGCGG TGAACCCGTC CTCGGTGATG
GGCGCGTCGA AGCGGCTGGC GGAGATGTAT GCCCAGGGGC TCGACGTCGC GGCGCGGCGG
CAGGCGGGGA TGCGCATCGT CACCGTGCGG TTCGGCAACG TGCTGGGCTC GACCGGCTCG
GTGGTGCCGC TGTTCCGCCG CCAGCTCGCC CGCGGCGGGC CGCTGACGGT GACGCATCCC
GACATGCGGC GTTATTTCAT GACGGTGCGC GAGGCGGTCT CGCTCGTGCT GCAGGCCGCC
GTGGTCGGCC GCTCGGACGC GGCGCTGCCG GTCGCACAAG GCGGGATCTT CGTGCTCGAC
ATGGGCGAGC CGGTGAAGAT CGTCGATCTC GCGCGGCAGA TGATCCGCCT CGCCGGGCTC
AGGCCCGATC TCGACATCCC GATCCGCTTC ACCGGGCTCA GGCCGGGAGA AAAGCTGTTC
GAGGAGCTGT TCCACGGCGC CGAACGGCCG ATCGAGACCG GGTTTCCCGG CCTGCTGATG
GCCGCCCCGC GGGTGGCCGA CGCCGCCCTG GTCGGCCGGG CCTTCGACGA GCTGGCCGCC
CTGATCCAGC GCGGCGAGGC GGCGGCCGCG CTCGCGGCGC TGGCAAGGCT GGTGCCGGAG
TTCGGCGCGC AACCCCTCGC CGCGGCGGGC CCGACCGGCT AG
 
Protein sequence
MTLQRSQIRI AVNFALDGLL AALAVIGACW LADPAHPMPS PAVLPLAGSA AIWLVGVPFG 
LARQHWRFTA LPDAIAVGAS AVFAALLLVL LLVGVGARLP SASFPPLLMI TLGLALLAPR
VLYRMARSRR EVLSDDAETA LLLGDGEGAE LFLAALSRES HQRYRVIGVL AGSARETGRR
IHNVPILGEV SGLAAALDRL AEAGQMPAVL VVASRELVGP ALREIMDEAE RRGIRAARAP
RPTTLSPTTP GEPETALRPI AIEDLLNRPQ VALDREGMAR MIQGRCVLVT GAGGSIGSEL
ARQVAGFGPA RLILLDSSEF ALWRIDLELS EQVPGLARAA VIADVRDRAR IEALCAEWRP
DLVFHAAALK HVPIVEANPL EGIATNALGT RNVADAARAA GAGLMVLIST DKAVNPSSVM
GASKRLAEMY AQGLDVAARR QAGMRIVTVR FGNVLGSTGS VVPLFRRQLA RGGPLTVTHP
DMRRYFMTVR EAVSLVLQAA VVGRSDAALP VAQGGIFVLD MGEPVKIVDL ARQMIRLAGL
RPDLDIPIRF TGLRPGEKLF EELFHGAERP IETGFPGLLM AAPRVADAAL VGRAFDELAA
LIQRGEAAAA LAALARLVPE FGAQPLAAAG PTG
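
The sequences above are laid out in space-separated groups of ten characters, so they need to be joined before any analysis. The following is a rough sketch of one way to do that and to compute a GC percentage for comparison with the GC content field; the helper names are hypothetical, and the sketch assumes the sequence contains only unambiguous bases (A, C, G, T).

```python
def clean_sequence(text: str) -> str:
    """Remove the grouping spaces and line breaks, and upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above: six groups of ten bases.
first_line = "ATGACGTTGC AGCGCAGCCA GATCCGCATC GCGGTCAATT TCGCCCTCGA CGGTCTGCTG"
print(len(clean_sequence(first_line)))  # prints: 60
```

Passing the full gene sequence text to `gc_percent` gives a value that can be compared with the rounded figure in the Gene Information section.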