Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acry_1040 |
Symbol | |
ID | 5159983 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidiphilium cryptum JF-5 |
Kingdom | Bacteria |
Replicon accession | NC_009484 |
Strand | + |
Start bp | 1157682 |
End bp | 1159583 |
Gene Length | 1902 bp |
Protein Length | 633 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 640552958 |
Product | polysaccharide biosynthesis protein CapD |
Protein accession | YP_001234175 |
Protein GI | 148260048 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1086] Predicted nucleoside-diphosphate sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGTTGC AGCGCAGCCA GATCCGCATC GCGGTCAATT TCGCCCTCGA CGGTCTGCTG GCCGCGCTGG CGGTGATCGG CGCCTGCTGG CTCGCCGATC CGGCGCATCC GATGCCCTCG CCGGCGGTCC TGCCGCTGGC CGGGTCGGCG GCGATCTGGC TGGTCGGCGT GCCGTTCGGC CTCGCCCGGC AGCACTGGCG CTTCACCGCC CTGCCCGACG CGATCGCGGT GGGGGCGAGT GCGGTGTTCG CCGCGCTGCT GCTCGTGCTG CTGCTCGTCG GCGTCGGCGC CAGGCTTCCC TCGGCGAGTT TTCCGCCGCT GCTGATGATC ACGCTCGGCC TCGCGCTGCT GGCGCCGCGC GTGCTCTACC GGATGGCGCG CAGCCGCCGC GAGGTGCTGT CGGACGATGC CGAGACCGCG CTGCTGCTCG GCGATGGCGA GGGCGCGGAA CTGTTCCTCG CCGCCCTGTC GCGCGAGAGC CACCAGCGCT ACCGGGTGAT CGGCGTGCTG GCGGGCTCGG CGCGGGAGAC CGGCCGGCGC ATCCATAACG TGCCGATCCT CGGCGAGGTG AGCGGGCTGG CGGCGGCGCT CGACCGGCTG GCCGAGGCCG GGCAGATGCC GGCCGTGCTG GTGGTGGCGA GCCGCGAGCT CGTCGGCCCG GCGCTGCGCG AGATCATGGA CGAAGCCGAG CGCCGGGGCA TCCGCGCCGC CCGCGCGCCG CGGCCGACCA CGCTCTCGCC CACCACGCCG GGCGAGCCGG AGACCGCGCT GCGGCCGATC GCCATCGAAG ACCTGCTCAA CCGCCCGCAG GTCGCGCTCG ACCGCGAGGG CATGGCGCGG ATGATCCAGG GACGCTGCGT GCTGGTGACC GGCGCCGGCG GGTCGATCGG CTCGGAACTG GCCCGCCAGG TCGCGGGGTT CGGACCGGCG CGGCTGATCC TGCTCGATTC GAGCGAGTTC GCGCTGTGGC GGATCGATCT CGAACTCTCG GAGCAGGTGC CGGGCCTGGC GCGCGCGGCG GTGATCGCCG ATGTCCGCGA CCGCGCGCGG ATCGAGGCGC TCTGCGCCGA ATGGCGGCCG GACCTCGTGT TCCACGCGGC GGCGCTGAAG CATGTGCCGA TCGTCGAGGC CAATCCGCTG GAGGGCATCG CCACCAACGC GCTCGGCACC CGCAACGTGG CCGATGCCGC CCGCGCCGCC GGCGCCGGGC TGATGGTGCT GATCTCGACC GACAAGGCGG TGAACCCGTC CTCGGTGATG GGCGCGTCGA AGCGGCTGGC GGAGATGTAT GCCCAGGGGC TCGACGTCGC GGCGCGGCGG CAGGCGGGGA TGCGCATCGT CACCGTGCGG TTCGGCAACG TGCTGGGCTC GACCGGCTCG GTGGTGCCGC TGTTCCGCCG CCAGCTCGCC CGCGGCGGGC CGCTGACGGT GACGCATCCC GACATGCGGC GTTATTTCAT GACGGTGCGC GAGGCGGTCT CGCTCGTGCT GCAGGCCGCC GTGGTCGGCC GCTCGGACGC GGCGCTGCCG GTCGCACAAG GCGGGATCTT CGTGCTCGAC ATGGGCGAGC CGGTGAAGAT CGTCGATCTC GCGCGGCAGA TGATCCGCCT CGCCGGGCTC AGGCCCGATC TCGACATCCC GATCCGCTTC ACCGGGCTCA GGCCGGGAGA AAAGCTGTTC GAGGAGCTGT TCCACGGCGC CGAACGGCCG ATCGAGACCG GGTTTCCCGG CCTGCTGATG GCCGCCCCGC GGGTGGCCGA CGCCGCCCTG GTCGGCCGGG CCTTCGACGA GCTGGCCGCC CTGATCCAGC GCGGCGAGGC GGCGGCCGCG CTCGCGGCGC TGGCAAGGCT GGTGCCGGAG TTCGGCGCGC AACCCCTCGC CGCGGCGGGC CCGACCGGCT AG
|
Protein sequence | MTLQRSQIRI AVNFALDGLL AALAVIGACW LADPAHPMPS PAVLPLAGSA AIWLVGVPFG LARQHWRFTA LPDAIAVGAS AVFAALLLVL LLVGVGARLP SASFPPLLMI TLGLALLAPR VLYRMARSRR EVLSDDAETA LLLGDGEGAE LFLAALSRES HQRYRVIGVL AGSARETGRR IHNVPILGEV SGLAAALDRL AEAGQMPAVL VVASRELVGP ALREIMDEAE RRGIRAARAP RPTTLSPTTP GEPETALRPI AIEDLLNRPQ VALDREGMAR MIQGRCVLVT GAGGSIGSEL ARQVAGFGPA RLILLDSSEF ALWRIDLELS EQVPGLARAA VIADVRDRAR IEALCAEWRP DLVFHAAALK HVPIVEANPL EGIATNALGT RNVADAARAA GAGLMVLIST DKAVNPSSVM GASKRLAEMY AQGLDVAARR QAGMRIVTVR FGNVLGSTGS VVPLFRRQLA RGGPLTVTHP DMRRYFMTVR EAVSLVLQAA VVGRSDAALP VAQGGIFVLD MGEPVKIVDL ARQMIRLAGL RPDLDIPIRF TGLRPGEKLF EELFHGAERP IETGFPGLLM AAPRVADAAL VGRAFDELAA LIQRGEAAAA LAALARLVPE FGAQPLAAAG PTG
|
| |