Gene Information |
Locus tag | Acel_0426 |
Symbol | |
ID | 4485661 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | - |
Start bp | 453423 |
End bp | 454523 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639729193 |
Product | glycosyl transferase family protein |
Protein accession | YP_872186 |
Protein GI | 117927635 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
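
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the gene length includes the stop codon. Neither is stated in the record, but both are consistent with the reported 1101 bp and 366 aa. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(453423, 454523)
print(length, protein_length(length))  # 1101 366
```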
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0815896 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.721825 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCAGC CCGAATCGGC CGGAGCGCCC GCACCGGACG GCGTCCCTAA GCCGGGCGAT TCGGCCAACC GGGACACCGT CGCTCGGCCG GACACCGTCC CCGGGGCGGG CGAATCGCCC AGTCCGGCCG AAGTGCCGAG GGTCACCATC GCTGTGATCA CGTGGAACGG CGCCCGGTTC ATCGACCGCT GCCTGGACGC CCTCGCATCC CAACGGGTCA CCGACAACCG CGGCCGGCAG ATCGAGCCGG AGATCTGGGT CATCGACAAC GCCTCGACCG ACAACACCAC CGCGCTGGCC GCAGCCCATC CGCTGCAACC CCGGCTGGTC TGCCTCGACA CCAACCGGGG ATTCGCCGGC GCCGCCGATT TCGCGATACG CCACGCGCAA ACCCCGTTCG TCGTCGTGCT GAACCAGGAC ACCGTCGTTG CTCCCGGCTG GCTCGCCGCG TTGATGCGGC CGTTCGCCGA ACCCGGCGGC GAGCGAATTG CGGCGACCAC CAGCAAAGTC GTCTTCGCGG ACAGCGGCCT GCTCAACAAC ACCGGCGTGC TCATCGGCCC GGACGGTTAC GGCCGGGACC GCGGATTCGG TGAACCGGAC GACGGGCGGT ACGACACCGA CCGTGACGTG CCGGCATTTT CGGGGACGGC GGCCGCGATC CGCGTCTCCG CCGCACGTGC CGTCGGATCG TTCGACCCGG ATTTCTTCTT GTACTACGAG GACACCGAGC TGTCGTGGCG GCTGCGTCGA GCCGGTTGGG CGATCCGTTA CGTGCCCGAG GCGGTGGTAC GGCACGAGCA CGCCGCCTCA ACCGACCCCC GCTCGCCGAT GTTCGCCTTC TACAACGAGC GGAATCGACT GTGGATGCTC ATCACCTGCG CCCCGATATG GCGGGCAGGC TGGGAATTTC TCCGGTTCCT TGCGATTACT TTGCTGCTGC CGCTCCGGAG GCTCATCGGA ATGTCCGTCC CCGCGACGCC GAATTTCTCC GTCCGGCTCC GCCTGCGGGT GGCGGCCGGC GTCCTCACCG CCCTGCCGGC CCTGCTGCGC AAACGCCGGC ACGTCCGCGC ACTCCGCCCT CCGCGGGAAG CGCGGCCGTA G
|
Protein sequence | MTQPESAGAP APDGVPKPGD SANRDTVARP DTVPGAGESP SPAEVPRVTI AVITWNGARF IDRCLDALAS QRVTDNRGRQ IEPEIWVIDN ASTDNTTALA AAHPLQPRLV CLDTNRGFAG AADFAIRHAQ TPFVVVLNQD TVVAPGWLAA LMRPFAEPGG ERIAATTSKV VFADSGLLNN TGVLIGPDGY GRDRGFGEPD DGRYDTDRDV PAFSGTAAAI RVSAARAVGS FDPDFFLYYE DTELSWRLRR AGWAIRYVPE AVVRHEHAAS TDPRSPMFAF YNERNRLWML ITCAPIWRAG WEFLRFLAIT LLLPLRRLIG MSVPATPNFS VRLRLRVAAG VLTALPALLR KRRHVRALRP PREARP
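
The relationship between the two sequences can be sketched as follows: strip the 10-character grouping, then translate codon by codon. This is a minimal illustration, not the database's own pipeline. It assumes the gene sequence is shown 5' to 3' on the coding strand (it begins with ATG although the gene lies on the minus strand) and uses the standard codon assignments, which NCBI translation table 11 shares with table 1. Alternative start codons, which table 11 permits, are not handled. All function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0 until the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First two 10-base groups of the gene sequence above.
prefix = "ATGACCCAGC CCGAATCGGC"
print(translate(prefix))  # MTQPES
```

Passing the full gene sequence to `translate` and `gc_content` gives values that can be compared with the Protein sequence and GC content fields.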