Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1560 |
Symbol | |
ID | 5054916 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1411467 |
End bp | 1412531 |
Gene Length | 1065 bp |
Protein Length | 354 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640469101 |
Product | glycosyl transferase family protein |
Protein accession | YP_001153766 |
Protein GI | 145591764 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.000000203447 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGATTGAGC TATTGTTCAT CTTGCCATTC GTAGGGCTGG CAGTGGCCGG CCTCGTTAGG GAATACATAT TTTGGAAAGG CCAAGACGAG GTTTTCCCCC AGAGCTGTGA AAAAATCACC GTAGTGGTGC CCATACGGGG TGTTCACGGA GCCACGGAGG AGAACCTAAA AGCTATTACG TCACAGAAGG CCGGGTCTGA GGTGGAGTAC ATCTTCGTAG TGGACTCCTA CGACGATCCG GCGTACACCA TCGCCCAGAA ATTCGGCAAG GTGGTTCTAA ACGCCGGGGA GGGCAAAAGC GGAGCATTGG CCACCGCACT GTCCCATGCG ACTGGGGACT GTATTGTATT CGCAGACGAC GACATAAGGC CAGGCCCCCG CTGGCTGGAG CTTATGACGA CTCCGCTATC TAAATTCACC GCCGTGACGA CGTATAGGTG GTACCTAGGC TCCGGCTTGT GCCACAAGAT AAGACTGGCA ATAAGCAACA TGGGCTTTCC TGCGATGCTT GATAAGAGAT CGAGGTTTGT TTGGGGCGGC TCTACGTCGT TTAGGAGCGA CTTCGCCAAG GCGACAAAAC TAGCAGACAG GTTGCCCAAC TTCATAAGCG ACGACTATGC CGTCTACTCG GCGATAAAGG AGATAGGAGG AAGTATATGG TTCGCAAAAG GCGCGGTTGC CCCGACGCCC GACCCCCAGT GCAGAGTAGC TGAGGCCTTC TGGTGGGGCA TAAGGCAGAT CCTCATGGTC AAGTGGCACG CCCCCGCAGG TTGGTACGCA GGCCTTTTTA TATATACGTT GGGCTTCATA ATTTCCGTGG TCCTGCCAGC TATAGGCGCC GCCCTGGGCG ACTACTGGCT GTTAACAGGG TTGGCGATCC ACCCAGTTAA CATAGCAAAA GATCTCGTTA GGGCGAGAGG CGTGGGCAGA CACGCCGGAA TACCCATTAA TTTAGCCACC GTGCTGTCTG CTTGGGCCGT GGGCAATATT GTAATACCGC TCGCCGTCTG GGCCTCGGCG TTTGTTAAAT GCGTAAACTG GAGGGGGAGG AGGATATGCC GGTAG
|
Protein sequence | MIELLFILPF VGLAVAGLVR EYIFWKGQDE VFPQSCEKIT VVVPIRGVHG ATEENLKAIT SQKAGSEVEY IFVVDSYDDP AYTIAQKFGK VVLNAGEGKS GALATALSHA TGDCIVFADD DIRPGPRWLE LMTTPLSKFT AVTTYRWYLG SGLCHKIRLA ISNMGFPAML DKRSRFVWGG STSFRSDFAK ATKLADRLPN FISDDYAVYS AIKEIGGSIW FAKGAVAPTP DPQCRVAEAF WWGIRQILMV KWHAPAGWYA GLFIYTLGFI ISVVLPAIGA ALGDYWLLTG LAIHPVNIAK DLVRARGVGR HAGIPINLAT VLSAWAVGNI VIPLAVWASA FVKCVNWRGR RICR
|
| |