Gene Information |
Locus tag | Pars_0212 |
Symbol | |
ID | 5056412 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 191229 |
End bp | 192317 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640467791 |
Product | hypothetical protein |
Protein accession | YP_001152479 |
Protein GI | 145590477 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2407] L-fucose isomerase and related proteins |
TIGRFAM ID | |
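The length fields in the table above follow from the coordinates. A minimal sketch of that arithmetic, assuming the Start bp and End bp values are 1-based and inclusive, and that the Protein Length excludes the stop codon (the function names are illustrative, not part of any database API):

```python
# Values copied from the Gene Information table.
START_BP = 191229
END_BP = 192317


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, not counting the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


gene_len = gene_length(START_BP, END_BP)
prot_len = protein_length(gene_len)
print(gene_len, prot_len)  # 1089 362
```

Both results agree with the Gene Length (1089 bp) and Protein Length (362 aa) rows. The strand does not affect the calculation, since it only determines which end of the interval holds the start codon.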
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.931385 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGAAGGAGG TGGCGCGGCG CCTATTACAG TCTGTGGGGT TTTTCAAAAA CCCAGAGACC CCGGCGCCTG AGAAGTTTCC CCTCATCATC CACGCAACTG GCGGCACCAC GGCGGCTGCC CTTGACCTCG TCAAGAAGAG CGGCGCCCGC GGAGCTGTGC TACTGGGTTT CGGCGAGCAC AACAGCTTCG CCAGCGTGCT CCACGCCAAG GCCGAGATAG AGGCGCTGGG TCTGCCCGTA GTAGCCTACC ACTGCCCCTC CTACAATCAG TGCGGAGACG TCGCGGCAAG GGCAAAGAAA GTCGCCGACA CCGCCTCCTC GCTGGTGGGC GCCAAGGCGG TTCTAATCGG CTCGGAGACA TACCAAGCAC AAGCCGCCCG GGAGAAGCTT GGCTGGGCTG TTGAGGTTGT GCCCCTGGAG AAGTTCGAGG AAGCAGTAGA CGCCTCGGAG CCGGACGACG AGCTTCTGAA GCTTTTTGGG GACGACAGAG TGGCTAAAGT AGCGACGGCC CTAGAGAAGG TCTCGGCAGG TGCGAACCTC GTCGCAATTC AGTGCTTCCC CTTCCTCATG AAGCGGCGCT ACACCCCCTG CCTGGCCCTT GCCCTGCTCA ACTCGAGGGG GCGAGTAGTG GCATGCGAAG GCGACTTGGC GGCGGGGCTC GCCATGCTTA TGTCGAGGGG GCTGACGGGG TACAGCGGCT GGATAGCCAA CGTCGTGTGT CACGGCGGCG CCGAGGCGGT CTTCGCCCAC TGCACAATAG CGCTTAACAT GGCGAAGAGC TGGCGGATCA TGCCCCACTT CGAGTCGGGC TACCCCCACG GCCTCGCCGC CGAGCTGAAA GAGGCGGTCT ACACCGCTGT GTCCATCTCG CCTAGGTTCA ACAAAGCCGC CCTGGGAAGG GTGGAGGTGG TGAGGAGCGG CAACTTCTTA CAGGAGGCTT GCCGCACCCA GGCCCAGGTG AGGTTTAGGA GGGCGGTGAA GCTGGAAGAG GAGGCCCCGG CCAACCACCA CGTCTTTACC CCCGGCGACG TCGTGGACGA GGCCGAGGCC GTGTTGAGGC TGTTGGCGAT CCCCACGTCG AGATATTGA
Protein sequence | MKEVARRLLQ SVGFFKNPET PAPEKFPLII HATGGTTAAA LDLVKKSGAR GAVLLGFGEH NSFASVLHAK AEIEALGLPV VAYHCPSYNQ CGDVAARAKK VADTASSLVG AKAVLIGSET YQAQAAREKL GWAVEVVPLE KFEEAVDASE PDDELLKLFG DDRVAKVATA LEKVSAGANL VAIQCFPFLM KRRYTPCLAL ALLNSRGRVV ACEGDLAAGL AMLMSRGLTG YSGWIANVVC HGGAEAVFAH CTIALNMAKS WRIMPHFESG YPHGLAAELK EAVYTAVSIS PRFNKAALGR VEVVRSGNFL QEACRTQAQV RFRRAVKLEE EAPANHHVFT PGDVVDEAEA VLRLLAIPTS RY
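The gene sequence starts with GTG while the protein starts with M, which is what translation table 11 gives when GTG is used as the initiator codon. The sketch below illustrates this on the first 30 bases of the gene sequence. It assumes the sequence is shown in coding orientation (5' to 3' of the coding strand), builds the codon table by hand rather than using a library, and treats only ATG, GTG and TTG as start codons, which is a simplification of the full table 11 start set. The helper name `translate` is illustrative.

```python
from itertools import product

# Codon-to-residue mapping shared by NCBI tables 1 and 11, in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Simplified start set (assumption): the most common bacterial/archaeal starts.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base blocks used above) is ignored, and an
    alternative start codon in the first position is read as M.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence shown above.
print(translate("GTGAAGGAGG TGGCGCGGCG CCTATTACAG"))  # MKEVARRLLQ
```

The output matches the first block of the protein sequence. Running the same function on the final nine bases (AGATATTGA) yields RY, matching the end of the protein, with TGA read as the stop codon.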