Gene Information |
Locus tag | Pars_1766 |
Symbol | |
ID | 5055351 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 1586746 |
End bp | 1587840 |
Gene Length | 1095 bp |
Protein Length | 364 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640469311 |
Product | major facilitator transporter |
Protein accession | YP_001153969 |
Protein GI | 145591967 |
COG category | |
COG ID | |
TIGRFAM ID | |
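
The coordinate and length fields above can be cross-checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length (the gene sequence in this record ends in TAG). The function name `cds_lengths` is illustrative only and is not part of any database API.

```python
from typing import Tuple


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the last codon
    is a stop codon that does not contribute an amino acid.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the terminal stop codon
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = cds_lengths(1586746, 1587840)
print(gene_len, protein_len)  # 1095 364
```

Under these assumptions the result matches the Gene Length (1095 bp) and Protein Length (364 aa) fields.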
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.000930556 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAATATAA GGCTTATAAT AATGTTGGGG CTCGTCTCAC TTTTTGCCGA TTGGTTATAC GAAAGCATGC GCGCCGTGGC TCCGCAATAT CTATACATGC TGGGCGCAAC AGCGGTGTTT GTGGGCTTCG TTTTCGGGCT AGGCGACGCT TTGGGCTACG CCGCGCGTGT AGTGACGGGT CCTCTGGCCG ACAGGAGAGG CGGCTACTGG CTGGAGACTT TTCTCGGCTA TGGCCTACAG ATAGCCGCCG TCGGCGGCTT GATATTCGCA AAGGATCTAT GGCAAGCCGC CGGGCTGATT TTCTTGGAGA GGTTTGCCAA AGCTTTGAGG ACACCCGCGC GTGATGTGCT CATATCGGCC GCGGGAGGCG GCAAGGCGAA GGGCAGGGCC TTCGGCATCC ACGCGGCTCT GGATCAGATA GGGGCTATTA TCGGCGCCGC TATGGCTACG GCGATGTTGT ATATGTACTA CACGCCAAGG GACGTCTTTG CAACGGCTTT GCTTCCCGGT GCCGTCGCAC TGGCTCTACT CTACGCGGCG TATAGGCTAA GCGGTGTGAG GCCGTCCGGC AGAGGCCGTG TCGGCGGGGG ATGGAGGGCG GCTACGGCCT TTGCGGCTAC GCAGTTTTTC CTCGGCCTCT CCCTAACACA CATCTCGCTG TTTCAGTACA GGCTAGCCGA GGTTCCTTGG CTCGCCTCGT TGCTGTTCCT AATAGCTATG ATCGCCGAGG TGCCTGCCTC ATTGCTGTTG GGTTTTCTCC ACGACAAATC GTCTAAGGCG CTTCTCATAG GGCCCGTATT CACCGTGTTG CTCGCGCTGT CGTTCATGGC GGGTGGGCAT TACTTGTTCT TGGGCGCAGC GCTGTACGCA GTAGCTACTT CCTATGCCGA TGTGGTGGCG AAGGCCTACG CGGCGAAGCT AGGCGCTGCT GCCTCGTTAG GTCTCGTCAA CGCGATGTGG GGACTAGGGC TGTTAGCTGG CGGGGTAGTC TACGGCTTTT TAACAGACAT GGGGATTTAC TGGGCAATCG GGGCACTAGC CTCCGCCGCC TCGTTGGCCT CTTTCTACAT GCTATGGAGA TTGACCACGT ACTAG
|
Protein sequence | MNIRLIIMLG LVSLFADWLY ESMRAVAPQY LYMLGATAVF VGFVFGLGDA LGYAARVVTG PLADRRGGYW LETFLGYGLQ IAAVGGLIFA KDLWQAAGLI FLERFAKALR TPARDVLISA AGGGKAKGRA FGIHAALDQI GAIIGAAMAT AMLYMYYTPR DVFATALLPG AVALALLYAA YRLSGVRPSG RGRVGGGWRA ATAFAATQFF LGLSLTHISL FQYRLAEVPW LASLLFLIAM IAEVPASLLL GFLHDKSSKA LLIGPVFTVL LALSFMAGGH YLFLGAALYA VATSYADVVA KAYAAKLGAA ASLGLVNAMW GLGLLAGGVV YGFLTDMGIY WAIGALASAA SLASFYMLWR LTTY