Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_2133 |
Symbol | |
ID | 5055250 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 1907200 |
End bp | 1908231 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640469685 |
Product | type II secretion system protein E |
Protein accession | YP_001154331 |
Protein GI | 145592329 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0630] Type IV secretory pathway, VirB11 components, and related ATPases involved in archaeal flagella biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.363948 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGCGTC TATTAACACG TCTTCTTGAA TGCGCCGAGT GCAAGAGGTC CTGTAAAGAA AAAGGCGTGT GCGACTTTAC AGAAGAAGAA ACCACGCTGT TAATTAGCCT CCTGTCAAGG ATCTACAAGA AGACAATTGA CGAGACCCTC GACTTCAGAT ACGACGTGCT GAGAAAAATA AACCAAAACA CGGCCGTCCA CGCCACCTTG GCAACTGTGG GGCTAGAACA GCTGGCTGAG TTCCTAGAGG ACGACGACGT AGAAGACGTA GTTCTAATAC CCGGTCGCCC GATATACATC ACGAGGAGGT ATGGCAAAGA AAAGATAGGG AAGATCAGCG AGGCGAAAAC TCTGAGGGCC CTCTTGAAAA TTGCGCATTT AAAGGGCGTT GAGTTAACCA CGGCCAATCC CTCCTTTAGA TATGGGCTCT CCTTCGGCGG ATATAGACTC AGGGTATCGA TAGACCTACC GCCCATCGTC CCGCACCCCC AAGCCTACGT GAGGGTGCAT AGGAAAAAGA TAACTGCCAA GGACTTGGTT AAGAGTGGGT TTCTCACCGG GGAGCAATTA AGGGAAATAG TTGCGTGGCT ACGAGAAGGC AGGCACGTAG TGGTATCAGG CCCGCCCGGT AGTGGAAAGA CCACGTTGCT AGCCGCAATA GACGACTTAA TCCCTCCACA TCTACAGCGG GTGTACATAG ATGAAGCCGA CGAGTTTGAA GACGACCCAA ACAAAAACCA GATAAAAATC AGAAGCGTCA ACAAGGCAAA AGAGGTATTA GCCTCCCTCA ACCGCAACAT AGACGTCATA TTCATAGGAG AGTTGCAGTA CGAAGACCAC TTCGCCGCCT TCAGAACCGC GTCTGAGATG GGACTACAAA CCCTCGCCAC CATGCATGCA ACTAACGTAG AAGACGCCCA GAAAAGGCTG AAAAGGCGGG GAATAGAGCT TCAAAACATT GGCATAGTAC AGCTCAGCAA GAAGTACGGA GCCATGGTAG AGCGGAAAGT CGCGGCGCTG TATGCTAAGT AG
|
Protein sequence | MLRLLTRLLE CAECKRSCKE KGVCDFTEEE TTLLISLLSR IYKKTIDETL DFRYDVLRKI NQNTAVHATL ATVGLEQLAE FLEDDDVEDV VLIPGRPIYI TRRYGKEKIG KISEAKTLRA LLKIAHLKGV ELTTANPSFR YGLSFGGYRL RVSIDLPPIV PHPQAYVRVH RKKITAKDLV KSGFLTGEQL REIVAWLREG RHVVVSGPPG SGKTTLLAAI DDLIPPHLQR VYIDEADEFE DDPNKNQIKI RSVNKAKEVL ASLNRNIDVI FIGELQYEDH FAAFRTASEM GLQTLATMHA TNVEDAQKRL KRRGIELQNI GIVQLSKKYG AMVERKVAAL YAK
|
| |