Gene Pars_2133 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_2133 
Symbol 
ID5055250 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1907200 
End bp1908231 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content50% 
IMG OID640469685 
Producttype II secretion system protein E 
Protein accessionYP_001154331 
Protein GI145592329 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0630] Type IV secretory pathway, VirB11 components, and related ATPases involved in archaeal flagella biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.363948 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGCGTC TATTAACACG TCTTCTTGAA TGCGCCGAGT GCAAGAGGTC CTGTAAAGAA 
AAAGGCGTGT GCGACTTTAC AGAAGAAGAA ACCACGCTGT TAATTAGCCT CCTGTCAAGG
ATCTACAAGA AGACAATTGA CGAGACCCTC GACTTCAGAT ACGACGTGCT GAGAAAAATA
AACCAAAACA CGGCCGTCCA CGCCACCTTG GCAACTGTGG GGCTAGAACA GCTGGCTGAG
TTCCTAGAGG ACGACGACGT AGAAGACGTA GTTCTAATAC CCGGTCGCCC GATATACATC
ACGAGGAGGT ATGGCAAAGA AAAGATAGGG AAGATCAGCG AGGCGAAAAC TCTGAGGGCC
CTCTTGAAAA TTGCGCATTT AAAGGGCGTT GAGTTAACCA CGGCCAATCC CTCCTTTAGA
TATGGGCTCT CCTTCGGCGG ATATAGACTC AGGGTATCGA TAGACCTACC GCCCATCGTC
CCGCACCCCC AAGCCTACGT GAGGGTGCAT AGGAAAAAGA TAACTGCCAA GGACTTGGTT
AAGAGTGGGT TTCTCACCGG GGAGCAATTA AGGGAAATAG TTGCGTGGCT ACGAGAAGGC
AGGCACGTAG TGGTATCAGG CCCGCCCGGT AGTGGAAAGA CCACGTTGCT AGCCGCAATA
GACGACTTAA TCCCTCCACA TCTACAGCGG GTGTACATAG ATGAAGCCGA CGAGTTTGAA
GACGACCCAA ACAAAAACCA GATAAAAATC AGAAGCGTCA ACAAGGCAAA AGAGGTATTA
GCCTCCCTCA ACCGCAACAT AGACGTCATA TTCATAGGAG AGTTGCAGTA CGAAGACCAC
TTCGCCGCCT TCAGAACCGC GTCTGAGATG GGACTACAAA CCCTCGCCAC CATGCATGCA
ACTAACGTAG AAGACGCCCA GAAAAGGCTG AAAAGGCGGG GAATAGAGCT TCAAAACATT
GGCATAGTAC AGCTCAGCAA GAAGTACGGA GCCATGGTAG AGCGGAAAGT CGCGGCGCTG
TATGCTAAGT AG
 
Protein sequence
MLRLLTRLLE CAECKRSCKE KGVCDFTEEE TTLLISLLSR IYKKTIDETL DFRYDVLRKI 
NQNTAVHATL ATVGLEQLAE FLEDDDVEDV VLIPGRPIYI TRRYGKEKIG KISEAKTLRA
LLKIAHLKGV ELTTANPSFR YGLSFGGYRL RVSIDLPPIV PHPQAYVRVH RKKITAKDLV
KSGFLTGEQL REIVAWLREG RHVVVSGPPG SGKTTLLAAI DDLIPPHLQR VYIDEADEFE
DDPNKNQIKI RSVNKAKEVL ASLNRNIDVI FIGELQYEDH FAAFRTASEM GLQTLATMHA
TNVEDAQKRL KRRGIELQNI GIVQLSKKYG AMVERKVAAL YAK