Gene Pars_1874 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1874 
Symbol 
ID5055749 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1677627 
End bp1679024 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content50% 
IMG OID640469420 
Productpreprotein translocase subunit SecY 
Protein accessionYP_001154077 
Protein GI145592075 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0201] Preprotein translocase subunit SecY 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.653631 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACTTTT TAACTTTCAT ACCCACAGTC ACGCGGCCAA CGCGGCGACT GCCACTCTCC 
AAGAGGCTTT TCTGGACAGC TGTTGTGGCT ACGGTGTACA TCTTGATGAC TATAACACCG
TTGTACGGAA TCCAGCGTGG CCAACAGCAA GCTACACAAC CAGGGCAACA ACTCCTCTCC
ATCATCTTCG GAACAGCCTA CGGCACCTTG GCACACCTTG GCATAGGACC CATAGTGATA
GCAGGCATCT TACTCGAGGT GTTTGCATTC TCGGGAATCT TAAATCTAGA CCTCAACAAG
AGGGAAGACC GTCTGAAATT TACTCTGCTT CTTAAATGGG CTGCGTTAGG AATAGCAGCA
ATAGAAGCCA CAGCTTACGT GCTCGGCGGG CAATTCGGCA CGGTAACCCC AGTAGGTGGG
GTGCTTATCA TCGCCCAGCT CCTGCTGGCC ACGATAATAA TAATGTTGCT TGACGACTTG
ATGTCCAAAG GCTGGGGGAT CGGGAGCGCC ATCAGCCTAA TAATATTCCT CGGCGTGACG
AGACAGCTGT TTCTTAGCCT CTTCTCGTGG GACGTGGCTG TAGATAACCA AGATCAGCCC
CACGTAGTCG GCCTAATACC TGCCCTAGCC GCCGCCATAT ACGATTTCAT CACAAGAGGC
GATGCTACAC AGTTGATAGG ATTAATCAAC CGGGGTGTTG TGCTAAAGGG CCAGACCTCC
TTAACTTACC TTCCTGACTT TGTTGGGTTA ATATCAACAA TTCTCTTGCT GTACGTCCTA
TTGTATCTGG AAATGATGAA AGTCAACATA CCAGTAACCG CAGGGCAGTA CAGAGGTATC
AAATTCACAA TTCCCCTCCG CTTCGTATAC GTCAGCGTCC TACCAATAAT CTTCACTACG
TACTCTCTGC TACTAGTAGG CCAGCTTCTG TTACCCTTTT ACAACCCGGA GCCCGGGACG
GGCAACCCCG TGGTAAATAC AATAATCCAC GTAATATTCC TCCCACACAG ATTCTTCCAC
GACATACCAG CTCTGGTACT ACACTACTTG ATCTACGTAG CGCTGGCAAT AGCCTTTGCA
TGGGTGTGGG TCCAACTCGC TGGCCTAAGC GCGGAGGATC AGGCAAAACA ATTTGCCCAG
TCGCAACTAC ACATACCAGG TTTTAGGCAG AGCGAAAAAA TCTTCGCAAA GATTCTGGAG
CGACCAATAA ACGCCTTGAC TATTATAAGT GGCTTTATCG CCGGCTCTTT CGCAGCGCTT
GGCAACATCC TCGGCGTATG GGGAAGCGGC GCAGGCCTAA TCCTACTCGT CGAAATTGGC
CTACAGTACT ATGCCCTAGT TATGCGCGAA CAGATAATGG AGATGTACCC AGGCCTAAAA
CAAGTGATAG GGCAATAG
 
Protein sequence
MDFLTFIPTV TRPTRRLPLS KRLFWTAVVA TVYILMTITP LYGIQRGQQQ ATQPGQQLLS 
IIFGTAYGTL AHLGIGPIVI AGILLEVFAF SGILNLDLNK REDRLKFTLL LKWAALGIAA
IEATAYVLGG QFGTVTPVGG VLIIAQLLLA TIIIMLLDDL MSKGWGIGSA ISLIIFLGVT
RQLFLSLFSW DVAVDNQDQP HVVGLIPALA AAIYDFITRG DATQLIGLIN RGVVLKGQTS
LTYLPDFVGL ISTILLLYVL LYLEMMKVNI PVTAGQYRGI KFTIPLRFVY VSVLPIIFTT
YSLLLVGQLL LPFYNPEPGT GNPVVNTIIH VIFLPHRFFH DIPALVLHYL IYVALAIAFA
WVWVQLAGLS AEDQAKQFAQ SQLHIPGFRQ SEKIFAKILE RPINALTIIS GFIAGSFAAL
GNILGVWGSG AGLILLVEIG LQYYALVMRE QIMEMYPGLK QVIGQ