Gene Pars_1814 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1814 
Symbol 
ID5056026 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1629242 
End bp1630540 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content54% 
IMG OID640469360 
Productactin/actin family protein 
Protein accessionYP_001154017 
Protein GI145592015 
COG category[Z] Cytoskeleton 
COG ID[COG5277] Actin and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.347081 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTCTCCG ATGCCTATAG GCTGAAGTAC ACCTTCGGCG TAGACTACGG TACCAGCTAC 
GTCAAGTACG GCCCTATCAC GCTTAATGAG CCAAGGGTTG TGCAGACCAG GGGGCTGTTC
CTAAGGGACC TCCCTGAGTC GGTCAAGATG CGGATCCCCC CCGACGTGCT GTCGAGGGGG
CTAGTGGTCG GCGACGAGGA GGTGAGGAAG TACCTCTCAA GTGTGAGGGA TGTGCAACGC
AACCTAAAGT ACCCCCTTAG AGATGGCATA GCGAAGAGAG ATGATGAGGA CGCTTGGCGC
GTGTTAAAAG AGCTCGCGAG ATTCACCTTA GGCCAGTTTC CGATATCGGA CAAGGAATTT
GAGGGGTGGA TAATCTCCAT CGCCCTCTCC GCCTTGGCCC CCGACTATAT GTATAGGGCC
TTTTTCGACA TATACACAGA GCTATCAGAC GAGTTCAAAA TATACGCAGT GACGATTTTA
CCTCAGCCCC TAGCAGTGGC TATTGCGGAA AACGCCGTTA ACTGCATAAT TGTGGAGGGG
GGTCATGGCA ATATCCAAAT TGCGCCGATT AGCTTCGCCC TTATTCGGGA GGGGCTAGTG
GCGTTGAATA GAGGAGGCGC AGAGGCCAAC GCCATCACGA GGGAGATACT GAAAGACATG
GGCTACAGCG ATATAGCCAG GGAGGAATAC GCGGTCGAGA CTGTGAAAAG GGCAGTGGGC
CTAGTCCCGA AGAACTTGAA GGAGGCGATT AAGACAGCAA AGACTAACCC TGAGAGGTTC
GTGACGAAGG TACGGCTTTC TCCAGTCGTT GAGGTGGAGT TCCCCAAGGA GTATGCGTGG
ACTAGGTTCC TCATAGGAGA GATAGTATTT GATCCAAACC ACGAGGAGAT AAGTAGCTAC
ATTGAGCAGT CCCGCTTAAC TATTGAAAAC GCCGTGATCG GCGACGTCAC GCTCTACGGT
GAGATGGACG TGGCCACTGC GGTGATTACA TCGCTTAGGA ACGTCTCCGT GGAGATACAA
GAGAGAGTTG CCTCTCAGGT AATTCTAAGC GGCGGGGCCT TCAATTGGCG CGTCCCAGCA
GGTCTTGAAG ACGTGGCCGC CGACAGCGTC ACAAGGATTA AGATCGCTCT CGAGGAGAAA
AACCCCGTTC TCGCCTCGCG AGTGAACATA AGGATGGTGT CAGAGCCCCA GTACTCCGTG
TGGAGAGGTG CGGTGATCTA CGGCTACGCC CTACCCCTAA CTCTGGAGTG GTCTGATACC
ACGAAGGAGG GGTGGATGTA CCCAAAAAAG ACTAAGTAG
 
Protein sequence
MVSDAYRLKY TFGVDYGTSY VKYGPITLNE PRVVQTRGLF LRDLPESVKM RIPPDVLSRG 
LVVGDEEVRK YLSSVRDVQR NLKYPLRDGI AKRDDEDAWR VLKELARFTL GQFPISDKEF
EGWIISIALS ALAPDYMYRA FFDIYTELSD EFKIYAVTIL PQPLAVAIAE NAVNCIIVEG
GHGNIQIAPI SFALIREGLV ALNRGGAEAN AITREILKDM GYSDIAREEY AVETVKRAVG
LVPKNLKEAI KTAKTNPERF VTKVRLSPVV EVEFPKEYAW TRFLIGEIVF DPNHEEISSY
IEQSRLTIEN AVIGDVTLYG EMDVATAVIT SLRNVSVEIQ ERVASQVILS GGAFNWRVPA
GLEDVAADSV TRIKIALEEK NPVLASRVNI RMVSEPQYSV WRGAVIYGYA LPLTLEWSDT
TKEGWMYPKK TK