Gene Pars_0394 details

Gene Information

Locus tag: Pars_0394
Symbol:
ID: 5054362
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 344178
End bp: 345410
Gene Length: 1233 bp
Protein Length: 410 aa
Translation table: 11
GC content: 59%
IMG OID: 640467961
Product: hypothetical protein
Protein accession: YP_001152648
Protein GI: 145590646
COG category: [S] Function unknown
COG ID: [COG1602] Uncharacterized conserved protein
TIGRFAM ID:
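
The length fields in the record agree with its coordinates. The Python sketch below shows the arithmetic, assuming that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(344178, 345410)
print(length)                  # 1233
print(protein_length(length))  # 410
```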


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGGTGT TTTTATACCA CAGAGCTGTG GGTTTTGTGA GAGGCGACCT CTGCGTCAAG 
TGCCGCGGCG GGCGTTATCT CTGCGGGTTG TCTTACTGCC CGTTGTTGGT GAGGCAAGCC
GCGGCGCCAT TTAGACAGCC GCCGCCTAAG GAGCTGTACG GCTCCAGCCC CCCTTCCGTA
TTTGTCGGCA GGATGGGGTA TCCCAAGGTG AGGCTCTACC CATCATCGCC GCCGGAGGTT
GGAGACACGA CGCCTTATGA AAACCCCGGG GAGTGGCTTC ACATGTCTCT AGAGCGCTTC
CTCGCAATGA GGCTCTCCTT GTACAGAGGA GCCGTCGTGC TTAGAGTTGA AGACGCGGCG
AGGCCCCCCA GGTTGCTTCA AGACGTCCAG TTGTTAGCCC TCTCACAAAG GCCAGTTGAG
GTGTATCTAC AATTCCGCAA GCCGCCCAGA GGCGTGCATT TCAGCGAATA TTCGCCGCCC
ATGGGTCCCT CTGCGCCCGC AGAGAGGGTA GAGGTCGAGG GAACACCCGC CCTGCCCAGA
GCCGCCGAAA AGGCCTACTC AGACGTAGAC CTAAAGGCGG CGGAGGCCGT GGTGGAGTTG
TACAGACACG GCCTAGAGGT GGCATACATT TCAAGGGCGC TAAGCGTCGG CGCCCTTGGG
GGGAGGCGAA GGAGGCTTGT CCCCACGCGC TGGGCGATCA CAGCCGTAGA TAAAATCATT
TCAGACCACC TTGTAGAGAA GGTGAAGGAT TATCCCGAGG TAGACGGCTA CTACCTATAC
GCCAGGAGGA CGGTGGGGAA CCTCTTCATA GCCATACTGG CGCCGTCTAA GTGGGCGTAC
GAGTGGGGGG AGGCCTTTGA GCCTCGCACG GTGTGGAACC CCGGCGGGTC GGTCGAGATG
GAGCTGGACT ACGAGCTCTA CGGCGGCCGC CGAGACTACC CGGAAATCGG CGGTTGCTAC
TACGCCGCCC GGCTCGCCAC TGCTGAGGCC CTTATGCGGA TGAGGAGACA AGCCGCTGCG
ATACTCTGGC GAGAGGTCTA CACAGGCTTC ACCACACCAA CGGGGGTCTG GTGGGTGAGA
GAAAACGTGA GGGCGATGTT TAAAGACGAG CCCGCTCGGT TTGACACACT GGAGGAGGCC
CTCGAGGCTG CGTCCTACCT CTTGAAAATC CCAATGGAGA GGTGGTTAAC CATGTCGAGA
ATAGTGCACC TACTCAAAAA CAGGCTGGTG TAA
 
Protein sequence
MMVFLYHRAV GFVRGDLCVK CRGGRYLCGL SYCPLLVRQA AAPFRQPPPK ELYGSSPPSV 
FVGRMGYPKV RLYPSSPPEV GDTTPYENPG EWLHMSLERF LAMRLSLYRG AVVLRVEDAA
RPPRLLQDVQ LLALSQRPVE VYLQFRKPPR GVHFSEYSPP MGPSAPAERV EVEGTPALPR
AAEKAYSDVD LKAAEAVVEL YRHGLEVAYI SRALSVGALG GRRRRLVPTR WAITAVDKII
SDHLVEKVKD YPEVDGYYLY ARRTVGNLFI AILAPSKWAY EWGEAFEPRT VWNPGGSVEM
ELDYELYGGR RDYPEIGGCY YAARLATAEA LMRMRRQAAA ILWREVYTGF TTPTGVWWVR
ENVRAMFKDE PARFDTLEEA LEAASYLLKI PMERWLTMSR IVHLLKNRLV
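
Both sequences are printed in blocks of ten letters, so the spaces and line breaks have to be stripped before the gene and protein can be compared. The Python sketch below does that, then translates the DNA and computes its GC fraction. It assumes the compact NCBI encoding of the codon table: translation table 11 assigns the same amino acids to codons as the standard table and differs only in the start codons it allows. The sketch does not handle alternative start codons, which does not matter here because this gene begins with ATG. All function names are illustrative.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first, second, third base each cycling through TCAG).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 21 bases of the gene sequence above, with the display spacing left in.
print(translate("ATGATGGTGT TTTTATACCA C"))  # MMVFLYH
```

Passing the full gene sequence to `translate` gives a string that can be compared with the cleaned protein sequence, and `gc_fraction` gives a value that can be checked against the listed GC content.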