Gene Pars_2099 details

Gene Information

Locus tag: Pars_2099
Symbol:
ID: 5054948
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1874549
End bp: 1875760
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 66%
IMG OID: 640469649
Product: VWA containing CoxE family protein
Protein accession: YP_001154297
Protein GI: 145592295
COG category: [R] General function prediction only
COG ID: [COG2425] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain
TIGRFAM ID:
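
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(length_bp: int) -> int:
    """Number of residues for a CDS whose length includes one stop codon (assumption)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length_bp(1874549, 1875760)
print(length)                              # 1212
print(expected_protein_length_aa(length))  # 403
```

Both results agree with the Gene Length and Protein Length fields of the record.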


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.242069
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGGTCTCC TCCTCAACGT CGACTACGGC GACGCGTTGG TGAGGGCGAG GGCGCTTAGG 
GTATTGCGGG CCTCTGGGGT TAGGGCTGTG GGGGTTGAGG AGGCAGTGGA CGCCTATTAC
GTCCACTACA GGTCGCCCAT TTTCGGGGGG CGGGCCTCCA GCCCCGTGTG GGAGAGGTTC
CTCATGGCGT ATGTGAAGTC CCAATACTAC GGGGCGGTCT CCGCCGTCTC TAGGCTGAAC
CATAAGGCCT CTCTTGAAGC GGCGGTGAGG CTCCTCAAGG CGTTTGAGTC CTACCTCCGG
TACTTGGATA GCTACGGCAG GGCGTGGTTT GGGAGGGGGG CTAGAGAGGC ATGGGCGGTG
GCCATGAAGC AGATTAGGCG CCACTTGGGG GATCCCGCCG ATGTGGTGGA GCTCTACCGC
CTCTTCAAGC GGCTTGGGGA GGTGCTGGGG AGGGGGAGGT CGGACAGCCC TGGGGCGCTG
GCGCTGTCGG TGGCCTCCGA CCCGCGGCGG GTGAGGCTGT CGCGTATCTT GGCCAAGGCG
CTGGCTCTCT CATCGAGGCT TGGCGCCCTT CTCGACGGCG TGTGGGGACT GGGCGAAGAG
GAGGAGAGAG CCTACGGCTC TCTGCACCGC CTGAGGAGGG CCGCTCTGTA CGCCAAGGCC
CTCGCCCTGG GGGCGCCCCC GCTGTTCCTC CATAAGGCCG CCTCCGCCGA GCTCCCCGTC
TATAGGCAGG TGGGCGGCGG GGACAAGGGG ATATACCTCC TCGTTGACAA GTCGGGCTCT
ATGTACGGCG CGCTCGGCGG GGTTGAGAAG ATAGCGGTTG CGGCGGCGTA CGCCATAGCG
GTCTTGAGGA GGTTCACGAA CGTGGTCATA CGGTTCTTCG ACGTGGAGGT CTACGACCCC
GTCTCAGACG TGGAGAGGCT TGTCGACGTC TTGACGCGGG TGGCGGCTAG CGGGGGCACA
GATATCACTC AAGCGGTGGA GGCCGCTGTT GAGGACGCCG AGAGGAGGAG GCTGAGGGGC
TACGTCCTCG CGGTTGTCAC CGATGGCGAG GACGATAGGC TTAACCCCGT GGCCGTGAGG
GAGGCAAGGG CGGTGTTCCG CGACGTGGTG TTCGTGCTGC TGGGTGCCCA GAAGCCCCCG
CCCCACGCCC GCGCGGTGCG GATCTCCCCT AATGACCTCC GCCTCGCGGG GGCCGCCTCA
GCAGTTATTT AA
 
Protein sequence
MGLLLNVDYG DALVRARALR VLRASGVRAV GVEEAVDAYY VHYRSPIFGG RASSPVWERF 
LMAYVKSQYY GAVSAVSRLN HKASLEAAVR LLKAFESYLR YLDSYGRAWF GRGAREAWAV
AMKQIRRHLG DPADVVELYR LFKRLGEVLG RGRSDSPGAL ALSVASDPRR VRLSRILAKA
LALSSRLGAL LDGVWGLGEE EERAYGSLHR LRRAALYAKA LALGAPPLFL HKAASAELPV
YRQVGGGDKG IYLLVDKSGS MYGALGGVEK IAVAAAYAIA VLRRFTNVVI RFFDVEVYDP
VSDVERLVDV LTRVAASGGT DITQAVEAAV EDAERRRLRG YVLAVVTDGE DDRLNPVAVR
EARAVFRDVV FVLLGAQKPP PHARAVRISP NDLRLAGAAS AVI
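
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch, not the pipeline that produced the record: it assumes NCBI translation table 11, whose codon assignments match the standard code but which accepts alternative start codons such as GTG (read as M at the first position). The names `translate` and `gc_fraction` are hypothetical helpers, and the 10-base grouping with spaces is simply stripped before processing.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); table 11 uses the same ones.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Start codons listed for NCBI translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(dna: str) -> str:
    """Remove the whitespace used for 10-base grouping and upper-case the sequence."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS, reading an alternative start codon as M and stopping at a stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is translated as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence above give the first 10 residues of the protein.
print(translate("GTGGGTCTCC TCCTCAACGT CGACTACGGC"))  # MGLLLNVDYG
```

Applying `translate` to the full gene sequence should reproduce the protein sequence listed above, and `gc_fraction` can be compared against the GC content field in the same way.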