Gene Pars_0848 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0848 
Symbol 
ID5056123 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp753122 
End bp754177 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content60% 
IMG OID640468408 
Producthypothetical protein 
Protein accessionYP_001153085 
Protein GI145591083 
COG category[R] General function prediction only 
COG ID[COG4756] Predicted cation transporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.11518 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.339338 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACCTCC TTTTGCAGAT AATATTTCTC GTCGCGCTAG TCGCCGGGCC AGTACTCTCA 
AAGCGGATAG AGCACAACAT CGAGATCTAC TTCCTAGCCT TGGGCGTGGC GGGGGCAACA
ATCAGCAACC TCTGGAGCTG GCACCTCCTG GAAGAGGCCC TGCTCCACCC AGTTGCCGTC
TACCAGCCGG GCATAGGGTA CATCCCCGTA GGCATAACCC AGGTGGTTCT CTTCGCAGGC
CTCGCCTTCT ACTTCCTCCG CCACCGACTA GCCGGGTGGG CCGACAAGCT CGCCCAGCCC
ATAACCGTCG CCGTGTTGAT AGCCGTGATG GGGTTCTCGT CCAGCGTAAT ATCGGCGATA
GTGGCCTCCG CCATTATGGC AGAGCTCCTT GCCTTCGCCA GGGCGCCCCA CGCCTACAAG
GCAAAGGCCG CTGTATATGC GGCGTACGCC ATCGGCGCGG GCGCCGCCCT CCTCCCCATC
GGCGAGCCCC TCTCGACGAT TGCAGTGGCG AAGCTCAAGG CGCACTTCTT CTACCTAGTA
GACGTGTTGA TCGACGCCGT GGCCCTGGTG GTGATCTTCT TCGCGGCCTA CACCTATCTG
CAACTCAAGC GCTACAAGCC GGCGGAGGCG GAGATAATCC CCTACGAGCC GGAGCTGAAA
GAGGTGCCCC TTAGAGCCGT CAAGATCTTC ATCTTCATCT TCGCCTTGAC CATACTCGGC
GAGTTCTTTA AACCCTTAGC CAACGCCGCC GCGGCGCTCG GCAAAGAGCT CCTCTACATA
TTCGGCGCAA TCTCGGCAGT GGCTGACAAC GCGACGCTTG TAGCCGCCCT CGTCAGCCCA
GAAATGGCCG CCGAGGTCCT AAGAGCCTTC CTCATCTCGC TGGTCATTTC GGGAGGCTTC
ACCGTCCCCG GCAACGTCCC CAACATAGTG TTCGCAAGCG TCTTAAAAAT AGGATTCAAG
GAGTGGATAA AGCTGGCCCT CCCCATAGGA GTTGCCATAT TCGCCGCGAT GGGGGCATAC
GTCCTATTCA TCGTGCCTCA CCCGCCACTC GCTTAG
 
Protein sequence
MDLLLQIIFL VALVAGPVLS KRIEHNIEIY FLALGVAGAT ISNLWSWHLL EEALLHPVAV 
YQPGIGYIPV GITQVVLFAG LAFYFLRHRL AGWADKLAQP ITVAVLIAVM GFSSSVISAI
VASAIMAELL AFARAPHAYK AKAAVYAAYA IGAGAALLPI GEPLSTIAVA KLKAHFFYLV
DVLIDAVALV VIFFAAYTYL QLKRYKPAEA EIIPYEPELK EVPLRAVKIF IFIFALTILG
EFFKPLANAA AALGKELLYI FGAISAVADN ATLVAALVSP EMAAEVLRAF LISLVISGGF
TVPGNVPNIV FASVLKIGFK EWIKLALPIG VAIFAAMGAY VLFIVPHPPL A