Gene Pars_0468 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0468 
Symbol 
ID5055006 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp412418 
End bp413524 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content52% 
IMG OID640468033 
Productcytochrome oxidase,subunit I 
Protein accessionYP_001152718 
Protein GI145590716 
COG category[C] Energy production and conversion 
COG ID[COG1271] Cytochrome bd-type quinol oxidase, subunit 1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.117972 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGAGG CTGTATTCAT CGGTTTTGTT GGCCTCGGCC TTGCTGTAAC TCTGCACATA 
TTATTCACTG CAATGACGCT GGGCACTGGC CTAATCGCTG CGTACTACCG CTGGCTTGCA
TACAGGAAAA ACGACGCGTG GTCAGAGCTC TTCGCGAGGA AGGCATTCAA GGTATTGATA
GTATCTGAGC TTTTCAGCGG CGTGTGGGGG ACAATAATTA CCGTCTTCCT CGCCGGCTTC
TTCACAGCGC TTACGACATT GGCCACTAAC GCCTTGTTTA TCCCAATTGC CATCGCAATC
GCTTCTATAA TGGTGCGGAT CCCCACAATT GCAATAAGCT GGTACACCTG GGGCAAGATA
CATCCAAAGA CGCATTCTCT TATTATGTGG GTAATGGCCC TGTCTGGGTT TGGAATACCC
TTCGGCTTCC GCGCCATATT TGCAGAAATA AACAACCCGG TCGCGGTGGG CTACTACTTA
CAGACTGGCA ACAACCCTGG TTTTATGCCG TATGAGAACC CGCTTTTCTG GACGCTGTAC
TTACACACCG TGGCCGCTGT TATTTCCACT GGAGGCTTCG TAGCGGCTAG CTTGATGTCT
CTTGAGAAAG ACCCCAAAGG CATCTCCACC GCCTTGAAAA TAGGCTTCTT GGTACTCATG
GCGCAGCTCT CGCTGGGGCC GCTGTACTGG CTATCTCTCA GCTGGTATGC ACCTCTGCTC
TTCAACTCGG TGAATACCAC CTTCGGCCCG CTCTTCGCCA TCAAGCTGGT GGCAGTGGCC
ACGTTGTTTA TACTGGGGCT TCAGGCCTAT AGGAAAATCA AAGCCGGCGC GGCGGCGCCG
GGCTACGTAA AGTGGCTAGG CCCGCTTGCC CTATTTATCG TAGTTGCTGG AGAGGTGCTC
AACGACGGCT CCCACTACCC CTACATGGTC ATAGCGGGCG AAAAGGGACT GCCGGCGACA
CTGTTTGCGA ACTTCTATAT GGACATTCCG ATGGGCGCGG TATATGTAAT ACTGGCCTTC
TTCATATTCT CAGTTGTCGT GTTTACCACA GCCGCCTTCT ATGCCTTATA TAGAAGATTC
CTAGCAGAAG TCCCACAAGC TGAGTAA
 
Protein sequence
MNEAVFIGFV GLGLAVTLHI LFTAMTLGTG LIAAYYRWLA YRKNDAWSEL FARKAFKVLI 
VSELFSGVWG TIITVFLAGF FTALTTLATN ALFIPIAIAI ASIMVRIPTI AISWYTWGKI
HPKTHSLIMW VMALSGFGIP FGFRAIFAEI NNPVAVGYYL QTGNNPGFMP YENPLFWTLY
LHTVAAVIST GGFVAASLMS LEKDPKGIST ALKIGFLVLM AQLSLGPLYW LSLSWYAPLL
FNSVNTTFGP LFAIKLVAVA TLFILGLQAY RKIKAGAAAP GYVKWLGPLA LFIVVAGEVL
NDGSHYPYMV IAGEKGLPAT LFANFYMDIP MGAVYVILAF FIFSVVVFTT AAFYALYRRF
LAEVPQAE