Gene Pars_2281 details


Gene Information

Locus tag: Pars_2281
Symbol:
ID: 5054691
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 2042459
End bp: 2044087
Gene Length: 1629 bp
Protein Length: 542 aa
Translation table: 11
GC content: 64%
IMG OID: 640469833
Product: NADH/ubiquinone/plastoquinone (complex I)
Protein accession: YP_001154477
Protein GI: 145592475
COG category: [C] Energy production and conversion; [P] Inorganic ion transport and metabolism
COG ID: [COG1009] NADH:ubiquinone oxidoreductase subunit 5 (chain L)/Multisubunit Na+/H+ antiporter, MnhA subunit
TIGRFAM ID:
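
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are inclusive and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2042459
end_bp = 2044087
gene_length_bp = 1629
protein_length_aa = 542

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
codons = span // 3
expected_protein_length = codons - 1

print("span (bp):", span)
print("codons:", codons)
print("expected protein length (aa):", expected_protein_length)
print("matches record:", span == gene_length_bp and expected_protein_length == protein_length_aa)
```

Under these assumptions the computed span and protein length agree with the Gene Length and Protein Length fields.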


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.454821
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.480352
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGCTTG AGCTCTGGTT GCTGGTCCTC GCCCTACTCG CCTTAGCCGA TCTCTTCACC 
AAGAAGGGCA TAGGCTCGCT GGTAGGCGGC GCCGCTACCC TCTACCTATC CCTCACAAGG
TCGCTGATGC CGGCGACGCA CCTCTTCCAC ATGGGGGATC TCGCCCACCC ACTATATATC
TTCATATCTG GGATATATAC CGCCATCGCG GCGTACTCCA TCTGGTACGC CAGCCACTTG
GAGAGGAGGG GGTGGTTCTG GCTGTGGATG GGGGTGTTCT ACACCTCCAT GCTCACCTTC
GTCGCCGCCG ACCACTGGCT TGTTTTGATA ACGGGGTGGG GGGGTCTCGA CATAGCTAGC
TGGGCCCTAA TCCTCACCTA TCACGACGGT GAGAAGTACG GCCGCGTGGG GCCGGGGGGG
AGGGCATGGG GGGTGGCATG GGAGTGGGCG CCAAGCGCCT CGGCGCTGAG GGCTATCTTG
ACCGTGGAGA TAGGCACCGC CTCTCTCGCC GCGGGCCTTG CCCCGGCCGC CGCCGCGCAG
GGCCCGCACA TAAGCTCGCT GTCCGCCATG TCTGACCTCT CAGCGGCGCT TGTTCTGACG
GCGGCTTTCG TGAAGGCGGC CCAGCTACCC TTCACAGACT GGCTTATGAC CGCGATGTCG
GCACCTACGC CAGTCAGCGC CCTGCTCCAC AGCTCGACGA TGGTGAAGGC AGGGCCGATA
CTCCTCCTCA AGCTGGGACA CGCCATGCCC ACATGGGCTG CGGGGACGGC CTTCGCCTTC
GGCATCGCCA CTGCCTTGTA CGGAGGCGTC GTGGCGCTTG GGCAGAGAGA GCCGAAGGTC
CTCCTCGCCG CCTCCACTGC CTCATATCTA GGCCTCATAA CAGCCTTCGC CCTTGCGAAG
CCGGAGGAGG CGCTGTGGCT CACCTACTCC CATGGAGTGG CCAAGGCAAC TCTCTTCATG
GCGGTTGGCC ACGCCATACA TATAGAGCAC ACAACCACGC CCACCCGGTT CCCCGTGGCG
GCCAAGGCGG CTATGGGCCT TGCCCTCCTG ACGCTGGTGG GGCTGACCCC CCTAGGTGCA
GTCGCCAAGA GCAACGCCGA GCCTTGGTTC CTTCTGTTCT CCTTCCTGAC CGCGGGGTAC
GTGGGGAAGT TGATGCTAAA GACAGCCACC ACGCCGGGCG GCTGGGCGGT GGCGGCGCCG
TATACAGCGC TGGCGGCGGC CAGCTTGGCT TTCCCCGTCT TGCCTAACCC CTTCTGGGCC
CTCGCGCTTG CCGGCCTGGC ATTGGCGAAG ACGCCTGAGC CCACCGTTTT GCTCAGGCGG
CTGGGTCTAC CCGTTCTCTA TGACGCGGTG GCCCCCGCCG TGTTTAAGGC AGTGAGGCAG
GCCGCGGCGG TGGGAGACGG CTTTGTGGAC AAGCGCCTCT TATCACTGGA GGGGCTGTGG
CGGGGCTTGG CGTCCCTCGT CGCCGTCGTG GACTTAATCT TCGACATGTT GCTCCACGAC
TTCGTTCCCG CCCTAGTGCA GTCCGCCTCT GCCCAGCTAT CTAGGCGGAG TTTCGACTAC
TACCTATACG TGGCCGGCGT CGGCGCGGGG ATAATCTTGG CCCTCGCGGT GTTGCTATGG
ATCCACTAA
 
Protein sequence
MMLELWLLVL ALLALADLFT KKGIGSLVGG AATLYLSLTR SLMPATHLFH MGDLAHPLYI 
FISGIYTAIA AYSIWYASHL ERRGWFWLWM GVFYTSMLTF VAADHWLVLI TGWGGLDIAS
WALILTYHDG EKYGRVGPGG RAWGVAWEWA PSASALRAIL TVEIGTASLA AGLAPAAAAQ
GPHISSLSAM SDLSAALVLT AAFVKAAQLP FTDWLMTAMS APTPVSALLH SSTMVKAGPI
LLLKLGHAMP TWAAGTAFAF GIATALYGGV VALGQREPKV LLAASTASYL GLITAFALAK
PEEALWLTYS HGVAKATLFM AVGHAIHIEH TTTPTRFPVA AKAAMGLALL TLVGLTPLGA
VAKSNAEPWF LLFSFLTAGY VGKLMLKTAT TPGGWAVAAP YTALAAASLA FPVLPNPFWA
LALAGLALAK TPEPTVLLRR LGLPVLYDAV APAVFKAVRQ AAAVGDGFVD KRLLSLEGLW
RGLASLVAVV DLIFDMLLHD FVPALVQSAS AQLSRRSFDY YLYVAGVGAG IILALAVLLW
IH
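
The sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how the gene and protein sequences relate, the snippet below strips the block spacing and translates the first and last rows of the gene sequence. It assumes the standard codon-to-amino-acid mapping (which translation table 11 shares with the standard code for elongation codons); the helper names `clean_blocks` and `translate` are hypothetical and not taken from any library.

```python
from itertools import product

# Codon table in NCBI ordering (bases T, C, A, G); '*' marks a stop codon.
# Assumption: translation table 11 uses these same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_blocks(text: str) -> str:
    """Remove the spaces and line breaks used to print sequence blocks."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First and last rows of the gene sequence, copied from the record.
first_row = "ATGATGCTTG AGCTCTGGTT GCTGGTCCTC GCCCTACTCG CCTTAGCCGA TCTCTTCACC"
last_row = "ATCCACTAA"

first_peptide = translate(clean_blocks(first_row))
last_peptide = translate(clean_blocks(last_row))

print(first_peptide)
print(last_peptide)
```

The first row translates to the first twenty residues of the protein sequence, and the last row translates to the final two residues followed by a stop codon.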