Gene Pars_2275 details

Gene Information

Locus tag: Pars_2275
Symbol:
ID: 5056203
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 2036576
End bp: 2037547
Gene Length: 972 bp
Protein Length: 323 aa
Translation table: 11
GC content: 61%
IMG OID: 640469827
Product: respiratory-chain NADH dehydrogenase, subunit 1
Protein accession: YP_001154471
Protein GI: 145592469
COG category: [C] Energy production and conversion
COG ID: [COG1005] NADH:ubiquinone oxidoreductase subunit 1 (chain H)
TIGRFAM ID:
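
The coordinate and length fields above are consistent with one another, and the relationship can be checked with a few lines of arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2036576
end_bp = 2037547

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 972, matching "Gene Length"
print(protein_length)  # 323, matching "Protein Length"
```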


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0754968
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.291328
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTCTCC TCTCGCCCGA GCTGTTGGCC GCCGTGGTCT TCCCAGGGGT GCTGGCCATG 
CTCGGCTTTT TGGTTGTGGC TATATGGGCT GAGAGGAAGC TTGTGGCGAG GATTCAGTGG
CGCTATGGCC CCCTCTACGT CTCAAAGCCC ATTGGCGGTT TCCTCCAGCC GATTGCCGAC
TTGGTGAAGC TGGTGTTCTC CGAGCTGGTG TTGCCGAGGC ACACTAACCG TTTCCTCTTC
GCCGCGACTC CGGTAATACT GTTTATCGCC GAGGCTCTGC CCGCGGCGTT TATAGCCGCG
GCGCCGGGCC TCGTGATTCT TTACAACCCA TACGGCGTGG TGATCGCCGC CGTCGTTATG
CTCCTCGTTG CTGTGTTTCT GGTGGCCATG GCCTGGACGG AGGCGGATAG GTGGACCTAC
ATCGGCGCGG TGAGGGAGAT ATTATTGACC GCCGCCTACG AGGTGCCCCT CCTCTTGTCC
ATTCTTGCCA TGGTTGTGCT TTACGGCACC GCCGACCCCT TCGGCGTTGT GGAGAAGCAG
TGGGTATGGG GGGTACTGCT CAACCCCCTG GCCTTTGTGG CGTTTTACAT CTCCCTCATG
ATGTCCACCA CGAGGTTTCC TTTCGAAATA CCAGAGGCCG AGCCTGAGGT GGTGCTGGGG
CCCTACACGG AATACGGCTC CACCCTCTTC ATCTTGTCCT TCGGCGGTAC GTATGTCAAG
ATGTACGCGG CCTCGCTCCT GGGCGTTGCG TTGTTTCTCG GCGGCTGGCT CCCCGCGGGC
GACACCGTGT CAGGGGCCGC CGTCACCGCC GCCAAGCTCG CGCTGTTTGT TCTGCCCCTC
CTCCTGGTGA GGGCGATTTA CCCCAGGTAC CGCATCGACC AGGCGCTGAG GCTGGGCTGG
ACTAAGCTAC TGGCTCTATC CGTTGCGGCA GTGGCCTGGT CTCTGGCGGC GAGGCTATGG
TTGGGTTTCT AG
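
The gene sequence is printed in space-separated blocks of ten bases, so it has to be cleaned before its length or GC content can be compared with the fields in the record. The sketch below shows one way to do that. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first row of the sequence is used as sample input. Pasting in the full sequence would allow a comparison with the reported 972 bp and 61% GC.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a sequence."""
    cleaned = clean_sequence(seq)
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return gc_count / len(cleaned)


# First row of the gene sequence above, used here only as sample input.
first_row = "ATGATTCTCC TCTCGCCCGA GCTGTTGGCC GCCGTGGTCT TCCCAGGGGT GCTGGCCATG"

print(len(clean_sequence(first_row)))
print(round(gc_fraction(first_row), 2))
```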
 
Protein sequence
MILLSPELLA AVVFPGVLAM LGFLVVAIWA ERKLVARIQW RYGPLYVSKP IGGFLQPIAD 
LVKLVFSELV LPRHTNRFLF AATPVILFIA EALPAAFIAA APGLVILYNP YGVVIAAVVM
LLVAVFLVAM AWTEADRWTY IGAVREILLT AAYEVPLLLS ILAMVVLYGT ADPFGVVEKQ
WVWGVLLNPL AFVAFYISLM MSTTRFPFEI PEAEPEVVLG PYTEYGSTLF ILSFGGTYVK
MYAASLLGVA LFLGGWLPAG DTVSGAAVTA AKLALFVLPL LLVRAIYPRY RIDQALRLGW
TKLLALSVAA VAWSLAARLW LGF
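
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is an illustration rather than the database's own procedure. It assumes the sense-codon assignments of translation table 11 are the same as those of the standard genetic code (the two tables differ in which codons may act as start codons), and it builds the table from the conventional TCAG ordering. The function name `translate` is hypothetical. Only the first and last rows of the gene sequence are used as sample input.

```python
from itertools import product

# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# Assumption: table 11 uses the same codon-to-amino-acid assignments as the
# standard code; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence above.
print(translate("ATGATTCTCC TCTCGCCCGA GCTGTTGGCC GCCGTGGTCT TCCCAGGGGT GCTGGCCATG"))
print(translate("TTGGGTTTCT AG"))
```

The first call should reproduce the opening residues of the protein sequence (MILLSPELLA AVVFPGVLAM) and the second its final three residues (LGF), with the terminal TAG read as a stop.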