Gene Pars_1982 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1982 
Symbol 
ID5055105 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1773808 
End bp1774797 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content44% 
IMG OID640469529 
Producthypothetical protein 
Protein accessionYP_001154181 
Protein GI145592179 
COG category[S] Function unknown 
COG ID[COG1340] Uncharacterized archaeal coiled-coil protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.411826 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTAAGCA AGGAGGAGTT GCTTGCGAAG ATTCGAGAGG TTAATTCTCA ACTCGATGAA 
ATTCAGAAAC AGATTGATAA CATAACGAAT GAGATCAACA CCCGAAGGAC GTTGCTTGAC
GAAGTTCGCA GGCAACTTGC CGAGGTGAGG TCGTTGATAG AGGGCAAGCG ACAGCAGTTG
CAGAAAACTA GAGAGCTGAT AGGGTCTCTG GTTGAGAGAA AAAGCCAGAT AATAAACAAT
ATTAGAAATC TAAGAAATGA ATTATTACAA ACTAATATAC TGCTTCAAAA ATATAGAGAA
AAATTAACTA TATATAGAAA TTTGCTATCA ACATTTAACG ACTACGTAGG AGGAAAACCT
ATTGAAAAAG ACAAACTAAA AAGAATAATT GAACAGCTTG AGTACTTTTT TGAGACTTCG
CCAACTAACC CCGAGTGGGA GAGGCAGTTC ATCAAGTACA TAAGCCAAAT CGAGAAGGAG
CTAAACCTTG CAGATTCCAT GGAGAAAGTA AAGGCCCACA TAACGGAGTT GAAAAACCAG
ATGGACGAGT TTAAGAACAA GAGAGAGGCA ATTAGAAATG AAATAGCCAG ACTTATACAA
GACCTCAACA CGGTGAAGCA GGAGCTGAGC CAGCTCAAGG CGAGTAGGCA GGAGATTTAC
AAACAGCTAG CGGAGCTGAA ATCGCGTAGA GAGGAATTGA AAAAACGGCG AGAGGAGATT
AAGTCTGAGA TCCTGCAACT CGCGCTGAAA CGCAAGGAGC TTAGAGAAAA GAGGCGGGCC
GTACAGGACG AGCTGGACAA ATACACAGTC TTGCTTAAGG CTGTCGAGCT TGCAGAGAAG
AACAAGGCGC GGTCTGCCGC CCAAGCCGCC CGTGCCGAGT CGTTGAAAGA GAGGGCCGAG
ATTCTGTTTA ACAAGCTAAT GAGCGGCGAG AGGCTAACTC ATGAGGAGAT TAAAATCCTC
GTAGAGGCTG GTTTCTTGCC CGAGGAGTAG
 
Protein sequence
MLSKEELLAK IREVNSQLDE IQKQIDNITN EINTRRTLLD EVRRQLAEVR SLIEGKRQQL 
QKTRELIGSL VERKSQIINN IRNLRNELLQ TNILLQKYRE KLTIYRNLLS TFNDYVGGKP
IEKDKLKRII EQLEYFFETS PTNPEWERQF IKYISQIEKE LNLADSMEKV KAHITELKNQ
MDEFKNKREA IRNEIARLIQ DLNTVKQELS QLKASRQEIY KQLAELKSRR EELKKRREEI
KSEILQLALK RKELREKRRA VQDELDKYTV LLKAVELAEK NKARSAAQAA RAESLKERAE
ILFNKLMSGE RLTHEEIKIL VEAGFLPEE