Gene Pars_1590 details


Gene Information

Locus tag: Pars_1590
Symbol:
ID: 5054937
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1439658
End bp: 1440881
Gene Length: 1224 bp
Protein Length: 407 aa
Translation table: 11
GC content: 57%
IMG OID: 640469131
Product: basic membrane lipoprotein
Protein accession: YP_001153796
Protein GI: 145591794
COG category: [R] General function prediction only
COG ID: [COG1744] Uncharacterized ABC-type transport system, periplasmic component/surface lipoprotein
TIGRFAM ID:
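
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon (the record does not state either convention explicitly); the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Pars_1590.
length_bp = gene_length(1439658, 1440881)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.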


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.0787459
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAATACAA AACTTTTGGT AGCTCTAGTC GTTGTAATAA TCGCAGTGGC CGCCGCGGCG 
CTTCTACTAC TCCAGCAACC CCAGCAGGCG TCCACCCAGA CTACCCAAAC TCCAAGCAAG
GGCAATATAT ATGTAATATA CGATATCGGA GGTAGAGGCG ACCTCTCTTT TAACGACATG
GCTTACCTAG GCGCCTCCAA AGCCGCCAAG GATTTCGGCC TGGGGCTAAA GGAGGTGCAG
AGCAAAACGC AGGACGACTA CGTGCCTAAC CTGCGCGCGG CCGCCAGATC CGGCGATGCG
GCGTTAGTCG TCGCAGTGGG GTTCCTCATG ACCGATGCCG TGAAGCAAGT CTCCCAAGAG
TACCCCAACG CCAAGTTCGC CATAATTGAT GGCTACATTC CCGACCGGCC GAACGTGCTC
TCCGTCCTCT ACAGAGAGAA CGAGGGATCC GCCCTAGTTG GCGCACTGGC CGCGTTAACA
GCCTACCACT TCAACTGCAC CAAGGTCGGC ATAGTCCTAG GCATGGAAAT ACCCGTCTTG
TGGAAATTCG AAATTGGTTA CGCCTACGGG GTGAGGTGGG CCGAGCGCTA CCTAAGCCAG
AAGTTTGGGA AGAACGTCAA GTTCGACGTG CTCTACATCT ACACAGGCTC TTTCAACGAC
CCGGCCAAGG GCAAGCAGGC AGCTGAGGTA ATGCTTGCAC AAGGCGTATG TGTAATATAT
CAAGCCGCAG GCGCCACTGG ACTGGGAGTG TTTGAAGCCG TGGCCGAGGC AGGGAAGAAG
GCTGGGAGGA ATATGGGCCC GCCGTTTGCC ATCGGCGTAG ACGCCGACCA AGACTACCTA
AAGCCAGGCT TCATCCTTGC CTCTATGATG AAGAGGGTCG ACGTGGGCGT CTACACAGCC
GCGAAGATGG CCGTAGAGGG CAATTTCAAG GGCGGCGTGC TTGAGCTTGG CTTAAAGGAG
GGCGGGGTGT CGGTAAGCAC CCTGAGCGAC TTGCGGCAGT TTATAGAAAT AGGCGTAAGC
GCCGGGGCCG TGAGGAGGGA GGACGCCGAT AAGATTGTGG CAACTGTAAG CGATATGAGG
TCCAAGATAC CGTCGTGGAT ATGGGAGGCG GTTGATCAGC TTAAGCAAGA CATCATAGCC
GGCAGGGAGA AGGTGCCTCT GCCCACCGCC CAGGACCAGG TGGTGCAACT TAGGAAAGAG
TTGGGCCTCG GCGTCGCCGG GTAA
 
Protein sequence
MNTKLLVALV VVIIAVAAAA LLLLQQPQQA STQTTQTPSK GNIYVIYDIG GRGDLSFNDM 
AYLGASKAAK DFGLGLKEVQ SKTQDDYVPN LRAAARSGDA ALVVAVGFLM TDAVKQVSQE
YPNAKFAIID GYIPDRPNVL SVLYRENEGS ALVGALAALT AYHFNCTKVG IVLGMEIPVL
WKFEIGYAYG VRWAERYLSQ KFGKNVKFDV LYIYTGSFND PAKGKQAAEV MLAQGVCVIY
QAAGATGLGV FEAVAEAGKK AGRNMGPPFA IGVDADQDYL KPGFILASMM KRVDVGVYTA
AKMAVEGNFK GGVLELGLKE GGVSVSTLSD LRQFIEIGVS AGAVRREDAD KIVATVSDMR
SKIPSWIWEA VDQLKQDIIA GREKVPLPTA QDQVVQLRKE LGLGVAG
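
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below is one way to spot-check that correspondence using only the Python standard library. It is an illustration rather than the database's own method: the `translate` helper is hypothetical, and the codon table is the standard one, whose sense-codon assignments translation table 11 shares (table 11 differs in which codons may act as starts).

```python
from itertools import product

# Standard codon table in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 24 bases of the gene sequence in the record.
print(translate("ATGAATACAA AACTTTTGGT AGCTCTAGTC"))
print(translate("TTGGGCCTCG GCGTCGCCGG GTAA"))
```

The two printed fragments can be compared against the first ten and last seven residues of the protein sequence above.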