Gene Pars_1895 details


Gene Information

Locus tag: Pars_1895
Symbol:
ID: 5055528
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1700747
End bp: 1702162
Gene Length: 1416 bp
Protein Length: 471 aa
Translation table: 11
GC content: 49%
IMG OID: 640469444
Product: binding-protein-dependent transport systems inner membrane component
Protein accession: YP_001154098
Protein GI: 145592096
COG category: [E] Amino acid transport and metabolism; [P] Inorganic ion transport and metabolism
COG ID: [COG1173] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components
TIGRFAM ID:
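
The gene length and protein length listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon; the function names are illustrative and not part of the database:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The stop codon is counted in the gene length but encodes no residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(1700747, 1702162)
print(length)                     # 1416
print(protein_length_aa(length))  # 471
```

Both values agree with the Gene Length and Protein Length fields of this record.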


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATAAGAG AATTTCTAAA AAGCCCCTCG GGTCTGGCAG GTACGATAAT TTTAGTCACT 
TTTTCTCTGA TATCGGTATT TGTTGTTATG ACATTTCCGA TGGATTACGG TACCAAGTAC
TGGTCAAATC CAAAATACTG GGAGGACTAC CCTAAGCTGG TGCCTCCGGT GTGGTACAAC
TACTTTGTAC CGTATAAACT GCCACAGCAC TTTGTAAAAG ATCTAACTAG CCCCTCAGAG
ATGGTGGATA ATGTAAAAAA GTGGGTTGTT GAGTATAAAT TTGAGGCTGA TAAGTTCCCC
ACAGGGATTA TTTTCAAGTA CATCAACTTG ACGTTCTATG GCGACGTGCC GGTGATTTCG
CTGAAAGTAA AAAGGCCAGA TGGCAAGGAG GTGGTTTTGT TAGACTACGA CGGGGGTATA
CCACCGCCAA GGTCGGGCGA GAAGGCGCCG TACGTGCGGT TTGTGGAAAA CCCCAGAACC
ATAATACTCA TCTCAGATCC TCAAGCAGTT AGAAAAGTTA CAGTATTTGC AGAAGAGCTG
GGTGTAAACT GTACTACGGC AGATGTTAGA GACGCCGGGC TTCTCCCCTA CATAGTTTTT
GGCACGCCTA TCTCAAGTAA GTGCGTCGCC GAGTCTTTCA AACCACTGAA AGGGAGCTAC
GTATTCGAGG TGGAGATGGT GGGTGATCCT AAAGATGAGA TAGGGCTACT CCGCCTAGTA
GTCCAAGGCG CTGTCTACGG CGCCATGGGC ACAGACTATT TAGGACGCGA CTTGGCACAA
GGCCTCCTAT TCGGCTTTCC AGTCGCCTTA TTTATAGGTG GTGTAGTCGC GCTCCTCGCT
ACGCTTATAG GATTAGCCCT CGGTATTATA AGCGGGTACA TAGGCGGCAA GGTCGACGAG
GCAATACAGA GGTTCGCAGA CGTTTTGAAT AACCTCCCCT TGTTGCCTCT TTTAATACTG
TTTGTCTTCG TGCTCGGCCG TAGCCTCTTT AACATTATAT TAGTGTTGGT CGTGTTCGGT
TGGGCCGGTA CTACCATCAT TGTCAGGTCC ATGGTCCTCA GCATAAAGAC GAGCCAGTTC
GTGGAGGCGG CTAAACTGGC GGGGGCAAGC CACTGGTGGA TAATGAGGAA GCACATACTG
CCGCCTGTCT TGCCCTACGC CTTTGCCCTT ATGATCTTTG CAATACCCGG GGCGATACTG
TCTGAGGCAA GTCTCAGCTT CTTGGGACTC GGCGATCCCA GCATCCCGAC ATGGGGGCAA
ATTCTACAAC AAGCGTTTGA CAACGGCGCG TTGCAAAACT TTGCGTGGTG GTGGATACTA
ACGCCCGGCT TCTTGATCGT TATCACAGCC ATAGGCTTTG TCCTGGTATT CTTCGCGCTG
GAGCCTATTG TCAATCCGAG GTTGAAGAGG CAGTAA
 
Protein sequence
MIREFLKSPS GLAGTIILVT FSLISVFVVM TFPMDYGTKY WSNPKYWEDY PKLVPPVWYN 
YFVPYKLPQH FVKDLTSPSE MVDNVKKWVV EYKFEADKFP TGIIFKYINL TFYGDVPVIS
LKVKRPDGKE VVLLDYDGGI PPPRSGEKAP YVRFVENPRT IILISDPQAV RKVTVFAEEL
GVNCTTADVR DAGLLPYIVF GTPISSKCVA ESFKPLKGSY VFEVEMVGDP KDEIGLLRLV
VQGAVYGAMG TDYLGRDLAQ GLLFGFPVAL FIGGVVALLA TLIGLALGII SGYIGGKVDE
AIQRFADVLN NLPLLPLLIL FVFVLGRSLF NIILVLVVFG WAGTTIIVRS MVLSIKTSQF
VEAAKLAGAS HWWIMRKHIL PPVLPYAFAL MIFAIPGAIL SEASLSFLGL GDPSIPTWGQ
ILQQAFDNGA LQNFAWWWIL TPGFLIVITA IGFVLVFFAL EPIVNPRLKR Q
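
The sequences above are displayed in space-separated blocks of ten characters, so they must be joined before any downstream use. A minimal sketch of that cleanup, together with a GC-fraction calculation of the kind that could be compared against the GC content field; the helper names are hypothetical, and no claim is made that the database computes its value this way:

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    # str.split() with no arguments splits on any run of whitespace,
    # which removes both the spaces between blocks and the line breaks.
    return "".join(text.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc = sum(1 for base in sequence if base in "GC")
    return gc / len(sequence)


# Usage on the first few blocks of the gene sequence (a fragment only):
fragment = clean_sequence("ATGATAAGAG AATTTCTAAA\nAAGCCCCTCG")
print(len(fragment), round(gc_fraction(fragment), 2))
```

Applied to the full gene sequence, `len(clean_sequence(...))` can be checked against the Gene Length field as a quick sanity test that no block was lost in copying.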