Gene Pars_1496 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1496 
Symbol 
ID5054986 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1353768 
End bp1354781 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content54% 
IMG OID640469038 
Productbinding-protein-dependent transport systems inner membrane component 
Protein accessionYP_001153704 
Protein GI145591702 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0601] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.992492 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTAGTC TCTTGAGATA TGCGGCGTAT CGCGTGCTCT TAGCTATACC AACCCTTATT 
ATCCTTCTCA CAGTTGTATT CTTCATTTTG AGAGTGATCC CAGGAAACCC TATTGTGGCT
ATGGTGGGGA TGAAGGCGCC TCCGGAGTAT GTAGAACAGC TAATAAAGGA GGCCGGCCTC
GATAAGCCTC TGCCAGTCCA GTACGTGGAG TATATGGTCC AAGTGTTTAC GGGTAACTTG
GGGAGGAGCC TGATCTTTGG CAGGAGGGAG GTCGCCGCCG AGATTATGGA CAGGCTCCCC
GCCACAGTGG AGCTGGCTGT TTCAGCCTTT GTGGTGAGCG TTCTGTTGGG CCACGTTTTC
GGCTTCCTGG CGGCGAGATA CGGCGGGAGG GTAGATGCAG GCGCGAGGCT ATATGCCATG
GTCTCCTATG TGTTGTTTAT TCCATTCATA GGGCTGGCAC TACAGCTCGT ATTCGCCGTG
TGGCTTGGCT GGTTCCCCGT AGCTGGGAGA ATTACCCCGG GTCTCGAGCC GCCGAGAATT
ACGGGCCTGT ACCTGCTAGA CTCCCTTCTG GCGGGGCGCC TAGACTCGTT TATAGACGCC
TTGAGCCATA TTGTACTACC CTCTGTCACG CTGGGTCTTG TCCTGTCTGG GGTATTTGTG
AGACTTATCA GGAACAACAT GGTTAAAACC CTCGGCGAGG ATTTCATATC TGCATATAGG
GCTATGGGCT TCAGCGAGAG GGCTGTGTTG TGGAAGGCAT ACCGCGTCGC CATAGTGCCT
ACCGTTACTA TGATGGGGTT GCAGCTGGCG TTGTTGTTGC AAGGCGCCGT GCTTACAGAG
ACCACCTTCT CGTGGCCCGG GCTGGGCACC TTGTTGTTAG AACGAATACA ATACCTCGAC
TACACTACTG TGCAAGGCGC CGTCGTCGTG TTTGTGATAA TTGTTGTGGT GCTGAATGTG
GCCGCGGATC TGATAAACGC GGTTCTAGAT CCCAGAGTTA GGAGGGGGCT ATGA
 
Protein sequence
MASLLRYAAY RVLLAIPTLI ILLTVVFFIL RVIPGNPIVA MVGMKAPPEY VEQLIKEAGL 
DKPLPVQYVE YMVQVFTGNL GRSLIFGRRE VAAEIMDRLP ATVELAVSAF VVSVLLGHVF
GFLAARYGGR VDAGARLYAM VSYVLFIPFI GLALQLVFAV WLGWFPVAGR ITPGLEPPRI
TGLYLLDSLL AGRLDSFIDA LSHIVLPSVT LGLVLSGVFV RLIRNNMVKT LGEDFISAYR
AMGFSERAVL WKAYRVAIVP TVTMMGLQLA LLLQGAVLTE TTFSWPGLGT LLLERIQYLD
YTTVQGAVVV FVIIVVVLNV AADLINAVLD PRVRRGL