Gene Pars_0608 details

Gene Information

Locus tag: Pars_0608
Symbol:
ID: 5054829
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 543293
End bp: 544627
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 57%
IMG OID: 640468166
Product: amino acid permease-associated region
Protein accession: YP_001152851
Protein GI: 145590849
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0531] Amino acid transporters
TIGRFAM ID:
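
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the coordinates are 1-based and inclusive (the record does not state its convention) and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 543293
end_bp = 544627
gene_length_bp = 1335
protein_length_aa = 444

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(span_bp == gene_length_bp)                      # coordinate span vs. gene length
print(remainder == 0)                                 # length is a whole number of codons
print(expected_protein_length == protein_length_aa)   # codons minus stop vs. protein length
```

Under these assumptions all three checks agree with the listed values: 1335 bp span, 445 codons, 444 residues.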


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.26378
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCTGTCC ACCGCGACTG GTTGCGCCGG AAGCTGGGGG TGTTCGAGAT GTTTGCCCTA 
ATTTACTCAG ACATCCAATC AACCTTCTAC TTCGTTCTTG GGTTTTTACT TCTCTCCGGG
GGTCAGTACG GGTTCTTTGC CGTAGCGTAC AGCATAGCGC TTATGGCGGC CATAGCGTTG
TCATACGGCG AAATGGGGTC AAGGTTTCCC GAAGCAGGAG GATCTTATCT CTACGTCAAG
TACTCATTCG GAAAGACAAT TGCCTATCTC TCAGTCTGGT TGCTGGCCTT TGACCAAATC
ATTATGATAA GCTACGGGAC AATAGACGCG GCCAAGGTGT TGCACAAGAT CTGGGGCGTG
GGGACAAGCG AGGCCGTAGT CGCGGCAGCC ATTTCGACAG CTCTCTTCGC TCTCTCCCTC
ATTGGCATTA GGGAGTCGGC TAATTTCGCC AAGGCGGTGG CGGTTTTCGA CGTGGTGATT
ATGGCCACTG TCATAGTCGT GGCACTGGCC ACCTACCCCG CGGCGCCGCC AAGCTTCAAC
TGGGACGGCG TCGTGGCGGC CAACCTCTTC TTCGCGTTTT CCCTCGCGTC GCGGGGCTAC
ACCGGCGTGG ATACCATAGG CCAGCTCGCC GGGGAGGCCA GGGAGCCCCT CGTACAGGTG
CCGAAGGCAA CTCTATTGGT AATCGGCATG GGAGTCTTCA TAGGTCTCGG CCTAACCGCC
GCGGCCATGA GCGCCTTGGC GCCGGGGGAT CTGCAAGACC CAGCCCTAGC GCCCGTGTAC
CTCGCACAGA AGGTCAACCA CGCGCTTCAG TACCTTGTCT CGGCGAGCAT AGTACTGGTA
ATGACCGTGG CAGCCCTGGC GGGCTACACC TCGTTTACGA GACTGACATA CATACTCTCC
GACGAGGGGC TCATGCCCTC CTTTTTCAAG AAGCTCCACA AGAAGTTTAG AACACCCCAC
TCTTCCCTTG CCTTGGCGTA TTTGGTGTCG CTACTGTTTA TTGCGCCGGG GGAGGTGGAC
CTCATATTGT CCATCTACGC CGTGGGTTCC CTCACCAACT ACATGCTCGT CGCGCTGGCC
TTAGCGAAGG CTGCCAGAAG CGGGACGCTA TACGGCGCAT TTAAAACACC ACTGATCAGG
GGAATCCCCC TATCAGCTCT ACTAGCCATA GTCATGGTGA CTGTTGGGCT AGCCTTGACA
ATACTCGAAA AATACCCATA CCTCTGGATA ATAGGCGTAT GGGCGGCGGC GGGGGCGCTT
TTCTACGCCG CCGTTGCTCA CAGAAGCGGG AAGACGGGCA ACCGCGGCAC ATCTACCACT
ACGAATAGCC GCTAA
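
The GC content reported for this gene is presumably the fraction of G and C bases in the gene sequence; the record does not say how it was computed, so that is an assumption. A minimal sketch of the calculation, using a hypothetical helper `gc_content` that strips the display spacing first:

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    # Remove the spaces and line breaks used to display the sequence in blocks.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)


# First line of the gene sequence above, used only as a short demonstration;
# pass the full sequence to compare against the reported GC content.
first_line = "GTGGCTGTCC ACCGCGACTG GTTGCGCCGG AAGCTGGGGG TGTTCGAGAT GTTTGCCCTA"
print(f"{gc_content(first_line):.0%}")
```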
 
Protein sequence
MAVHRDWLRR KLGVFEMFAL IYSDIQSTFY FVLGFLLLSG GQYGFFAVAY SIALMAAIAL 
SYGEMGSRFP EAGGSYLYVK YSFGKTIAYL SVWLLAFDQI IMISYGTIDA AKVLHKIWGV
GTSEAVVAAA ISTALFALSL IGIRESANFA KAVAVFDVVI MATVIVVALA TYPAAPPSFN
WDGVVAANLF FAFSLASRGY TGVDTIGQLA GEAREPLVQV PKATLLVIGM GVFIGLGLTA
AAMSALAPGD LQDPALAPVY LAQKVNHALQ YLVSASIVLV MTVAALAGYT SFTRLTYILS
DEGLMPSFFK KLHKKFRTPH SSLALAYLVS LLFIAPGEVD LILSIYAVGS LTNYMLVALA
LAKAARSGTL YGAFKTPLIR GIPLSALLAI VMVTVGLALT ILEKYPYLWI IGVWAAAGAL
FYAAVAHRSG KTGNRGTSTT TNSR
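
The gene sequence begins with GTG, while the protein sequence begins with M. That is consistent with translation table 11, in which GTG is one of the alternative initiation codons and is read as methionine at the start of a coding sequence. The sketch below illustrates this with a hand-built codon table; the `translate` function and constant names are hypothetical, and the start-codon set is the one listed for NCBI translation table 11 as far as I am aware. It is not the method used to produce this record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Table 11 uses the same
# codon-to-amino-acid assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: initiation codons of NCBI translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to display the sequence in blocks.
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        # An initiation codon in the first position is read as methionine.
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the gene sequence above.
print(translate("GTGGCTGTCC ACCGCGACTG G"))  # prints MAVHRDW
# Last five codons of the gene sequence above, ending in the TAA stop codon.
print(translate("ACGAATAGCC GCTAA"))  # prints TNSR
```

Both fragments match the corresponding ends of the protein sequence above (MAVHRDW... and ...TNSR).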