Gene Pars_1251 details


Gene Information

Locus tag: Pars_1251
Symbol:
ID: 5055489
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1130462
End bp: 1131730
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 60%
IMG OID: 640468794
Product: major facilitator transporter
Protein accession: YP_001153467
Protein GI: 145591465
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2271] Sugar phosphate permease
TIGRFAM ID:
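
The start and end coordinates, the gene length, and the protein length listed above agree with one another if the coordinates are read as 1-based and inclusive and the CDS ends in a single stop codon. Both points are assumptions about this record's conventions, not something the record states. A minimal Python sketch of that arithmetic (the function names are hypothetical):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a complete CDS: number of codons minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1130462, 1131730)
print(length)                  # 1269
print(protein_length(length))  # 422
```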


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0968424
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value: 0.933487
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTATTA GGACGGTTGT CGCGGCCTCG ACTATCGGGA CCCTTATTGA GTGGTATGAT 
TTTTTTGCGT ACTCATCTCT CTCCCCCTTC ATAGCCGAGT ACTTTTTCCC CAAGAGTGAC
CCGGCGGTTG CCATAATTTT GACGTGGCTT GTATTCGCCA CCGCCTTTGT GGTGAGGCCA
GTCGGCGCCG TCTTGTTCGG CCATCTGGGA GATAGGATAG GGCGTAAGTC CACCTTCCTC
ATAACGCTGA TAGTCATGGG CCTAGCCACC TTTTTCATGG GCCTAATCCC CACGTACGCC
CAGGCAGGCA TCGTTGCTCC TCTGCTACTG ACATTGCTGA GGATAGTACA GGGCATCGCG
CTGGGGGGCG AGTACGGCGG CGCCATCACA TACGTCTTGG AGCACGCCCC GGCGGGTAGG
AGGGCTTTTT ACAACGGGTT CGTAGCCGCC ACTCCGCCCC TCGGGCTGGG CCTCTCATCC
ATCACCGTGG TGTTGTCCTC GTTGCTCTTG ACAAAGGAGC AATTCGCTAC CTGGGGCTGG
AGGATGCCGT TCCTCGTCTC CATTATCCTC ACGGCGCTTG GGGTATACCT GCGCTTCAAG
CTTGCAGAGT CCCCCGTTTT TGAGGACATC AAAAAGAGGG GCGAGGTAGC CAGAGTACCC
ATCGCCGAGG TGCTGGGGAG GCACCTACCG TGGGTGCTGG TGGGGGTGGC GGTGGCCGCT
GGCCACGCGG TGTTGGCCTA CACGTCGACT GGCTACATAT TTACCTACTT GGTGCAGACA
GCTAAGCGGA CGCCTGTGGA GGCCAACATT ATAGTGGGCG CCGCGGCGCT GGCGCAAATA
CCCTTGTACC TATTAGCCGC GTGGCTTGGC GATAGGGTTG GGAGGAAGGC CGTCTACATG
ACGGGGCTGG CCATCGGCTT GGCAACCTAC TACCCCCTCT ACTACCTCCT GCCCTCCCTT
GACCTTTGGC TCGCCGCATT GGCCGTCTAC GTCATGGTTG GGGCCACCGC CTTCACATTC
GGCATCTTGG GCACGGCACT TGCGGAGCTC TTCCCCGCCA GGGTTAGGTA CAGCGGGATG
TCGCTGGCCT TCAACCTCGG CGTGGGGCTG TTCGGCGGCT TCACCCCCAC TATCGTCCAG
CTGATAGGCA CCCTCCTCAA AAACCCGCTT GCCGGGTTGT TGCTGTACAC ATACGTCGTG
GCCGCCGCGG CTCTGATAAT CGCGGCGCTC ATCCTGCCCG AGACTAAGTC AAAAGACGTC
GCCGCTTAG
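
The gene sequence above is printed in space-separated blocks of ten bases, so the spaces and line breaks have to be removed before its length or GC content can be compared with the values in the Gene Information table. A minimal Python sketch, assuming the sequence has been copied into a string (the helper names are hypothetical, and only the first line of the sequence is used here):

```python
def clean_sequence(text: str) -> str:
    """Strip spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as printed in this record.
first_line = "ATGTCTATTA GGACGGTTGT CGCGGCCTCG ACTATCGGGA CCCTTATTGA GTGGTATGAT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Running the same two functions on the full sequence gives the values to compare with the listed gene length and GC content.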
 
Protein sequence
MSIRTVVAAS TIGTLIEWYD FFAYSSLSPF IAEYFFPKSD PAVAIILTWL VFATAFVVRP 
VGAVLFGHLG DRIGRKSTFL ITLIVMGLAT FFMGLIPTYA QAGIVAPLLL TLLRIVQGIA
LGGEYGGAIT YVLEHAPAGR RAFYNGFVAA TPPLGLGLSS ITVVLSSLLL TKEQFATWGW
RMPFLVSIIL TALGVYLRFK LAESPVFEDI KKRGEVARVP IAEVLGRHLP WVLVGVAVAA
GHAVLAYTST GYIFTYLVQT AKRTPVEANI IVGAAALAQI PLYLLAAWLG DRVGRKAVYM
TGLAIGLATY YPLYYLLPSL DLWLAALAVY VMVGATAFTF GILGTALAEL FPARVRYSGM
SLAFNLGVGL FGGFTPTIVQ LIGTLLKNPL AGLLLYTYVV AAAALIIAAL ILPETKSKDV
AA
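
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is a minimal Python version under two assumptions: the amino-acid assignments of translation table 11 are the same as those of the standard code (the tables differ in the start codons they permit), and the CDS begins with ATG, as this one does. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons in the order TTT, TTC, TTA, TTG, TCT, ... paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()  # tolerate block-formatted input
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence in this record.
print(translate("ATGTCTATTA GGACGGTTGT CGCGGCCTCG"))  # MSIRTVVAAS
```

The first three blocks of the gene sequence translate to the first ten residues of the protein sequence, and the final nine bases (GCCGCTTAG) translate to the closing "AA" followed by a stop codon.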