Gene Pars_1107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1107 
Symbol 
ID5054964 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp993203 
End bp994708 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content54% 
IMG OID640468663 
Productextracellular solute-binding protein 
Protein accessionYP_001153337 
Protein GI145591335 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.192889 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.929287 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAACGA AGACAAGAAA TGTGGTAATT GCGTTGGTGG CTGTTTTAAT TATCGCCGTG 
GGGGCTTTTC TGGTTTTGCA AAGCCCCTCG CAACAGCCGC AGAAGACGCA GACATCGTCT
GCTCAGCCTT CTTCTACTTC GTCGGCTCAG CCTGGGCTGA GCGGGTCTCT GACTATTTTG
GTGCCGACAG GAGACCCCAC GCTTATGCCC TACATACAGC TCGCCGCGGG GGAGTTTATG
AAGAGGTATC CTGGGGTGAA GATAACTATA CAGCCTGTGC CTTTTGGCCA GATGGTGCAG
ACGGCCTTGA CGGCTTTGCA GAATAAAAAC CCCGACCCTG CTCTTATTAT CTTCTACCCG
TCGCAAGCGT CTACGCTTGG GCCTTATCTA ATGGATCTAC GGCCCTATCT CAGCTCTGGG
GTTCTCAACA AGTCGGATAT CCCCACCAGC GCTATGTTGT CTGTCATGAT GGTTGCCAAG
AACGGGACAG TTACGAAGAT CTTCGGCGTG CCTTTCCAGA TGGTGTTTGG GTACGTGCTG
GTGTATAGGA AGTCTATTTT TAACAACCCA GCACTTCAGT CCGAGTTTAG GCAAAAATAC
GGCTTCGACC TGGATCCCCT TACCTGGTCT TCGTGGGATC AGTTTGTCAG CGCCGCTGAG
TTCCTCCAGT CGAAGCAAGT GGCTAAGTAC GCGTTGCTGT TCCCAGATGG CTTGCAACAG
TCTATTTTCA ACGGTTTTAT TATGGTGTTC TACACCTATG CCCTTAATGA TCCGTGTGTA
GGCATCCCGG CCGACGTGGC GAAGGGCGCC GTTCCCACCC AGGGCTATTG GGCCTACTTC
CGCTACACGC CGGATGGCTC TGTGAACATC ACCGTGGGTT GTCCGTCTTT CTTACAAGCG
CTTAGGGCGT ATAAAAAGCT CGTGCAGTTC CAGCCGCCTA TCACCGTCCA GGCTATGGAG
TACGACCAGC TTCGGGACCT CTTCTTGACA GGTGACTACG CCATGGTGGC TGCCTGGACC
AGCTTCATAC CTATCTACAA CAACGCCTCG GTTTCCAAGG TGGCGGGGGA CATCGCCATA
TCGCCGCTTC CGGGGGGTAA ATACCCATTT GGCACTGGGC TTGCCCCCAC GTTTATCGGC
GTTAACCCAT ATGCGAAGGA TCCCGACTTG GCGGTGCGGT TCGTGGCTTT CTTGATGTCG
CCGGAGATGT ACAGACTCGG CGCCGAGAAG GTGGGGTTTG TGCCGGCTAC TTTGAGCGGG
ATTAGGGCCG CCTCCCAAGT GCCTTCTATG AGCTGGCTCG CGCCGTTTGT GCCGCTGTTG
CAGGCCGGCG CCGCTTTAAG CGATATTCAG CGGCTTACGT TGGTCAATAG GGTTACCAAC
TTCTTTACTG ATATGCGGCC CTACTTCATC AACCAGGTGG CTAGTTATCT CAGAGGCGAG
CAAGACGCCG AGACTACGCA GATGAACATA TACAAGACGT GGAAGAGCAT TATGAAAATT
TCATGA
 
Protein sequence
MATKTRNVVI ALVAVLIIAV GAFLVLQSPS QQPQKTQTSS AQPSSTSSAQ PGLSGSLTIL 
VPTGDPTLMP YIQLAAGEFM KRYPGVKITI QPVPFGQMVQ TALTALQNKN PDPALIIFYP
SQASTLGPYL MDLRPYLSSG VLNKSDIPTS AMLSVMMVAK NGTVTKIFGV PFQMVFGYVL
VYRKSIFNNP ALQSEFRQKY GFDLDPLTWS SWDQFVSAAE FLQSKQVAKY ALLFPDGLQQ
SIFNGFIMVF YTYALNDPCV GIPADVAKGA VPTQGYWAYF RYTPDGSVNI TVGCPSFLQA
LRAYKKLVQF QPPITVQAME YDQLRDLFLT GDYAMVAAWT SFIPIYNNAS VSKVAGDIAI
SPLPGGKYPF GTGLAPTFIG VNPYAKDPDL AVRFVAFLMS PEMYRLGAEK VGFVPATLSG
IRAASQVPSM SWLAPFVPLL QAGAALSDIQ RLTLVNRVTN FFTDMRPYFI NQVASYLRGE
QDAETTQMNI YKTWKSIMKI S