Gene Information |
Locus tag | Pars_1107 |
Symbol | |
ID | 5054964 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 993203 |
End bp | 994708 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640468663 |
Product | extracellular solute-binding protein |
Protein accession | YP_001153337 |
Protein GI | 145591335 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
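The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS length includes the terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values copied from the record for Pars_1107.
start_bp, end_bp = 993203, 994708

length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)

print(f"Gene length: {length_bp} bp")      # record lists 1506 bp
print(f"Protein length: {length_aa} aa")   # record lists 501 aa
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields. The Strand value (-) does not affect the length calculation; it only indicates that the gene is read from the reverse-complement strand of the replicon.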
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.192889 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.929287 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAACGA AGACAAGAAA TGTGGTAATT GCGTTGGTGG CTGTTTTAAT TATCGCCGTG GGGGCTTTTC TGGTTTTGCA AAGCCCCTCG CAACAGCCGC AGAAGACGCA GACATCGTCT GCTCAGCCTT CTTCTACTTC GTCGGCTCAG CCTGGGCTGA GCGGGTCTCT GACTATTTTG GTGCCGACAG GAGACCCCAC GCTTATGCCC TACATACAGC TCGCCGCGGG GGAGTTTATG AAGAGGTATC CTGGGGTGAA GATAACTATA CAGCCTGTGC CTTTTGGCCA GATGGTGCAG ACGGCCTTGA CGGCTTTGCA GAATAAAAAC CCCGACCCTG CTCTTATTAT CTTCTACCCG TCGCAAGCGT CTACGCTTGG GCCTTATCTA ATGGATCTAC GGCCCTATCT CAGCTCTGGG GTTCTCAACA AGTCGGATAT CCCCACCAGC GCTATGTTGT CTGTCATGAT GGTTGCCAAG AACGGGACAG TTACGAAGAT CTTCGGCGTG CCTTTCCAGA TGGTGTTTGG GTACGTGCTG GTGTATAGGA AGTCTATTTT TAACAACCCA GCACTTCAGT CCGAGTTTAG GCAAAAATAC GGCTTCGACC TGGATCCCCT TACCTGGTCT TCGTGGGATC AGTTTGTCAG CGCCGCTGAG TTCCTCCAGT CGAAGCAAGT GGCTAAGTAC GCGTTGCTGT TCCCAGATGG CTTGCAACAG TCTATTTTCA ACGGTTTTAT TATGGTGTTC TACACCTATG CCCTTAATGA TCCGTGTGTA GGCATCCCGG CCGACGTGGC GAAGGGCGCC GTTCCCACCC AGGGCTATTG GGCCTACTTC CGCTACACGC CGGATGGCTC TGTGAACATC ACCGTGGGTT GTCCGTCTTT CTTACAAGCG CTTAGGGCGT ATAAAAAGCT CGTGCAGTTC CAGCCGCCTA TCACCGTCCA GGCTATGGAG TACGACCAGC TTCGGGACCT CTTCTTGACA GGTGACTACG CCATGGTGGC TGCCTGGACC AGCTTCATAC CTATCTACAA CAACGCCTCG GTTTCCAAGG TGGCGGGGGA CATCGCCATA TCGCCGCTTC CGGGGGGTAA ATACCCATTT GGCACTGGGC TTGCCCCCAC GTTTATCGGC GTTAACCCAT ATGCGAAGGA TCCCGACTTG GCGGTGCGGT TCGTGGCTTT CTTGATGTCG CCGGAGATGT ACAGACTCGG CGCCGAGAAG GTGGGGTTTG TGCCGGCTAC TTTGAGCGGG ATTAGGGCCG CCTCCCAAGT GCCTTCTATG AGCTGGCTCG CGCCGTTTGT GCCGCTGTTG CAGGCCGGCG CCGCTTTAAG CGATATTCAG CGGCTTACGT TGGTCAATAG GGTTACCAAC TTCTTTACTG ATATGCGGCC CTACTTCATC AACCAGGTGG CTAGTTATCT CAGAGGCGAG CAAGACGCCG AGACTACGCA GATGAACATA TACAAGACGT GGAAGAGCAT TATGAAAATT TCATGA
|
Protein sequence | MATKTRNVVI ALVAVLIIAV GAFLVLQSPS QQPQKTQTSS AQPSSTSSAQ PGLSGSLTIL VPTGDPTLMP YIQLAAGEFM KRYPGVKITI QPVPFGQMVQ TALTALQNKN PDPALIIFYP SQASTLGPYL MDLRPYLSSG VLNKSDIPTS AMLSVMMVAK NGTVTKIFGV PFQMVFGYVL VYRKSIFNNP ALQSEFRQKY GFDLDPLTWS SWDQFVSAAE FLQSKQVAKY ALLFPDGLQQ SIFNGFIMVF YTYALNDPCV GIPADVAKGA VPTQGYWAYF RYTPDGSVNI TVGCPSFLQA LRAYKKLVQF QPPITVQAME YDQLRDLFLT GDYAMVAAWT SFIPIYNNAS VSKVAGDIAI SPLPGGKYPF GTGLAPTFIG VNPYAKDPDL AVRFVAFLMS PEMYRLGAEK VGFVPATLSG IRAASQVPSM SWLAPFVPLL QAGAALSDIQ RLTLVNRVTN FFTDMRPYFI NQVASYLRGE QDAETTQMNI YKTWKSIMKI S
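The gene and protein sequences are printed in space-separated blocks of ten characters. A minimal sketch of how the gene sequence can be cleaned up and translated is shown below. It builds the codon table for translation table 11 from the standard NCBI amino-acid string (bases ordered T, C, A, G); the helper names `clean` and `translate` are hypothetical, and only the first 30 bases and the last 15 bases of the record are used as input.

```python
# Codon table for NCBI translation table 11 (bacterial, archaeal and
# plant plastid code). Its codon-to-amino-acid assignments are the same
# as the standard code; it differs in the permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence, as printed in the record.
head = "ATGGCAACGA AGACAAGAAA TGTGGTAATT"
# Last 15 bases of the gene sequence (five codons, ending in TGA).
tail = "ATGAAAATTTCATGA"

print(translate(head))  # MATKTRNVVI
print(translate(tail))  # MKIS
```

Both results match the corresponding ends of the protein sequence in the record (`MATKTRNVVI ...` and `... MKI S`). The gene sequence is listed in coding orientation, so no reverse complement is needed here even though the gene lies on the minus strand.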