Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1701 |
Symbol | |
ID | 5054484 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1535298 |
End bp | 1536605 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640469244 |
Product | extracellular ligand-binding receptor |
Protein accession | YP_001153904 |
Protein GI | 145591902 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.58174 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAATA CATTGTATAT TGTAGGGGCT GTAGTAGTGC TCATTGCGAT TTTGGGCATA CTCCTCACCA TGGGCGGCGG CCAACAACAG ACAACAACCG CGCCGCCCTC CAGCACGCAG ACGACCCAAC CCACCACCAC AACTAAGGTC TTTAAGCTGG GCGCCCTTCT GCCGCTGACA GGCGGCTTTA GTAGCTACGC CAAGTTGGCC CAATGCGCCG CGGAACTCGC GGTAGATGAA CTCAACTCCG AATATGCGAG CAAGGGATAC AAATTCGAGC TGTACGTCGA AGACACCCAG CTTGACCCCA ACGTGGCGGC CCAGAAGCTA CAAAGCCTCT ACGCCAGAGG CGTGAGGGCG GTCCACGCAG GCCTCACCAG CAGAGAGGCT TCCGGAGAGA AGCCATTCGC CGACCAGAAC CACATAATCC TCTTCAGCGC GTGGTCTACG TCGTCGCTTC TAGCCTTGCC TAACGACTGG CTGTACAGAA TCGTCGGCAC CGACGCCAAA CAGATCAGAG CCATAGGCGC CGTCCTGAAG GAGCTCGGAG TAAAGAAGGT AGCTCTTGTA TACAGGAAAG ACGCCTACGG CGAGGGCCTA TACCTGGAGC TCCAGAAAGA GGCAGAGAAG CGCGGCTTCA AGCTCGTCTC CGTCGCCGCC TACGACCCCG ACCCAAAGGC CTTTCCACAA ACAGCTCCCG AGGCTGTGAA AAAAATATCC TCAGAGGTTA AAGACTTAGT AGGCCCCGAC TTCGCCCTGG TCATCGTATC CTTTGAAGAC GACGGCTCAG TGGTGCTAAA CGCAATAGGA CAAGACCCCG TACTCTCTAA GGCCAGGCTT ATAGGCACTG AGGGCATGGC GTATTCGCCC ATACTGCTAC AAGAAGGCGG CAACGTCATG GCAAATGGGA AGATCATAGG AACCGCCAAC TGGGCTCTGG CCACCACACC GGAGTATCAG CAGTTCGCCC AGAAGTTCCG CGCTAAGTGC GGCGCCGAGC CCATAACCCC CGCCCCCCAG TCCTACGACA TTATAAAAAT GCTTGGCGAA ATCATGGCCA CGATAGGGAC TGACGACCCC GACAAGGTAC GCGCCACGCT TGAGCAGTGG GGCAAAGAAG GCCGGTACAA GGGAGCAACC GGCACGGTGC TCCTCGACGA AAACGGCGAC AGGGCCAACC CCAGCTTCAT CCTCTGGGGC GTCACAGTAA AGAACGGCAA GCCGCAGTAC ATCGACATCG GCTTCTACAA CTACGACAAA GACACCATCG AGTTCACCGA GGAGGGCAAA CAGTACTTCT ACGGGTAA
|
Protein sequence | MKNTLYIVGA VVVLIAILGI LLTMGGGQQQ TTTAPPSSTQ TTQPTTTTKV FKLGALLPLT GGFSSYAKLA QCAAELAVDE LNSEYASKGY KFELYVEDTQ LDPNVAAQKL QSLYARGVRA VHAGLTSREA SGEKPFADQN HIILFSAWST SSLLALPNDW LYRIVGTDAK QIRAIGAVLK ELGVKKVALV YRKDAYGEGL YLELQKEAEK RGFKLVSVAA YDPDPKAFPQ TAPEAVKKIS SEVKDLVGPD FALVIVSFED DGSVVLNAIG QDPVLSKARL IGTEGMAYSP ILLQEGGNVM ANGKIIGTAN WALATTPEYQ QFAQKFRAKC GAEPITPAPQ SYDIIKMLGE IMATIGTDDP DKVRATLEQW GKEGRYKGAT GTVLLDENGD RANPSFILWG VTVKNGKPQY IDIGFYNYDK DTIEFTEEGK QYFYG
|
| |