Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_0963 |
Symbol | |
ID | 5055444 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 854353 |
End bp | 855414 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640468519 |
Product | binding-protein-dependent transport systems inner membrane component |
Protein accession | YP_001153195 |
Protein GI | 145591193 |
COG category | [E] Amino acid transport and metabolism [P] Inorganic ion transport and metabolism |
COG ID | [COG1173] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0000206381 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGTGCTAG ATAGATTAGC CGATTTTCTC ATCTGGCTCA TCGTTAAGGC GATCTCTCTA TTTAGGAAAG ACTGGTATGT TAAAAACAGG TCTAGGGTGG AGGAGTGGCG CCTCACGCTC TACGCTCTTA ATAGGTCGCC AACTGGGATA ATAGGGCTTG TCTTATCGCT TGGGTTTGTA ATTGTGGGGG TGGTGGGGCC GTTTCTAGCG CCGTACAGCT ACAACCAATT TCTGTACCTT GAGAGACCTG AGCTGTACCT CGCGCCTCCC GGCGCCTACG GCATGCCTCT CGGCACCGAC ATATATGGGC GCGACGTGCT GAGCCTCATG TTATATGGCG CCCGGGTGTC GCTTGTGATA TCGGTTATTA CCATCGCGCT GGGTGTCCCG CTGGGAATAC TGCTTGGCCT AGTGGCGGGA TACTACGGAG GGAAGGTGGA CGAAGCTGTG ATGAGAATTA CAGATATCTT CTTGGCATTC CCCGCGCTGG TGCTGGCCCT CGCTCTTGCT GCAACTCTGC CAGGAAGGAT AAGGGAGTTT CTAATAAGCG AGCCAACCTT CGCGTCGTTC ATGGCGGCGG TGTTCGGCGT AAGCCAAGAG GACTCTATTC ACCTTGCGCC GCTGATATCG ATCTTCTTAG CGCTGATCAT TGTATGGTGG CCCACCTATG CAAGAGTGGT GAGAGGAATG GTCTTGGTAG AGAGGGAGAA GACGTACGTG GAGGCGGCTA AGGCTTTGGG GTACTCCTCG TGGAGGATAA TGACCCGGCA CATACTCCCC AACGTAATGT CCCCAATAGT AGTGCTGGTT ACTTTCGACT TCGCTACCGT GAACCTGCTG GCCGCGGGAC TGAGCTTCTT GGGCCTCGGC GCCCAGCCGC CTATTGTTGA CTGGGGCTCT CTTATAAACA TGGGAGGAAG CCGCTTCCCC ACGGCGTGGT GGCTGGTGTT CTTCCCCGGC GTTGCAATAT TTCTCACAGC CCTAGGCTGG AATCTGCTGG GCGACGCCTT GAGAGATGTC TTCGACCCCA AGTTCAGGAG GAGGATCGAG TTTAGGGTAT GA
|
Protein sequence | MVLDRLADFL IWLIVKAISL FRKDWYVKNR SRVEEWRLTL YALNRSPTGI IGLVLSLGFV IVGVVGPFLA PYSYNQFLYL ERPELYLAPP GAYGMPLGTD IYGRDVLSLM LYGARVSLVI SVITIALGVP LGILLGLVAG YYGGKVDEAV MRITDIFLAF PALVLALALA ATLPGRIREF LISEPTFASF MAAVFGVSQE DSIHLAPLIS IFLALIIVWW PTYARVVRGM VLVEREKTYV EAAKALGYSS WRIMTRHILP NVMSPIVVLV TFDFATVNLL AAGLSFLGLG AQPPIVDWGS LINMGGSRFP TAWWLVFFPG VAIFLTALGW NLLGDALRDV FDPKFRRRIE FRV
|
| |