Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1073 |
Symbol | |
ID | 5056035 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 959241 |
End bp | 960395 |
Gene Length | 1155 bp |
Protein Length | 384 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640468629 |
Product | hypothetical protein |
Protein accession | YP_001153303 |
Protein GI | 145591301 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG4591] ABC-type transport system, involved in lipoprotein release, permease component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.418174 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTTTTG AGATACTTAG CCTGGCGTGG AGGGCGCTGT GGGAGCGGCG GGGAAGGACA GTTGGGGCCA TTATCGGCGT CGTTATTGCG TTCACAGCCC TGAGCTTTGC CTTGCTTCTG GGACATACTT TTAGGTACTA CGTCACCCAG TTCTTCACCT CGACCTTTCA GCTCAACGCC CTGTACGTCG TGGGCGTAGA GTTCACCGAC ACCGACGTCT TTGCCATATC TACCATCAGC GGCGTTGATC TCGTGGTGCC GCTGGCCGCG ACGCAGGCCA GTGTTGTACT GCCGGGTGCT CCCCAGCGCA CGGTAGTAAC GCTTTACGGC GTTCCCTCCT ACGTCTTGCA CAAGGTTTTC CCGCAGTCCG CCCTGAGAGA GGGTGCATCC GTATTGGGGC CCAACCTCGC CTTGGTTGGG TACTACATTG CCTACGATTC GTCAACCGGT CTACAGAGGC TCGGCGTGGG GTCGCCGGTG GTAGTTACGC TAGGCGATAA GTCTGCCTCT TTCGTCGTCA TGGGGGTGTT GGCTTTAGGC AATATGGGAG TTTTAGACAC GGCAACGGGG GTGTATGTAG ATATTGAGAC TTTCAGAGCT TTAACAGGCG CTAGGCACTA CGCGATGCTG GTAGTCTACG CCAAGGATAC CTCCCTAGTT AGGCAGATAG AGGGGGGAAT AAGGGCCCGC TACCCCAACG CGCAGATTTT CTCGCCGCAG ACAATAATAG AATCAGTAAA CACGTTTTTT GTGCAGTTTC AGCTATTTCT CGGATTAATA AGCGGCGTCA GCACCTTGAT CACAGCGCTT TGGCTGTACG ACACTATGAC CATAAGCACG ATGCAGAGGA CTAAAGAGAT CGGCGTGCTT AGGGCGGTGG GCTTCAAGAA GAGACAAGTT ACGGTTATGT TCTTGATGGA GGCACTTATA ATCGCCGTTA TAGGCGTAAC AGTGGGCGTC CCCATCCTCT TAGCTGTGGG CTACGTGGCA CAGCTAATTG CCACCGCTTT TCTGGGCCCT AGCGGACTTA TAATCGACCC CCTAGTGTTG GCCGGCGCGG CGGCGTTGGT GGTATTGGTA AACCTGACAG GAGCCCTCCT CCCAGCGTAC AGGGCAGGCC GCATTGAGAT CGTAAACGCC TTGAGATATG AATAA
|
Protein sequence | MFFEILSLAW RALWERRGRT VGAIIGVVIA FTALSFALLL GHTFRYYVTQ FFTSTFQLNA LYVVGVEFTD TDVFAISTIS GVDLVVPLAA TQASVVLPGA PQRTVVTLYG VPSYVLHKVF PQSALREGAS VLGPNLALVG YYIAYDSSTG LQRLGVGSPV VVTLGDKSAS FVVMGVLALG NMGVLDTATG VYVDIETFRA LTGARHYAML VVYAKDTSLV RQIEGGIRAR YPNAQIFSPQ TIIESVNTFF VQFQLFLGLI SGVSTLITAL WLYDTMTIST MQRTKEIGVL RAVGFKKRQV TVMFLMEALI IAVIGVTVGV PILLAVGYVA QLIATAFLGP SGLIIDPLVL AGAAALVVLV NLTGALLPAY RAGRIEIVNA LRYE
|
| |