Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1417 |
Symbol | |
ID | 5056322 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 1277233 |
End bp | 1278666 |
Gene Length | 1434 bp |
Protein Length | 477 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640468958 |
Product | Ferritin, Dps family protein |
Protein accession | YP_001153627 |
Protein GI | 145591625 |
COG category | [C] Energy production and conversion [P] Inorganic ion transport and metabolism |
COG ID | [COG1528] Ferritin-like protein [COG1804] Predicted acyl-CoA transferases/carnitine dehydratase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.672338 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.285216 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAGGTG TACACACAGT GGCAGTTGTT TTAGAAATTG CACAGCTCTA CCCGGGCCCC CTCGCTGGGA GGTTGCTCAG GGAGTGGGGG TTCAGGGTGG TAAAGGTGGA GCCGCCCGGC GGCGACCCCT TAAGGAGACT GAGCCCCACG CTTTACCAGT GGCTCAACGA GGGGAAGGAA GTGGTATATC TCGACCTCCG CCTAGCTGAG GATCGAGGCA GAGTTCTGGA CTTGGCCAAG GCGGCTAGGG CTGTGTTGAC GAGTTTTAGG AGGGGCACGG CGGAGCGGCT GGGGATCTCC TATGAGGCGG TGAAAGAAGT CAACTCCGAC GTCTTCTACG TAGCCTTGGT GGGGTATAGG GAGGGGGATC TTCCCGGCCA CGACATAAAC TTCGCTGGGT TGGCCGGCCT AATCGCTGAT AAGCCCACGA TCCCGCAGTG CGTCGACGTG GCGAGCGGGC TCATGGCCGC CTTCGCCGTC GCGGCGGCTG TGGCCTCGGG GCGCCGCGGC TATGTGGAGA TACCCATGGA GAACGTGGCG TATATGCTCA ACCTGCTCAA CTTCGCCGCG TTGAGAGATC TTGGGGCTCT CCCCCTAGAC GGTAGATACC CCTTCTACAA CGTCTATAAA TGCGCCAGCG GGTTGGTGGC GCTGGGGGCG GTGGAGGAGA AGTTCTGGAG GAGGTTCTGC GATGTCATTG GCAGGGAGGA TCTAAAGGAG CGGATGTACG ACCCCACGGC TGTGGATGAG GTGAGGAGAG AGGTGGAGCG GAGGGGTTGC GGGGAGCTAA TCTCGGCGGC TGAAAGACTT GAAGTTCCGC TGTCTCCTGT CCGCGACATT GTTGAGGCAT CTGGGCGTCT GCCTCCGCTT GGCGAGCTTT TTGGCGGGAG GACACAAGCG GGGCAACGTA TAAAAGCCCA TTCCCCTTAT GAGATAGTGT CGAGGAGCGA TAAGGAACTT GTCGAGGCTC TCAACAGGCA GTTGAACTAC GAGCTTCGAA ATGCCTACCT CTATCTCTCC ATGGCGGCGT ATTTCGACGG GCTGAGCCTA GGAGGGTTTG CGCACTTCTT CAAAGTACAA GCTAATGAAG AGCTTAAACA CGCCCTGAGG TTTTACAACC ACCTCGTGGA GAGGGGGTGG AAAGTAGAGC TGTACGACAT CCCCAAGCCC AAGTCTGGCT GGGGTAGCGT GTTGGAAGCA GTGGAGGATT TCTACAACGC AGAGGTCGAG AACACCAAGA GGATTTGGGA GCTGGTGGAT TTGGCCAAGG CAAAGGGGGA CAAAGCCACG GAGTCTTTTC TCAAGTGGTT CGTTGACGAG CAGGTAGAGG AGGAGAAGTT GGCGGCTGAG CTTTTGGCTA AGGTGAAGCT GGCAAAGGAC TCGCCGGCGG CTCTCCTCAC GTTGGACAAC CTCTTAGCAC AGAGAAAAGA ATAG
|
Protein sequence | MRGVHTVAVV LEIAQLYPGP LAGRLLREWG FRVVKVEPPG GDPLRRLSPT LYQWLNEGKE VVYLDLRLAE DRGRVLDLAK AARAVLTSFR RGTAERLGIS YEAVKEVNSD VFYVALVGYR EGDLPGHDIN FAGLAGLIAD KPTIPQCVDV ASGLMAAFAV AAAVASGRRG YVEIPMENVA YMLNLLNFAA LRDLGALPLD GRYPFYNVYK CASGLVALGA VEEKFWRRFC DVIGREDLKE RMYDPTAVDE VRREVERRGC GELISAAERL EVPLSPVRDI VEASGRLPPL GELFGGRTQA GQRIKAHSPY EIVSRSDKEL VEALNRQLNY ELRNAYLYLS MAAYFDGLSL GGFAHFFKVQ ANEELKHALR FYNHLVERGW KVELYDIPKP KSGWGSVLEA VEDFYNAEVE NTKRIWELVD LAKAKGDKAT ESFLKWFVDE QVEEEKLAAE LLAKVKLAKD SPAALLTLDN LLAQRKE
|
| |