Gene Pars_1417 details


Gene Information

Locus tag: Pars_1417
Symbol:
ID: 5056322
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1277233
End bp: 1278666
Gene Length: 1434 bp
Protein Length: 477 aa
Translation table: 11
GC content: 58%
IMG OID: 640468958
Product: Ferritin, Dps family protein
Protein accession: YP_001153627
Protein GI: 145591625
COG category: [C] Energy production and conversion
              [P] Inorganic ion transport and metabolism
COG ID: [COG1528] Ferritin-like protein
        [COG1804] Predicted acyl-CoA transferases/carnitine dehydratase
TIGRFAM ID:
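
The coordinate and length fields of this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based inclusive coordinates and that the gene length includes one untranslated stop codon; both are assumptions about this record's conventions, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1277233
end_bp = 1278666

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # prints "1434 477"
```

Under these assumptions the result agrees with the listed gene length of 1434 bp and protein length of 477 aa.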


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.672338
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.285216
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGAGGTG TACACACAGT GGCAGTTGTT TTAGAAATTG CACAGCTCTA CCCGGGCCCC 
CTCGCTGGGA GGTTGCTCAG GGAGTGGGGG TTCAGGGTGG TAAAGGTGGA GCCGCCCGGC
GGCGACCCCT TAAGGAGACT GAGCCCCACG CTTTACCAGT GGCTCAACGA GGGGAAGGAA
GTGGTATATC TCGACCTCCG CCTAGCTGAG GATCGAGGCA GAGTTCTGGA CTTGGCCAAG
GCGGCTAGGG CTGTGTTGAC GAGTTTTAGG AGGGGCACGG CGGAGCGGCT GGGGATCTCC
TATGAGGCGG TGAAAGAAGT CAACTCCGAC GTCTTCTACG TAGCCTTGGT GGGGTATAGG
GAGGGGGATC TTCCCGGCCA CGACATAAAC TTCGCTGGGT TGGCCGGCCT AATCGCTGAT
AAGCCCACGA TCCCGCAGTG CGTCGACGTG GCGAGCGGGC TCATGGCCGC CTTCGCCGTC
GCGGCGGCTG TGGCCTCGGG GCGCCGCGGC TATGTGGAGA TACCCATGGA GAACGTGGCG
TATATGCTCA ACCTGCTCAA CTTCGCCGCG TTGAGAGATC TTGGGGCTCT CCCCCTAGAC
GGTAGATACC CCTTCTACAA CGTCTATAAA TGCGCCAGCG GGTTGGTGGC GCTGGGGGCG
GTGGAGGAGA AGTTCTGGAG GAGGTTCTGC GATGTCATTG GCAGGGAGGA TCTAAAGGAG
CGGATGTACG ACCCCACGGC TGTGGATGAG GTGAGGAGAG AGGTGGAGCG GAGGGGTTGC
GGGGAGCTAA TCTCGGCGGC TGAAAGACTT GAAGTTCCGC TGTCTCCTGT CCGCGACATT
GTTGAGGCAT CTGGGCGTCT GCCTCCGCTT GGCGAGCTTT TTGGCGGGAG GACACAAGCG
GGGCAACGTA TAAAAGCCCA TTCCCCTTAT GAGATAGTGT CGAGGAGCGA TAAGGAACTT
GTCGAGGCTC TCAACAGGCA GTTGAACTAC GAGCTTCGAA ATGCCTACCT CTATCTCTCC
ATGGCGGCGT ATTTCGACGG GCTGAGCCTA GGAGGGTTTG CGCACTTCTT CAAAGTACAA
GCTAATGAAG AGCTTAAACA CGCCCTGAGG TTTTACAACC ACCTCGTGGA GAGGGGGTGG
AAAGTAGAGC TGTACGACAT CCCCAAGCCC AAGTCTGGCT GGGGTAGCGT GTTGGAAGCA
GTGGAGGATT TCTACAACGC AGAGGTCGAG AACACCAAGA GGATTTGGGA GCTGGTGGAT
TTGGCCAAGG CAAAGGGGGA CAAAGCCACG GAGTCTTTTC TCAAGTGGTT CGTTGACGAG
CAGGTAGAGG AGGAGAAGTT GGCGGCTGAG CTTTTGGCTA AGGTGAAGCT GGCAAAGGAC
TCGCCGGCGG CTCTCCTCAC GTTGGACAAC CTCTTAGCAC AGAGAAAAGA ATAG
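
The gene sequence is printed in blocks of ten bases, so it has to be joined before any length or GC calculation. The following is a minimal sketch of that step; the helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any database tooling. It is shown on the first printed line only, and the same functions could be applied to the full sequence to compare against the reported GC content.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence as printed in this record.
first_line = "ATGCGAGGTG TACACACAGT GGCAGTTGTT TTAGAAATTG CACAGCTCTA CCCGGGCCCC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```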
 
Protein sequence
MRGVHTVAVV LEIAQLYPGP LAGRLLREWG FRVVKVEPPG GDPLRRLSPT LYQWLNEGKE 
VVYLDLRLAE DRGRVLDLAK AARAVLTSFR RGTAERLGIS YEAVKEVNSD VFYVALVGYR
EGDLPGHDIN FAGLAGLIAD KPTIPQCVDV ASGLMAAFAV AAAVASGRRG YVEIPMENVA
YMLNLLNFAA LRDLGALPLD GRYPFYNVYK CASGLVALGA VEEKFWRRFC DVIGREDLKE
RMYDPTAVDE VRREVERRGC GELISAAERL EVPLSPVRDI VEASGRLPPL GELFGGRTQA
GQRIKAHSPY EIVSRSDKEL VEALNRQLNY ELRNAYLYLS MAAYFDGLSL GGFAHFFKVQ
ANEELKHALR FYNHLVERGW KVELYDIPKP KSGWGSVLEA VEDFYNAEVE NTKRIWELVD
LAKAKGDKAT ESFLKWFVDE QVEEEKLAAE LLAKVKLAKD SPAALLTLDN LLAQRKE
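
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the compact NCBI-style amino acid string; translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, which does not matter here because the gene begins with ATG. The function name `translate` is illustrative, and the code assumes unambiguous bases (A, C, G, T) only.

```python
from itertools import product

# Codon order TTT, TTC, TTA, TTG, TCT, ... matches the NCBI compact table layout.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"   # codons starting with T
    "LLLLPPPPHHQQRRRR"   # codons starting with C
    "IIIMTTTTNNKKSSRR"   # codons starting with A
    "VVVVAAAADDEEGGGG"   # codons starting with G
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is ignored; ambiguous bases would raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGCGAGGTG TACACACAGT GGCAGTTGTT TTAGAAATTG CACAGCTCTA CCCGGGCCCC"
print(translate(first_line))  # prints "MRGVHTVAVVLEIAQLYPGP"
```

The output matches the first twenty residues of the protein sequence listed above.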