Gene Pars_2206 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_2206 
Symbol 
ID5055316 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1977488 
End bp1978564 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content59% 
IMG OID640469758 
ProducttRNA-modifying enzyme 
Protein accessionYP_001154404 
Protein GI145592402 
COG category[C] Energy production and conversion 
COG ID[COG0731] Fe-S oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.543832 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCTTGCG AGTACGAGGC TCTTGGCAGA TACCATTTGT TTGAGGGGCC CCGGATTAAG 
GTTAGGGCCT CCGCTGGCCG CGCCCTAATA GAGCGGCACT ACGGCGTTGC GGGACACGCC
ACAGTAGAGC TGTGCAAGTG GACTAAAGAC GCGTTGGAGG GGGGGAAGTC GTGCTACAAG
GTGAAGTTCT ACAACGCCCC CGCCGGCGGG TCGCACCGGT GCGTCGAGAT GAGCCCGGTG
GGCCTCGTCT GTAGCAACCG CTGCGTCTAC TGCTGGCGCC CCACCGAGGA GTTCGACGTC
TTCCTACTCG ACGAGAGGTT CTACATGGAG CCCGAGGACA TCGTCAAGGG GGTCCTCGAA
GAGAGGAGGA GGCTCCTATC CGGCTACTGG GGCCACCCGC AGGGTAAGCG GCGGGTGAGG
GAAGCCCTGG AGCCGACCCA CTGGGCTATC TCCCTGTCGG GGGAGCCCAC CATGTACCCC
AAGCTTCCCC AGCTTATAAA GCTGATAAAG TCACTCCCCA GCACGAAGTC CGTCTTCCTC
GTCACCAACG GCCAGCACCC CGACATGTTG AGGAGGCTGT GGGAGGAGGA CGCCCTCCCC
ACCCAGCTCT ACCTCTCCAC CAACGCGCCC AACAAGGAGC TCTACTACAA GATAAACGTC
CCCGTATACA ACGTCGAGAA CGCTTGGGAG AAGTGGCTGG AGTCCCTCGA CCTGTTAGCA
AAAATCCCGA CAAGGACAGT CCTCCGCATC ACCCTGATCA GAAGCCTAAA CTACGACGAC
AGGTACATAC CGGAGTTCGC CCAAATTGTC AAGAGGGGGA GCCCCCACTT CGTCGAGGTG
AAGAGCTACA TGCACCTCGG CCACTCCACC TTCCGCCTAA AGAAGGAAGA CATGCTAAGC
CACGAAGAGG TAAAGGAGTG GTCTCACAAG CTGTTAAAAG AGCTGGAGAA GATAGGCGCC
CGCTTCGTCT ACATGGACGA CGACGAGCCC AGCCGCATAG TGGTTCTCCA GAATATGGAC
AGGTATGTGG AGAGGTGGAT AGTGCCTCCA CAGGTGAAAA TCGAAACACA GAGTTAA
 
Protein sequence
MSCEYEALGR YHLFEGPRIK VRASAGRALI ERHYGVAGHA TVELCKWTKD ALEGGKSCYK 
VKFYNAPAGG SHRCVEMSPV GLVCSNRCVY CWRPTEEFDV FLLDERFYME PEDIVKGVLE
ERRRLLSGYW GHPQGKRRVR EALEPTHWAI SLSGEPTMYP KLPQLIKLIK SLPSTKSVFL
VTNGQHPDML RRLWEEDALP TQLYLSTNAP NKELYYKINV PVYNVENAWE KWLESLDLLA
KIPTRTVLRI TLIRSLNYDD RYIPEFAQIV KRGSPHFVEV KSYMHLGHST FRLKKEDMLS
HEEVKEWSHK LLKELEKIGA RFVYMDDDEP SRIVVLQNMD RYVERWIVPP QVKIETQS