Gene Pars_0076 details


Gene Information

Locus tag: Pars_0076
Symbol:
ID: 5054375
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 71029
End bp: 72783
Gene Length: 1755 bp
Protein Length: 584 aa
Translation table: 11
GC content: 58%
IMG OID: 640467654
Product: ATP-dependent DNA ligase
Protein accession: YP_001152343
Protein GI: 145590341
COG category: [L] Replication, recombination and repair
COG ID: [COG1793] ATP-dependent DNA ligase
TIGRFAM ID: [TIGR00574] DNA ligase I, ATP-dependent (dnl1)
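
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent if the coordinates are read as 1-based and inclusive and the last codon is a stop codon. Both points are assumptions, not stated in the record. A minimal sketch of that arithmetic, with hypothetical helper names:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(71029, 72783)
print(length)                     # 1755
print(protein_length_aa(length))  # 584
```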


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.101904
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCAGTTTG GCGAGTTGGT AAAGGCGCTG GCCGCCGTCG AGTCCACGAC GCAGAGGAGC 
GTAATGGTGA AGCTTTTGGC CTCGCTCTTC AAGAAGGCGT TGCCTGAGGA GATAGACAAG
GTTATCTACT TGATTTTGGG GGATCTGAGG CCTCCTTGGG AGGGCCTTGA GCTAGGGGTG
GGGGAGAAGC TGTGTCTCAG AGCTTTGGCA AAGGCCACTG GGACATCCCA GGCGGAGCTG
GAGTCTATGT ATAAGAAGAC GGGCGACATA GGCGAGGCGG CGCGGCGGGC TCTTGCATCG
TCAAAGCGAC CGGGGCTTCT GGCTTTCGGC TCGCAGAAGC CGCTGGAGGT GTCCGAGGTC
TACGATGCCC TTATCAAGGT GGCGAGGGCG ACCGGCGAGG GGGCGCAAGA CATGAAAATC
GCCCTCCTCT CCTCGCTTTT TGCCAGGTCC TCTCCTGAGG AGGGGAAGTA CATCGCCCGT
TTTGTTGTTG GGAAGCTTAG GCTCGGCGTT GCCGACATGA CGATAATCGA GGCGCTGGCG
GATGCCTACG GCGTGAGGAA GGAGGATTTA GAGAGGGCCT ACCACGTCTA CCCCGACCTC
GGCCACCTCG CCAAGCTCGT CGCCTCCGGG AAGCCCCTGG ACGAGGTGAA GGTGACGCCT
GGCGTCCCCG TCCTCCCCAT GTTGGCGCAG AGGCTCTCCT CGTCGTCTGA GATCTTGGCT
AAACTGGGAG GCGCAGCGAT ATGTGAGTAT AAATACGACG GGGAGAGGGC GCAGATACAC
ATCTCGCAGA GCGGCGTGAG GATTTTCAGC CGGAGGCTTG AGGACATCAC CCACGCCTAC
CCGGACGTGG TCAAGGCCGT GAGGGAGGCC GTGTCGGCGA GAGAGGCCAT ACTGGAGGGG
GAGATCGTTG CGGTGGATCC AGACACCGGC GAAATGTTGC CGTTTCAAGA GCTTATGCAC
AGGAAGAGGA AGCACGAGGT GGCGGCGGCG ATGGAGATGT ACCCCACAGT CCTCTACCTC
TTCGACATCG TGTACGTCGA CGGGGAGGAT TTGACCAACG AGCCTTTGAT ATACCGCCGC
GTGAAGCTTT CTGAAATAGT CCATGAAAGC GACAAGGTGC AAATAGCGAA GTGGCAGATG
TTCGACGACC CGGATACTGT TGATGTGTTT TTCCACGAGG CAGTCTCAGT GGGCACAGAG
GGCCTGATCT GTAAGTCGCC GACGTCGGAG TACGAAATGG GAGCGAGGGG GTGGAACTGG
ATTAAGTACA AGCGCGACTA CAAGAGCGAG ATGATAGATA CGGTGGATCT AGTGGTGGTG
GGGGCCTTCT ACGGCCGTGG AAAAAGAGCC GGCCTCTACG GCGCCTTTCT CCTAGCCGCC
TACGACCCCT CCAGTGATAT GTTCTACACC GTGTGCAAAG TGGGCAGCGG CTTTACAGAC
GCAGATTTGA AGAAAATGTA TGAGATCCTC CAGCCTCTAA AAATACCTCA CAGACACCCC
CGGGTATCAT CAAAGATGGA GGCCGACGTG TGGTTTGTGC CGCAGGTGGT AATAGAGGTA
ATTGGCGCAG AGATCACTCT CTCTCCGCTA CACACTTGTT GTCTTGGCGC CGTTAGGCCT
GGGGTTGGGC TAGCGGTGAG GTTCCCCCGC TTCACTGGCC GCTATAGGAC AGACAAGGGG
CCGGAGCAAG CCACCACAGT CGCCGAGATG CTAGAGCTCT ACAAAAGGCA GAAAAAGGTG
GCTCAGCCTG AGTAG
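
The GC content field can be rechecked from the gene sequence as printed above. This is a minimal sketch that ignores the spaces and line breaks used for grouping. The function name is hypothetical, and the example runs only on the first two 10-base groups, not on the full gene.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq."""
    bases = [c for c in seq.upper() if c in "ACGT"]  # drops spaces and newlines
    if not bases:
        raise ValueError("no nucleotides found")
    gc = sum(1 for c in bases if c in "GC")
    return 100.0 * gc / len(bases)


# First two groups of the gene sequence, copied from the record.
print(gc_content("GTGCAGTTTG GCGAGTTGGT"))  # 55.0
```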
 
Protein sequence
MQFGELVKAL AAVESTTQRS VMVKLLASLF KKALPEEIDK VIYLILGDLR PPWEGLELGV 
GEKLCLRALA KATGTSQAEL ESMYKKTGDI GEAARRALAS SKRPGLLAFG SQKPLEVSEV
YDALIKVARA TGEGAQDMKI ALLSSLFARS SPEEGKYIAR FVVGKLRLGV ADMTIIEALA
DAYGVRKEDL ERAYHVYPDL GHLAKLVASG KPLDEVKVTP GVPVLPMLAQ RLSSSSEILA
KLGGAAICEY KYDGERAQIH ISQSGVRIFS RRLEDITHAY PDVVKAVREA VSAREAILEG
EIVAVDPDTG EMLPFQELMH RKRKHEVAAA MEMYPTVLYL FDIVYVDGED LTNEPLIYRR
VKLSEIVHES DKVQIAKWQM FDDPDTVDVF FHEAVSVGTE GLICKSPTSE YEMGARGWNW
IKYKRDYKSE MIDTVDLVVV GAFYGRGKRA GLYGAFLLAA YDPSSDMFYT VCKVGSGFTD
ADLKKMYEIL QPLKIPHRHP RVSSKMEADV WFVPQVVIEV IGAEITLSPL HTCCLGAVRP
GVGLAVRFPR FTGRYRTDKG PEQATTVAEM LELYKRQKKV AQPE