Gene Pars_1220 details

Gene Information

Locus tag: Pars_1220
Symbol:
ID: 5055027
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1105255
End bp: 1106598
Gene Length: 1344 bp
Protein Length: 447 aa
Translation table: 11
GC content: 60%
IMG OID: 640468767
Product: cobyrinic acid a,c-diamide synthase
Protein accession: YP_001153440
Protein GI: 145591438
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1797] Cobyrinic acid a,c-diamide synthase
TIGRFAM ID: [TIGR00379] cobyrinic acid a,c-diamide synthase
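
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that Protein Length excludes the stop codon; neither convention is stated in the record, and the function names are hypothetical helpers chosen for illustration.

```python
# Values copied from the record above.
START_BP = 1105255
END_BP = 1106598
GENE_LENGTH_BP = 1344
PROTEIN_LENGTH_AA = 447


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the assumptions above match the record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1106598 - 1105255 + 1 gives 1344 bp, and 1344 / 3 - 1 gives 447 residues, which agrees with the listed values.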


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.446908
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTTCCCC GTATAGTCAT ATCGGCCTTC AAAGGTATGT CAGGCAAGAC TCTTATTTCC 
TTGGCAGTAA TGAGGGGACT TAGGAGGAGG GGACTTAGGG TGGCGCCGTT TAAAATAGGC
CCAGACTACA TAGACCCATC GTACCACCGC TGGGCCTCCC AAGTCCCCAG CCGGAACCTA
GACGTGGTAT TGATGGGCGA GGAGGGCGTC CTCCGCAGGT TCCTCCGCTA CTCAGCGGGG
GCTGACGTGG CCGTGGTGGA GGGGGTGCTG GGGCTCTACG ACTCAGTAGA CGGCGTCTCG
GAGCTCGGCT CCACCGCGCA GGTGGCAAAG CTGCTCAAGG CCCCCGTCGT GTTGGTCCTC
AACGCCGATA GGATAAACCG CACTTTAAGA GCTGTGGTAA GGGGGCTGAA GGCCTTCGAC
CCCGCTGTGA AAATACCGGG CGTCATCTTC ACCAACGTCA CTCCAAGACA AGCCGAGAAG
CTGGTAAAGG CCCTTCCCGA GGAGGGAGTA GAGGTGCTGG GCGTTGTGCC CAAGAGCAGA
GCTGTGGCTG AGGCCTTTTC CTACCGCCAC CTGGGCCTAG TCCCCATGGC GGAGCGGAGC
GACGCGCCGA CGCTGGAGGA GGTGCTGGAC AACTACGTCG AGCCTTACAT CGATCTGGAG
AGGCTCGTGG AGATCGCGAA GTCCGCCGAG GAGCTAGGAG CCGCGGATTT ACCCAACGAT
CCGCCTCCTC GCCTAGGTTG CAGAGTGGGG GTGGTGATGG ATGGGGCTTT CAACTTCTAC
TACCCCGAGT TGCTGGAGGA GGCTGAGGCT CTCGGCGAAG TGGTCTACAC AAGCGCTGTG
AGGGACAGCG CCGTTCCGGA TGTAGACGTG TTGATCATAG GTGGCGGGTT CCCAGAGTTG
CTCGCGGAGA GGCTTGAGCG CAATAAGGCG TTTAGGAAGT CTCTTCTCTC GTATATCGAG
AGGGGCGGCA GGCTGTACGC CGAATGCGGC GGCCTCATGT ACTTGACTTC GTCTATTGTC
ATAGACCGCT CTGAGTACGA AATGGTGGGC GCCATAGACG GCGTGACCTA CATGCTGGAA
AAGCCGGTGG GCAAGGGGTA CGTCTGGGGG GAGGTGGTGG GGGAAACCCC CATAGCGCCC
CCCGGCACTA GGCTGAAGGG CCACGAGTTC CACTACAGTA AAATAGCGTT GAGGGAGAAG
GTGAGGTTAG CGATAAGGCT CGAGAGGGGC GTCGGCGTGG TGGGCGGGTG GGACGGCGTG
GTGAAAGGCA ACATGCACGC CCAGTACATG CACATACACC CCCAAACCTA CAGCGTAATT
AGGCAACTAT GCCGATCTAC GTAG
 
Protein sequence
MVPRIVISAF KGMSGKTLIS LAVMRGLRRR GLRVAPFKIG PDYIDPSYHR WASQVPSRNL 
DVVLMGEEGV LRRFLRYSAG ADVAVVEGVL GLYDSVDGVS ELGSTAQVAK LLKAPVVLVL
NADRINRTLR AVVRGLKAFD PAVKIPGVIF TNVTPRQAEK LVKALPEEGV EVLGVVPKSR
AVAEAFSYRH LGLVPMAERS DAPTLEEVLD NYVEPYIDLE RLVEIAKSAE ELGAADLPND
PPPRLGCRVG VVMDGAFNFY YPELLEEAEA LGEVVYTSAV RDSAVPDVDV LIIGGGFPEL
LAERLERNKA FRKSLLSYIE RGGRLYAECG GLMYLTSSIV IDRSEYEMVG AIDGVTYMLE
KPVGKGYVWG EVVGETPIAP PGTRLKGHEF HYSKIALREK VRLAIRLERG VGVVGGWDGV
VKGNMHAQYM HIHPQTYSVI RQLCRST
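
The gene and protein sequences above can be compared by translating the nucleotide sequence and by computing its GC percentage. The following is a minimal sketch, not the method used to produce this record: it assumes the sequence contains only unambiguous A, C, G and T, that it begins at the annotated start codon (so the first codon is written as M even when it is GTG), and that translation table 11 assigns the same amino acids to codons as the standard code. The helper names `clean`, `gc_percent` and `translate_cds` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS up to the first stop codon.

    Assumption: the sequence starts at the initiation codon, so the first
    residue is reported as M regardless of which codon is used.
    """
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    residues = []
    for index in range(0, usable, 3):
        aa = CODON_TABLE[seq[index:index + 3]]
        if aa == "*":
            break
        residues.append("M" if index == 0 else aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 nucleotides of the gene sequence listed above.
    prefix = "GTGGTTCCCC GTATAGTCAT ATCGGCCTTC"
    print(translate_cds(prefix))  # MVPRIVISAF
```

Passing the full gene sequence to `translate_cds` and comparing the result with the protein sequence, or passing it to `gc_percent` and comparing with the listed GC content, gives a quick consistency check on the record.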